Skip to content
View BradyFU's full-sized avatar
👋
👋
  • Lead NJU-MiG (Multimodal intelligence Group, 南京大学米格小组)

Organizations

@VITA-MLLM @MME-Benchmarks

Block or report BradyFU

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Awesome-Multimodal-Large-Language-Models Awesome-Multimodal-Large-Language-Models Public

    ✨✨Latest Advances on Multimodal Large Language Models

    17.1k 1.1k

  2. MME-Benchmarks/Video-MME MME-Benchmarks/Video-MME Public

    ✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

    705 26

  3. VITA-MLLM/VITA VITA-MLLM/VITA Public

    ✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

    Python 2.5k 180

  4. VITA-MLLM/Long-VITA VITA-MLLM/Long-VITA Public

    ✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

    Python 307 29

  5. VITA-MLLM/VITA-Audio VITA-MLLM/VITA-Audio Public

    ✨✨[NeurIPS 2025] VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

    Python 669 60

  6. VITA-MLLM/Woodpecker VITA-MLLM/Woodpecker Public

    ✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

    Python 642 30