👋
南京大学-研究员|助理教授|博导-中国科协青年人才托举工程|中科院院长特别奖 Lead MME & VITA & Awesome-MLLM
-
Lead NJU-MiG (Multimodal intelligence Group, 南京大学米格小组)
- https://bradyfu.github.io/
Pinned Loading
-
Awesome-Multimodal-Large-Language-Models
Awesome-Multimodal-Large-Language-Models Public✨✨Latest Advances on Multimodal Large Language Models
-
MME-Benchmarks/Video-MME
MME-Benchmarks/Video-MME Public✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
-
VITA-MLLM/VITA
VITA-MLLM/VITA Public✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
-
VITA-MLLM/Long-VITA
VITA-MLLM/Long-VITA Public✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy
-
VITA-MLLM/VITA-Audio
VITA-MLLM/VITA-Audio Public✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model
-
VITA-MLLM/Woodpecker
VITA-MLLM/Woodpecker Public✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.