LMMs-Lab

community

https://www.lmms-lab.com/

EvolvingLMMs-Lab

AI & ML interests

Feeling and building multimodal intelligence.

Recent Activity

THUdyh authored a paper 10 days ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Jingkang authored a paper 11 days ago

Sparse Mixture-of-Experts are Domain Generalizable Learners

Jingkang authored a paper 11 days ago

Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models

Papers

A Simple Baseline for Streaming Video Understanding

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

lmms-lab's datasets (166)

lmms-lab/RefCOCO

Viewer • Updated Mar 8, 2024 • 17.6k • 8.5k • 31

lmms-lab/Ferret-Bench

Viewer • Updated Mar 8, 2024 • 120 • 14 • 6

lmms-lab/HallusionBench

Viewer • Updated Mar 8, 2024 • 1.13k • 3.29k • 10

lmms-lab/COCO-Caption

Viewer • Updated Mar 8, 2024 • 81.3k • 4.43k • 12

lmms-lab/COCO-Caption2017

Viewer • Updated Mar 8, 2024 • 45.7k • 4.35k • 22

lmms-lab/flickr30k

Viewer • Updated Mar 8, 2024 • 31.8k • 4.91k • 18

lmms-lab/SEED-Bench-2

Viewer • Updated Mar 8, 2024 • 24.4k • 540 • 2

lmms-lab/SEED-Bench

Viewer • Updated Mar 8, 2024 • 18k • 12.3k • 4

lmms-lab/CMMMU

Viewer • Updated Mar 8, 2024 • 12k • 409 • 4

lmms-lab/MP-DocVQA

Viewer • Updated Feb 11, 2024 • 10.2k • 1.64k • 5

lmms-lab/ST-VQA

Viewer • Updated Feb 10, 2024 • 4.07k • 114 • 5

lmms-lab/RoboVQA

Preview • Updated Feb 4, 2024 • 41 • 1

lmms-lab/VQAv2

Viewer • Updated Jan 26, 2024 • 770k • 20.2k • 32

lmms-lab/DC200_CN

Viewer • Updated Jan 22, 2024 • 200 • 4

lmms-lab/DC100_EN

Preview • Updated Jan 22, 2024 • 15

lmms-lab/MME

Viewer • Updated Dec 23, 2023 • 2.37k • 36.5k • 29