asdasd's picture

asdasd

asdjghh

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

upvoted a paper about 2 months ago

BrowseComp-V^3: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents

upvoted a paper 2 months ago

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

View all activity

Organizations

upvoted a paper 7 days ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 10 days ago • 232

upvoted a paper about 2 months ago

BrowseComp-V^3: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents

Paper • 2602.12876 • Published Feb 13 • 12

upvoted a paper 2 months ago

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

Paper • 2602.01851 • Published Feb 2 • 16

updated a dataset 3 months ago

VIBE-Benchmark/BAGEL-think

Viewer • Updated Feb 1 • 1.18k • 112

updated a dataset 4 months ago

VIBE-Benchmark/Step1X-Edit-v1p2

Viewer • Updated Feb 1 • 934 • 9 • 1

published a dataset 4 months ago

VIBE-Benchmark/Step1X-Edit-v1p2

Viewer • Updated Feb 1 • 934 • 9 • 1

updated 8 datasets 4 months ago

VIBE-Benchmark/BAGEL

Viewer • Updated Feb 1 • 1.03k • 1.02k

VIBE-Benchmark/BAGEL-think

Viewer • Updated Feb 1 • 1.18k • 112

VIBE-Benchmark/UniWorld-V1

Viewer • Updated Feb 1 • 1.03k • 950

VIBE-Benchmark/OmniGen2

Viewer • Updated Feb 1 • 1.03k • 4 • 1

VIBE-Benchmark/FLUX2-dev

Viewer • Updated Feb 1 • 1.03k • 973

VIBE-Benchmark/VIBE-Qwen-Image-Edit

Viewer • Updated Feb 1 • 934 • 566

VIBE-Benchmark/Qwen-Image-Edit-2509

Viewer • Updated Feb 1 • 1.03k • 15

VIBE-Benchmark/Edit-R1-Qwen-Image-Edit-2509

Viewer • Updated Feb 1 • 1.03k • 1k

published 4 datasets 4 months ago

VIBE-Benchmark/Edit-R1-Qwen-Image-Edit-2509

Viewer • Updated Feb 1 • 1.03k • 1k

VIBE-Benchmark/Qwen-Image-Edit-2509

Viewer • Updated Feb 1 • 1.03k • 15

VIBE-Benchmark/BAGEL-think

Viewer • Updated Feb 1 • 1.18k • 112

VIBE-Benchmark/BAGEL

Viewer • Updated Feb 1 • 1.03k • 1.02k