Xiawu Zheng's picture

2

Xiawu Zheng

zhengxiawu

·

AI & ML interests

Model Compression

Recent Activity

upvoted a paper 7 days ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

upvoted a paper 29 days ago

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

authored a paper over 1 year ago

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

View all activity

Organizations

None yet

upvoted a paper 7 days ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 10 days ago • 232

upvoted a paper 29 days ago

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published 29 days ago • 248

authored 2 papers over 1 year ago

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published Jan 3, 2025 • 47

VITA: Towards Open-Source Interactive Omni Multimodal LLM

Paper • 2408.05211 • Published Aug 9, 2024 • 50