Chao Zhang's picture

Chao Zhang

cz277

·

https://mi.eng.cam.ac.uk/~cz277

AI & ML interests

Spoken language processing, multimodal intelligence, medical AI, cognitive neuroscience

Recent Activity

upvoted a paper 2 days ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

updated a collection 25 days ago

Speech & Audio Processing

updated a collection 25 days ago

Speech & Audio Processing

View all activity

Organizations

upvoted a paper 2 days ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published 20 days ago • 131

upvoted a paper about 1 year ago

video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model

Paper • 2502.11775 • Published Feb 17, 2025 • 9

upvoted 2 papers over 2 years ago

SALMONN: Towards Generic Hearing Abilities for Large Language Models

Paper • 2310.13289 • Published Oct 20, 2023 • 17

ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of Video

Paper • 2401.05314 • Published Jan 10, 2024 • 12