Chen Yang

kiiic
https://scholar.google.com/citations?user=m0UcLMkAAAAJ&hl=zh-CN
  • Benioh

AI & ML interests

Speech & MLLM

Recent Activity

new activity about 17 hours ago
OpenMOSS-Team/MOSS-Audio-4B-Thinking:Japanese/Korean Support && Recommend Models for Subtitle Transcription with Timestamps
liked a model 1 day ago
OpenMOSS-Team/MOSS-Audio-8B-Instruct
liked a model 1 day ago
OpenMOSS-Team/MOSS-Audio-8B-Thinking

Organizations

Shanghai Jiao Tong University, OpenMOSS

upvoted a collection 2 days ago

MOSS-Audio

Collection
An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 4 items • Updated 1 day ago • 11
upvoted a paper 30 days ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 423
upvoted 3 papers 3 months ago

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Paper • 2505.13032 • Published May 19, 2025 • 4

MNV-17: A High-Quality Performative Mandarin Dataset for Nonverbal Vocalization Recognition in Speech

Paper • 2509.18196 • Published Sep 19, 2025 • 2

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published Jan 4 • 59