Kaixin Ma

kaixinm
https://mayer123.github.io/
  • Mayer123

AI & ML interests

NLP, ML

Organizations

None yet

upvoted 2 papers 3 months ago

SO-Bench: A Structural Output Evaluation of Multimodal LLMs

Paper • 2511.21750 • Published Nov 23, 2025 • 6

NarrativeTrack: Evaluating Video Language Models Beyond the Frame

Paper • 2601.01095 • Published Jan 3 • 8
upvoted 4 papers over 1 year ago

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Paper • 2410.10813 • Published Oct 14, 2024 • 16

LEOPARD: A Vision Language Model For Text-Rich Multi-Image Tasks

Paper • 2410.01744 • Published Oct 2, 2024 • 27

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12, 2024 • 66

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 133
upvoted a paper over 2 years ago

LASER: LLM Agent with State-Space Exploration for Web Navigation

Paper • 2309.08172 • Published Sep 15, 2023 • 14