Kaixin Ma's picture

Kaixin Ma

kaixinm

·

https://mayer123.github.io/

Mayer123

AI & ML interests

NLP, ML

Organizations

None yet

upvoted 2 papers 3 months ago

SO-Bench: A Structural Output Evaluation of Multimodal LLMs

Paper • 2511.21750 • Published Nov 23, 2025 • 6

NarrativeTrack: Evaluating Video Language Models Beyond the Frame

Paper • 2601.01095 • Published Jan 3 • 8

upvoted 4 papers over 1 year ago

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Paper • 2410.10813 • Published Oct 14, 2024 • 16

LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks

Paper • 2410.01744 • Published Oct 2, 2024 • 27

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12, 2024 • 66

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 133

upvoted a paper over 2 years ago

LASER: LLM Agent with State-Space Exploration for Web Navigation

Paper • 2309.08172 • Published Sep 15, 2023 • 14