Hugging Face
6

shawn

ares0728

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 2 months ago

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Paper • 2602.02185 • Published Feb 2 • 118

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published Jan 29 • 155
upvoted a paper 3 months ago

AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

Paper • 2601.18631 • Published Jan 26 • 48
upvoted a paper 6 months ago

ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping

Paper • 2510.08457 • Published Oct 9, 2025 • 13
upvoted a collection 6 months ago

ARES

Collection
🌴ARES is an open-source framework for adaptive multimodal reasoning, using difficulty-aware training and entropy-shaped policy optimization. • 4 items • Updated Mar 2 • 2
upvoted a paper 10 months ago

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Paper • 2506.04207 • Published Jun 4, 2025 • 48