Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
PeterPP's picture
8

PeterPP

ZhSh1230

AI & ML interests

None yet

Organizations

None yet

upvoted 5 papers 2 months ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Paper • 2602.11748 • Published Feb 12 • 36

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Paper • 2602.08321 • Published Feb 9 • 43

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Paper • 2602.01734 • Published Feb 2 • 32

A2Eval: Agentic and Automated Evaluation for Embodied Brain

Paper • 2602.01640 • Published Feb 2 • 8
upvoted a paper 5 months ago

Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models

Paper • 2511.23319 • Published Nov 28, 2025 • 24
upvoted a paper 6 months ago

Knocking-Heads Attention

Paper • 2510.23052 • Published Oct 27, 2025 • 30
upvoted a paper 7 months ago

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 148
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs