Yuxin Zuo

yuxinzuo

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Frontier-Eng: Benchmarking Self-Evolving Agents on Real-World Engineering Tasks with Generative Optimization

authored a paper 3 days ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

authored a paper 3 days ago

How Far Can Unsupervised RLVR Scale LLM Training?

Organizations

None yet

upvoted a paper 1 day ago

Frontier-Eng: Benchmarking Self-Evolving Agents on Real-World Engineering Tasks with Generative Optimization

Paper • 2604.12290 • Published 4 days ago • 16

upvoted a paper 3 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 4 days ago • 77

upvoted 3 papers 10 days ago

Learning to Retrieve from Agent Trajectories

Paper • 2604.04949 • Published 19 days ago • 69

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Paper • 2604.06132 • Published 11 days ago • 114

MedGemma 1.5 Technical Report

Paper • 2604.05081 • Published 12 days ago • 14

upvoted 4 papers about 1 month ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 180

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 424

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 58

Qwen3-Coder-Next Technical Report

Paper • 2603.00729 • Published Feb 28 • 64

upvoted a paper about 2 months ago

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

Paper • 2602.21320 • Published Feb 24 • 12

upvoted a collection 2 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.54k

upvoted 6 papers 2 months ago

Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments

Paper • 2602.11964 • Published Feb 12 • 13

Towards Autonomous Mathematics Research

Paper • 2602.10177 • Published Feb 10 • 36

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

Code2World: A GUI World Model via Renderable Code Generation

Paper • 2602.09856 • Published Feb 10 • 202

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

Paper • 2602.06960 • Published Feb 6 • 14

Reinforcement World Model Learning for LLM-based Agents

Paper • 2602.05842 • Published Feb 5 • 27

upvoted a paper 3 months ago

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 86

upvoted 2 papers 4 months ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 27

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 161