view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 14 days ago • 849
TAPS: Task Aware Proposal Distributions for Speculative Sampling Paper • 2603.27027 • Published 19 days ago • 142
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills Paper • 2603.25158 • Published 20 days ago • 50
Reasoning Models Struggle to Control their Chains of Thought Paper • 2603.05706 • Published Mar 5 • 37
Heterogeneous Agent Collaborative Reinforcement Learning Paper • 2603.02604 • Published Mar 3 • 194
view article Article OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments +3 Feb 12 • 32
view article Article Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model Jan 1 • 19