RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published 5 days ago • 98
Can Natural Image Autoencoders Compactly Tokenize fMRI Volumes for Long-Range Dynamics Modeling? Paper • 2604.03619 • Published 14 days ago • 8
Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning Paper • 2602.11748 • Published Feb 12 • 36
ClawBench: Can AI Agents Complete Everyday Online Tasks? Paper • 2604.08523 • Published 9 days ago • 255
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 15 days ago • 359
GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers Paper • 2604.02648 • Published 15 days ago • 45
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 16 days ago • 478
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published 21 days ago • 353
ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning Paper • 2603.28610 • Published 18 days ago • 20