RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published 5 days ago • 98
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 16 days ago • 477
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 10 days ago • 313
An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU Paper • 2603.16428 • Published Mar 17 • 51
Brainstacks: Cross-Domain Cognitive Capabilities via Frozen MoE-LoRA Stacks for Continual LLM Learning Paper • 2604.01152 • Published 16 days ago • 5
InCoder-32B: Code Foundation Model for Industrial Scenarios Paper • 2603.16790 • Published about 1 month ago • 308