Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 13 days ago • 469
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 7 days ago • 309
An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU Paper • 2603.16428 • Published 29 days ago • 51
Brainstacks: Cross-Domain Cognitive Capabilities via Frozen MoE-LoRA Stacks for Continual LLM Learning Paper • 2604.01152 • Published 14 days ago • 5
InCoder-32B: Code Foundation Model for Industrial Scenarios Paper • 2603.16790 • Published 29 days ago • 308