Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 16 days ago • 479
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 15 days ago • 361
An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU Paper • 2603.16428 • Published Mar 17 • 51
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling Paper • 2604.06916 • Published 10 days ago • 34
arithmetic-circuit-overloading/Llama-3.3-70B-Instruct-v2-3d-5M-500K-0.1-reverse-padzero-99-64D-3L-4H-256I Text Generation • 544k • Updated 9 days ago • 1.65k • 1
GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers Paper • 2604.02648 • Published 15 days ago • 45
Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization Paper • 2603.28342 • Published 19 days ago • 26