Article Arcade-3B: SLM Optimization via Orthogonal Decoupling of Latent State Spaces Mar 15 • 1
CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models Paper • 2602.17684 • Published Feb 4 • 22
Efficient RLVR Training via Weighted Mutual Information Data Selection Paper • 2603.01907 • Published Mar 2 • 14
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning Paper • 2504.13914 • Published Apr 10, 2025 • 5
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference Paper • 2602.21548 • Published Feb 25 • 50
Article Exploring New Frontiers of LLMs: Adaptive Dual-Search Distillation (ADS) and the 30B Model Open Beta Mar 1 • 2
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI Feb 20 • 503
Kai Models Series Collection Kai Models Distilled via Adaptive Dual Search Distillation • 3 items • Updated Mar 2 • 2
Nacrith: Neural Lossless Compression via Ensemble Context Modeling and High-Precision CDF Coding Paper • 2602.19626 • Published Feb 23 • 3
Article Shattering the Memory Wall: O(1) Inference and Causal Monoid State Compression in Spartacus-1B Feb 25 • 2
Weight-sparse transformers have interpretable circuits Paper • 2511.13653 • Published Nov 17, 2025 • 2
Reasoning at the Edge (HF Preprints) Collection This collection traces the mathematical and empirical limits of machine reasoning. • 12 items • Updated Feb 28 • 1