The Depth Ceiling: On the Limits of Large Language Models in Discovering Latent Planning • Paper • 2604.06427 • Published 6 days ago • 8
Article • How I contributed a new model to the Transformers library using Codex • 14 days ago • 45
Reasoning Shift: How Context Silently Shortens LLM Reasoning • Paper • 2604.01161 • Published 12 days ago • 30
mistralai/Voxtral-Mini-4B-Realtime-2602 • Automatic Speech Recognition • 4B • Updated Mar 11 • 878k • 812
Running • Open Source AI Year In Review 2025 • 28 • Reviewing Progress of the Open Source Ecosystem
Article • Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand • Dec 4, 2025 • 68