view article Article Scaling Pedagogical Pre-training: From Optimal Mixing to 10 Billion Tokens Mar 6 • 5
view article Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix Nov 3, 2025 • 65