Running 166 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 166 Building and scaling RL environments for LLM training
Running RL Featured 79 Survival Island Game 🏝 79 Watch an AI agent survive on a procedurally generated island
Running Featured 84 Distilling 100B+ Models 40x Faster with TRL 📝 84 TRL distillation for 100B+ teachers, 40x faster
Running on CPU Upgrade 236 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 236 Explore synthetic data experiments on a virtual bookshelf
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 199k • • 2.85k
TeichAI/Qwen3-14B-Claude-4.5-Opus-High-Reasoning-Distill-GGUF Text Generation • 15B • Updated Feb 22 • 29.8k • 322