Running 95 Unlocking On-Policy Distillation for Any Model Family 📝 95 Visualize on-policy distillation for any model family
Running on Zero Agents 31 Gpt2 Multiplication Predictor 📈 31 Multiply large numbers using different reasoning methods
Running 596 Scaling test-time compute 📈 596 Run advanced search strategies to boost LLM problem solving