Distilling 100B+ Models 40x Faster with TRL • Running • Featured • 33 • TRL distillation for 100B+ teachers, 40x faster
Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled • Image-Text-to-Text • 5B • Updated 8 days ago • 15.7k • 28
Jackrong/Qwen3.5-0.8B-Claude-4.6-Opus-Reasoning-Distilled • Text Generation • 0.9B • Updated Mar 6 • 1.77k • 10
Jackrong/MLX-Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled-v2-4bit • Text Generation • 0.7B • Updated 26 days ago • 1.04k • 5
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF • Image-Text-to-Text • 27B • Updated 8 days ago • 361k • 562