Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated 9 days ago • 589k • 2.64k
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9, 2025 • 134
Running on CPU Upgrade Featured 3.1k The Smol Training Playbook 📚 3.1k The secrets to building world-class LLMs
Running 94 Unlocking On-Policy Distillation for Any Model Family 📝 94 Visualize on-policy distillation for any model family
intfloat/multilingual-e5-large-instruct Feature Extraction • 0.6B • Updated Jul 10, 2025 • 1.41M • • 614