SmolLM 🤏 SmolLM models, datasets and demos HuggingFaceTB/SmolLM3-3B Text Generation • 3B • Updated Sep 10, 2025 • 1.06M • 930 HuggingFaceTB/SmolLM2-1.7B-Instruct Text Generation • Updated Apr 21, 2025 • 166k • 728 HuggingFaceTB/SmolVLM-Instruct Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 29.7k • 583 HuggingFaceTB/SmolLM2-360M-Instruct Text Generation • Updated Sep 22, 2025 • 431k • 185
📚 Filtering the web with LLMs HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 364k • 1.03k HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 8.23k • • 210 HuggingFaceFW/ablation-model-fineweb-edu Text Generation • 2B • Updated Jun 11, 2024 • 463 • 21 math-ai/AutoMathText Viewer • Updated Jul 16, 2025 • 7.89M • 8.11k • 185
HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 8.23k • • 210
✨ Code Generation Code generation models and datassets! bigcode/starcoder2-15b Text Generation • Updated Jun 5, 2024 • 7.69k • 669 bigcode/the-stack Viewer • Updated Apr 13, 2023 • 546M • 11.8k • 979 bigcode/starcoder2-3b Text Generation • 3B • Updated Mar 4, 2024 • 84.4k • 217 bigcode/starcoder Text Generation • 16B • Updated Oct 8, 2024 • 10.7k • 2.94k
Instruct datasets QuixiAI/SystemChat-2.0 Viewer • Updated Jun 15, 2025 • 141k • 299 • 75 arcee-ai/infini-instruct-top-500k Viewer • Updated Jun 30, 2024 • 500k • 22 • 6 arcee-ai/The-Tome Viewer • Updated Aug 15, 2024 • 1.75M • 264 • 105 teknium/OpenHermes-2.5 Viewer • Updated Apr 15, 2024 • 1M • 24.7k • 809
🌌 Synthetic textbooks Synthetically generated textbooks HuggingFaceTB/cosmopedia Viewer • Updated Aug 12, 2024 • 31.1M • 14.1k • 683 Locutusque/UltraTextbooks Viewer • Updated Feb 2, 2024 • 5.52M • 317 • 198 microsoft/phi-2 Text Generation • 3B • Updated Dec 8, 2025 • 1.14M • 3.45k HuggingFaceTB/cosmo-1b Text Generation • 2B • Updated Jul 8, 2024 • 217 • 134
SmolLM 🤏 SmolLM models, datasets and demos HuggingFaceTB/SmolLM3-3B Text Generation • 3B • Updated Sep 10, 2025 • 1.06M • 930 HuggingFaceTB/SmolLM2-1.7B-Instruct Text Generation • Updated Apr 21, 2025 • 166k • 728 HuggingFaceTB/SmolVLM-Instruct Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 29.7k • 583 HuggingFaceTB/SmolLM2-360M-Instruct Text Generation • Updated Sep 22, 2025 • 431k • 185
Instruct datasets QuixiAI/SystemChat-2.0 Viewer • Updated Jun 15, 2025 • 141k • 299 • 75 arcee-ai/infini-instruct-top-500k Viewer • Updated Jun 30, 2024 • 500k • 22 • 6 arcee-ai/The-Tome Viewer • Updated Aug 15, 2024 • 1.75M • 264 • 105 teknium/OpenHermes-2.5 Viewer • Updated Apr 15, 2024 • 1M • 24.7k • 809
📚 Filtering the web with LLMs HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 364k • 1.03k HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 8.23k • • 210 HuggingFaceFW/ablation-model-fineweb-edu Text Generation • 2B • Updated Jun 11, 2024 • 463 • 21 math-ai/AutoMathText Viewer • Updated Jul 16, 2025 • 7.89M • 8.11k • 185
HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 8.23k • • 210
🌌 Synthetic textbooks Synthetically generated textbooks HuggingFaceTB/cosmopedia Viewer • Updated Aug 12, 2024 • 31.1M • 14.1k • 683 Locutusque/UltraTextbooks Viewer • Updated Feb 2, 2024 • 5.52M • 317 • 198 microsoft/phi-2 Text Generation • 3B • Updated Dec 8, 2025 • 1.14M • 3.45k HuggingFaceTB/cosmo-1b Text Generation • 2B • Updated Jul 8, 2024 • 217 • 134
✨ Code Generation Code generation models and datassets! bigcode/starcoder2-15b Text Generation • Updated Jun 5, 2024 • 7.69k • 669 bigcode/the-stack Viewer • Updated Apr 13, 2023 • 546M • 11.8k • 979 bigcode/starcoder2-3b Text Generation • 3B • Updated Mar 4, 2024 • 84.4k • 217 bigcode/starcoder Text Generation • 16B • Updated Oct 8, 2024 • 10.7k • 2.94k