deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24, 2025 • 716k • • 1.47k
Running 3.78k The Ultra-Scale Playbook 🌌 3.78k The ultimate guide to training LLM on large GPU Clusters
oliverguhr/fullstop-punctuation-multilang-large Token Classification • Updated Nov 16, 2023 • 317k • • 174
Running Featured 1.33k FineWeb: decanting the web for the finest text data at scale 🍷 1.33k Read a detailed overview of the FineWeb web‑scale text dataset
Running on L4 1.18k ControlNet V1.1 📉 1.18k Generate edited images using edge, pose, and other guides