nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 Text Generation • 32B • Updated Mar 15 • 761k • • 336
Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets Paper • 2602.22207 • Published Feb 25 • 43
MultiverseComputingCAI/Hypernova-60B-2602 Text Generation • 59B • Updated about 1 month ago • 1.89k • 13
Jofthomas/hermes-function-calling-thinking-V1 Viewer • Updated Feb 16, 2025 • 3.57k • 574 • 74
Running on CPU Upgrade Featured 3.11k The Smol Training Playbook 📚 3.11k The secrets to building world-class LLMs