nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation • 124B • Updated 3 days ago • 1.02M • 229
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated 3 days ago • 500k • 328
Qwen/Qwen3.5-397B-A17B Image-Text-to-Text • 403B • Updated about 1 month ago • 782k • 1.44k
Article KV Caching Explained: Optimizing Transformer Inference Efficiency Jan 30, 2025 • 293
Article Transformers v5: Simple model definitions powering the AI ecosystem Dec 1, 2025 • 307
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 Text Generation • 32B • Updated about 1 month ago • 1.57M • 712
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 8 days ago • 267