nvidia/Llama-3_1-Nemotron-Ultra-253B-v1-FP8 Text Generation • 253B • Updated Oct 15, 2025 • 4.49k • 12
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8 Text Generation • 50B • Updated Oct 15, 2025 • 56.2k • 27
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 Text Generation • 26B • Updated Nov 27, 2025 • 13.7k • 17
QuantFactory/Llama-3.1-Nemotron-Nano-8B-v1-GGUF Text Generation • 8B • Updated Mar 23, 2025 • 106 • 4
unsloth/Llama-3_1-Nemotron-Ultra-253B-v1-GGUF Text Generation • 253B • Updated May 26, 2025 • 2.94k • 9
ArtusDev/nvidia_Llama-3_1-Nemotron-Ultra-253B-v1_EXL3_1.35bpw_H6 Text Generation • 24B • Updated May 14, 2025 • 8
Panchovix/Llama-3_1-Nemotron-Ultra-253B-v1-3.6bpw-h6-exl3 Text Generation • 59B • Updated Apr 27, 2025 • 7
Panchovix/Llama-3_1-Nemotron-Ultra-253B-v1-3.25bpw-h6-exl3 Text Generation • 54B • Updated Apr 27, 2025 • 6
Panchovix/Llama-3_1-Nemotron-Ultra-253B-v1-3.45bpw-h6-exl3 Text Generation • 57B • Updated Apr 28, 2025 • 9 • 1
ArtusDev/nvidia_Llama-3_1-Nemotron-Ultra-253B-v1_EXL3_3.0bpw_H6 Text Generation • 50B • Updated May 14, 2025 • 7
unsloth/Llama-3.1-Nemotron-Nano-4B-v1.1-unsloth-bnb-4bit Text Generation • 5B • Updated May 24, 2025 • 382