inference-optimization/Qwen3-Next-80B-A3B-Instruct-GSM8K-MTP-finetuned • 81B • Updated 24 days ago • 62
mconcat/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-NVFP4 • Text Generation • 22B • Updated 23 days ago • 10.4k • 15
avagridworkit/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 • Text Generation • 124B • Updated 21 days ago • 20
mlx-community/NVIDIA-Nemotron-3-Super-120B-A12B-6bit • Text Generation • 121B • Updated 19 days ago • 684
mlx-community/NVIDIA-Nemotron-3-Super-120B-A12B-5bit • Text Generation • 121B • Updated 19 days ago • 3.42k
mlx-community/NVIDIA-Nemotron-3-Super-120B-A12B-4bit • Text Generation • 121B • Updated 19 days ago • 1.02k
mlx-community/NVIDIA-Nemotron-3-Super-120B-A12B-8bit • Text Generation • 121B • Updated 19 days ago • 788
mconcat/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-FP8-Dynamic • Text Generation • 27B • Updated 19 days ago • 2.38k • 2
mconcat/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-AWQ-4bit • Text Generation • 29B • Updated 19 days ago • 2.34k • 2