Jackrong/MLX-Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-8bit Text Generation • 27B • Updated 26 days ago • 6.46k • 13
inferencerlabs/NVIDIA-Nemotron-3-Super-120B-A12B-MLX-4.5bit Text Generation • 121B • Updated Mar 14 • 4.38k • 6
unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF Text Generation • 121B • Updated 27 days ago • 82k • 112
ParoQuant Collection Pairwise Rotation Quantization for Efficient Reasoning LLM Inference • 18 items • Updated 11 days ago • 18