NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4

by Maverobot - opened Mar 13

Mar 13

Is it possible to turn this native 4bit version to mlx format? I think its performance would be better than quantizing the full-size model.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment