Size, mxfp4

#2
by dagb - opened

ggerganov's MXFP4 GGUF is 63.3 GB. So, "GGUF'ified", but otherwise with the original weights, if I understand it correctly.
Is it possible to derestrict the model while maintaining the MXFP4 weight format?

Maybe. This is a quantization of https://huggingface.co/ArliAI/gpt-oss-120b-Derestricted, which was only released with bf16 weights.

One possibility might be a LoRA adapter, which could be applied on top of the MXFP4 model. Unfortunately, @ArliAI did not release a LoRA adapter (and it's not clear one was used at all; it could have been a full finetune).
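Even without a released adapter, one could in principle extract an approximate LoRA from the weight difference between the derestricted bf16 checkpoint and the original gpt-oss-120b, via a truncated SVD of each weight delta. A minimal numpy sketch of that idea (illustrative only; the function name, rank choice, and the assumption that a low-rank delta captures the finetune are all mine, not anything ArliAI published):

```python
import numpy as np

def extract_lora(w_base: np.ndarray, w_ft: np.ndarray, rank: int):
    """Approximate the finetune delta (w_ft - w_base) with a rank-`rank`
    factorization delta ~= B @ A, the shape a LoRA adapter uses."""
    delta = w_ft - w_base
    u, s, vt = np.linalg.svd(delta, full_matrices=False)
    # Keep the top-`rank` singular directions; split sqrt(s) across both factors.
    b = u[:, :rank] * np.sqrt(s[:rank])          # (out_dim, rank)
    a = np.sqrt(s[:rank])[:, None] * vt[:rank]   # (rank, in_dim)
    return a, b
```

Running this per weight matrix over both bf16 checkpoints would give an adapter that could then be applied at runtime on top of the MXFP4 GGUF (llama.cpp supports loading LoRA adapters alongside a quantized base). How faithfully a low-rank delta reproduces the derestriction is an open question, since if it was a full finetune the true delta need not be low rank.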
