This model was converted to GGUF MXFP4 format from ai-sage/GigaChat3-10B-A1.8B using llama.cpp version 8190:

llama-quantize GigaChat3-10B-A1.8-f32.gguf GigaChat3-10B-A1.8-GGUF-MXFP4.gguf MXFP4_MOE

GGUF

Model size

11B params

Architecture

deepseek2

Hardware compatibility

We're not able to determine the quantization variants.

Model tree for deepsweet/GigaChat3-10B-A1.8-GGUF-MXFP4

Base model

Quantized

Quantized

(10)

this model

Collection including deepsweet/GigaChat3-10B-A1.8-GGUF-MXFP4