This model was converted to MLX format from ai-sage/GigaChat3.1-10B-A1.8B using mlx-lm v0.31.1.

Multi-Token Prediction (MTP) had to be disabled (`"num_nextn_predict_layers": 0`) and the associated layers (`model.layers.26.*`) removed from the checkpoint.
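For reference, here is a minimal sketch of that preparation step. It assumes the checkpoint has been downloaded to a local `GigaChat3.1-10B-A1.8B` directory and, for simplicity, that the weights sit in a single `model.safetensors` file; the actual repo is sharded, so in practice the same filter would run per shard and the weight index would need updating as well.

```python
import json
from pathlib import Path

from safetensors.torch import load_file, save_file

repo = Path("GigaChat3.1-10B-A1.8B")  # assumed local download location

# 1. Disable Multi-Token Prediction in the config.
config_path = repo / "config.json"
config = json.loads(config_path.read_text())
config["num_nextn_predict_layers"] = 0
config_path.write_text(json.dumps(config, indent=2))

# 2. Drop the MTP weights (model.layers.26.*) from the checkpoint.
#    Assumes a single-file checkpoint; the real repo is sharded.
weights_path = repo / "model.safetensors"
tensors = load_file(weights_path)
kept = {name: t for name, t in tensors.items()
        if not name.startswith("model.layers.26.")}
save_file(kept, weights_path)
```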

Thanks to RockTalk/GigaChat3.1-10B-A1.8B-MLX-4bit for the tip.

```sh
mlx_lm.convert --hf-path ai-sage/GigaChat3.1-10B-A1.8B --mlx-path deepsweet/GigaChat3.1-10B-A1.8B-MLX-MXFP4 --quantize --q-mode mxfp4 --q-group-size 32
```
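
Once converted, the result can be sanity-checked with the mlx-lm Python API; the repo path below is the output of the command above, and the prompt is just an illustrative example.

```python
from mlx_lm import load, generate

model, tokenizer = load("deepsweet/GigaChat3.1-10B-A1.8B-MLX-MXFP4")

# Wrap the question in the model's chat template before generating.
prompt = tokenizer.apply_chat_template(
    [{"role": "user", "content": "Hello! Who are you?"}],
    add_generation_prompt=True,
    tokenize=False,
)
print(generate(model, tokenizer, prompt=prompt, max_tokens=100))
```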