Configuration Parsing Warning:In config.json: "quantization_config.bits" must be an integer

GLM-4.5-Air-exl3-5.5bpw

Fits into 96gb vram with 65k+ context

Downloads last month
5
Safetensors
Model size
37B params
Tensor type
F16
·
I16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Minthy/GLM-4.5-Air-exl3-5.5bpw

Quantized
(60)
this model