Gemma-4-31B-it EXL3 6.0bpw

Quantized version of google/gemma-4-31B-it using ExLlamaV3 EXL3 format.

Property Value
Original size 62 GB (BF16)
Quantized size 25 GB (6.0 bpw)
Format EXL3 (QTIP-based)
Compression 2.5x

Requirements

Credits

Downloads last month
43
Safetensors
Model size
14B params
Tensor type
F16
I16
BF16
F32
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support

Model tree for HaoweiShen/Gemma-4-31B-it-EXL3-6.0bpw

Quantized
(165)
this model