Gemma 4 26B-A4B-IT GPTQ Int4

GPTQ INT4 quantization of google/gemma-4-26B-A4B-it with group_size=16 for TP=4 compatibility.

Key: group_size=16

Gemma 4 has intermediate_size=2112. Tensor parallelism splits that dimension across GPUs, so with TP=4 each shard gets 2112/4 = 528 columns, and the GPTQ group size must divide the shard evenly:

  • group_size=128: 528/128 = 4.125 (FAILS: not an integer)
  • group_size=32: 528/32 = 16.5 (FAILS)
  • group_size=16: 528/16 = 33 (WORKS)
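The divisibility check above can be sketched in a few lines; the shard size of 528 follows from intermediate_size=2112 at TP=4:

```python
# Check which GPTQ group sizes evenly divide the per-GPU shard of the
# MLP intermediate dimension. Gemma 4's intermediate_size is 2112 (from
# the card above); TP=4 splits it into shards of 2112 / 4 = 528 columns.
INTERMEDIATE_SIZE = 2112
TP_DEGREE = 4

shard = INTERMEDIATE_SIZE // TP_DEGREE  # 528 columns per GPU

for group_size in (128, 32, 16):
    ok = shard % group_size == 0
    print(f"group_size={group_size}: {shard}/{group_size}={shard / group_size} "
          f"({'WORKS' if ok else 'FAILS'})")
```

Only group_size=16 yields an integer number of groups (33) per shard, which is why it was chosen here despite the extra quantization overhead of smaller groups.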

Model Details

Spec          Value
Base Model    google/gemma-4-26B-A4B-it
Architecture  Gemma4ForConditionalGeneration (MoE)
Total Params  26B (3.8B active per token)
Experts       128 routed + 1 shared
Quantization  GPTQ INT4, group_size=16, sym=True
Tool          GPTQModel v6.1.0-dev
Calibration   128 samples from allenai/c4

Protected Layers (BF16)

  • Vision tower (191 layers)
  • Router gates (30 layers)
  • lm_head

All expert MLPs (including the shared expert) are quantized to INT4 uniformly.
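The protect list amounts to name matching over the checkpoint's modules. A minimal sketch of that logic follows; the pattern strings and module paths below are illustrative, not the exact names from the Gemma 4 checkpoint:

```python
# Hypothetical sketch of the protect-list logic: any module whose name
# matches a protected pattern stays in BF16; the remaining expert MLP
# projections are quantized to INT4. Pattern strings and example module
# paths are assumptions, not taken from the real checkpoint.
PROTECTED_PATTERNS = ("vision_tower", "router", "gate", "lm_head")

def is_protected(module_name: str) -> bool:
    """Return True if the module should be kept in BF16."""
    return any(p in module_name for p in PROTECTED_PATTERNS)

for name in (
    "model.layers.0.mlp.experts.7.up_proj",  # expert MLP -> quantize
    "model.layers.0.mlp.router",             # router gate -> keep BF16
    "vision_tower.blocks.3.attn.qkv",        # vision tower -> keep BF16
    "lm_head",                               # output head -> keep BF16
):
    print(name, "-> BF16" if is_protected(name) else "-> INT4")
```

Keeping routers and the vision tower in BF16 is common practice for MoE quantization, since routing logits and vision features are sensitive to low-bit error.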

Serving
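As a hedged sketch (not confirmed by the original card), a GPTQ checkpoint sharded for TP=4 like this one would typically be launched with vLLM along these lines:

```shell
# Assumed vLLM launch: --tensor-parallel-size 4 matches the TP=4
# sharding that group_size=16 was chosen for; vLLM detects GPTQ
# quantization from the checkpoint's quantization config.
vllm serve raydelossantos/gemma-4-26B-A4B-it-GPTQ-Int4 \
    --tensor-parallel-size 4
```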

