add quantization_config.ignore=1 (downstream audit fix) 06b09ed verified mattbucci committed 10 days ago
Replace INT4 vision with BF16 originals (config.json) 1503276 verified mattbucci committed 24 days ago
Gemma 4 26B MoE AWQ: forced-routing GPTQ calibration for all 128 experts, 30 tok/s on RDNA4 5b529db verified mattbucci committed 24 days ago