add quantization_config.ignore=1 (downstream audit fix) 06b09ed verified mattbucci committed on 9 days ago
Replace INT4 vision with BF16 originals (model-00001-of-00001.safetensors) a3f58d6 verified mattbucci committed on 23 days ago
Replace INT4 vision with BF16 originals (config.json) 1503276 verified mattbucci committed on 23 days ago
Replace INT4 vision with BF16 originals (model.safetensors.index.json) 1eb5076 verified mattbucci committed on 23 days ago
Replace INT4 vision with BF16 originals (model-vision.safetensors) 66a15c0 verified mattbucci committed on 23 days ago
Update vision status: untestable due to server crash d47e216 verified mattbucci committed on 23 days ago
Gemma 4 26B MoE AWQ: forced-routing GPTQ calibration for all 128 experts, 30 tok/s on RDNA4 5b529db verified mattbucci committed on 23 days ago