Moved → dancinlab/supergemma4-e4b-abliterated-GGUF

This single-file repo is superseded. Get the same Q4_K_M plus the full quant ladder (Q2_K → BF16, plus imatrix-calibrated IQ quants) from the new consolidated repo:

https://huggingface.co/dancinlab/supergemma4-e4b-abliterated-GGUF

# llama.cpp / Ollama / LM Studio
ollama run hf.co/dancinlab/supergemma4-e4b-abliterated-GGUF:Q4_K_M
llama-server -hf dancinlab/supergemma4-e4b-abliterated-GGUF:Q4_K_M --jinja -c 8192

The Q4_K_M file here is functionally identical to the one in the new repo (same upstream weights, same llama.cpp quantize parameters). New downloads should target the new repo so the full quant ladder, imatrix variants, and unified model card are picked up.

Downloads last month
242
GGUF
Model size
8B params
Architecture
gemma4
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for dancinlab/supergemma4-e4b-abliterated-Q4_K_M-GGUF

Quantized
(14)
this model