Abiray
/

gemma-4-E4B-it-heretic-GGUF

Model card Files Files and versions

gemma-4-E4B-it-heretic-GGUF

This repository contains GGUF format quantizations of coder3101/gemma-4-E4B-it-heretic.

Quantizations Provided

The following quantization formats have been provided to accommodate various hardware setups:

Q8_0: Very high quality, near-unquantized performance.
Q6_K: High quality, excellent balance of size and performance.
Q5_K_M: Good quality, slightly smaller footprint.
Q4_K_M: Standard medium quantization, optimal for general use.
Q3_K_M: Maximum compression for tight VRAM/RAM constraints.

Downloads last month: 21,698

GGUF

Model size

8B params

Architecture

gemma4

Hardware compatibility

Log In to add your hardware

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Abiray/gemma-4-E4B-it-heretic-GGUF

Base model

google/gemma-4-E4B-it

Finetuned

coder3101/gemma-4-E4B-it-heretic

Quantized

(5)

this model

Collection including Abiray/gemma-4-E4B-it-heretic-GGUF

Gemma 4

5 items • Updated about 22 hours ago