gemma-4-E4B-it-heretic-GGUF

This repository contains GGUF-format quantizations of coder3101/gemma-4-E4B-it-heretic.
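
To fetch one of these files programmatically, a minimal sketch using the huggingface_hub Python package might look like the following. The repository ID is the one shown on the model page. The exact GGUF filenames are not listed in this README, so the sketch looks them up at run time instead of assuming a name. The helper names find_gguf_file and download_quant are hypothetical, and so is the convention that the quantization tag appears in the filename.

```python
from typing import Iterable, Optional

REPO_ID = "Abiray/gemma-4-E4B-it-heretic-GGUF"  # repository ID as shown on the model page


def find_gguf_file(filenames: Iterable[str], quant: str) -> Optional[str]:
    """Return the first .gguf filename containing the quant tag (case-insensitive).

    Assumes the tag (e.g. "Q4_K_M") appears in the filename, which is a common
    convention but is not confirmed for this repository.
    """
    tag = quant.lower()
    for name in sorted(filenames):
        lower = name.lower()
        if lower.endswith(".gguf") and tag in lower:
            return name
    return None


def download_quant(quant: str, repo_id: str = REPO_ID) -> str:
    """Download the GGUF file for `quant` and return its local cache path."""
    # Imported lazily so find_gguf_file works without the package installed.
    from huggingface_hub import hf_hub_download, list_repo_files

    filename = find_gguf_file(list_repo_files(repo_id), quant)
    if filename is None:
        raise FileNotFoundError(f"no .gguf file matching {quant!r} in {repo_id}")
    return hf_hub_download(repo_id=repo_id, filename=filename)
```

Calling download_quant with one of the quantization tags listed in this README needs network access and returns a path that can be passed to a GGUF-compatible runtime such as llama.cpp, provided the runtime build supports the gemma4 architecture.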

Quantizations Provided

The following quantization formats are provided to suit different hardware setups:

  • Q8_0: Very high quality, close to the unquantized model, at the largest file size.
  • Q6_K: High quality with a smaller file than Q8_0.
  • Q5_K_M: Good quality with a smaller footprint than Q6_K.
  • Q4_K_M: Medium quantization, a common default for general use.
  • Q3_K_M: The smallest file provided, for tight VRAM/RAM constraints, at the greatest cost in quality.
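
As a rough guide to choosing among these formats, the sketch below estimates file size from bits per weight and picks the highest-quality format that fits a memory budget. The 8B parameter count is the figure reported on the model page. The bits-per-weight values are assumptions: Q8_0 and Q6_K follow from the llama.cpp block layouts, while the K_M figures are approximate averages that vary by model. The function names and the overhead_gb allowance are hypothetical. Actual file sizes in this repository may differ, since not every tensor is necessarily stored in the same type.

```python
from typing import Optional

PARAMS = 8e9  # "8B params" as reported on the model page

# Approximate bits per weight, ordered from highest to lowest quality.
# The K_M values are rough averages (assumption), not measured for this model.
BITS_PER_WEIGHT = {
    "Q8_0": 8.5,
    "Q6_K": 6.5625,
    "Q5_K_M": 5.7,
    "Q4_K_M": 4.85,
    "Q3_K_M": 3.9,
}


def estimated_gb(quant: str, params: float = PARAMS) -> float:
    """Rough file size in GB (10^9 bytes): params * bits / 8."""
    return params * BITS_PER_WEIGHT[quant] / 8 / 1e9


def pick_quant(budget_gb: float, overhead_gb: float = 1.5) -> Optional[str]:
    """Return the highest-quality quant whose estimate plus overhead fits.

    overhead_gb is a hypothetical allowance for KV cache and runtime buffers.
    """
    for quant in BITS_PER_WEIGHT:  # dicts preserve insertion order: best first
        if estimated_gb(quant) + overhead_gb <= budget_gb:
            return quant
    return None
```

Under these assumed figures, pick_quant(8.0) returns "Q5_K_M", because the Q6_K estimate plus overhead slightly exceeds 8 GB. Treat the result as a starting point and check it against the real file sizes and your context length.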

Format: GGUF
Model size: 8B params
Architecture: gemma4
Repository: Abiray/gemma-4-E4B-it-heretic-GGUF