Gemma 4 Assistant GGUF Collection Gemma 4 MTP assistant drafters as GGUF (F16/Q8_0/Q5_K_M/Q4_K_M/Q4_K_S). Speculative-decoding heads for the atomic-llama-cpp-turboquant fork. • 4 items • Updated about 19 hours ago • 2
Inference Optimized Checkpoints (with Model Optimizer) Collection A collection of generative models quantized and optimized for inference with Model Optimizer. • 65 items • Updated 7 days ago • 153
pin-chun/gemma-3-1b-it-qat-abliterated-Q4_0-GGUF Image-Text-to-Text • 1.0B • Updated Sep 13, 2025 • 33 • 1