OCR models
  ggml-org/GLM-OCR-GGUF 0.9B • Updated Mar 10 • 23.9k • 50
  ggml-org/DeepSeek-OCR-GGUF 3B • Updated 18 days ago • 2.44k • 7
  ggml-org/dots.ocr-GGUF 2B • Updated 7 days ago • 586 • 3
  ggml-org/Qianfan-OCR-GGUF 4B • Updated 2 days ago • 248 • 2
NVIDIA Nemotron 3 Super
Collection for Nemotron-3-Super-120B models
  ggml-org/Nemotron-3-Super-120B-GGUF 121B • Updated 27 days ago • 1.86k • 10
NVIDIA Nemotron 3
Collection for Nemotron-Nano-3-30B-A3B models
  ggml-org/Nemotron-Nano-3-30B-A3B-GGUF 32B • Updated Dec 16, 2025 • 1.87k • 14
GLM-V
  ggml-org/GLM-4.6V-Flash-GGUF 9B • Updated Jan 15 • 1.96k • 21
  ggml-org/GLM-4.6V-GGUF 107B • Updated Jan 15 • 3.14k • 7
  ggml-org/AutoGLM-Phone-9B-GGUF 9B • Updated Dec 17, 2025 • 261 • 2
  ggml-org/GLM-4.5V-GGUF 107B • Updated Feb 17 • 307 • 5
EmbeddingGemma 300M
  ggml-org/embeddinggemma-300M-GGUF 0.3B • Updated Sep 4, 2025 • 277k • 26
  ggml-org/embeddinggemma-300M-qat-q4_0-GGUF Feature Extraction • 0.3B • Updated Sep 15, 2025 • 764 • 5
  ggml-org/embeddinggemma-300m-qat-q8_0-GGUF Feature Extraction • 0.3B • Updated Sep 15, 2025 • 47.9k • 15
GPT OSS
  ggml-org/gpt-oss-120b-GGUF 117B • Updated Oct 30, 2025 • 342k • 71
  ggml-org/gpt-oss-20b-GGUF 21B • Updated Oct 30, 2025 • 86.6k • 141
Gemma 3n
  ggml-org/gemma-3n-E2B-it-GGUF 4B • Updated Aug 22, 2025 • 5.3k • 24
  ggml-org/gemma-3n-E4B-it-GGUF 7B • Updated Jun 26, 2025 • 6.47k • 20
InternVL 3 and InternVL 2.5
  ggml-org/InternVL3-1B-Instruct-GGUF 0.6B • Updated May 10, 2025 • 419 • 4
  ggml-org/InternVL3-2B-Instruct-GGUF 2B • Updated May 10, 2025 • 213 • 5
  ggml-org/InternVL3-8B-Instruct-GGUF 8B • Updated May 10, 2025 • 714 • 6
  ggml-org/InternVL3-14B-Instruct-GGUF 15B • Updated May 10, 2025 • 108 • 4
Qwen 3
  ggml-org/Qwen3-0.6B-GGUF 0.8B • Updated Sep 28, 2025 • 51.4k • 13
  ggml-org/Qwen3-1.7B-GGUF 2B • Updated Apr 28, 2025 • 7.2k • 7
  ggml-org/Qwen3-4B-GGUF 4B • Updated Apr 28, 2025 • 1.92k • 6
  ggml-org/Qwen3-8B-GGUF 8B • Updated Apr 28, 2025 • 2.66k • 5
Gemma 3
  ggml-org/gemma-3-270m-it-GGUF 0.3B • Updated Aug 15, 2025 • 1.93k • 22
  ggml-org/gemma-3-1b-it-GGUF 1.0B • Updated Mar 12, 2025 • 24.2k • 28
  ggml-org/gemma-3-4b-it-GGUF Image-Text-to-Text • 4B • Updated May 21, 2025 • 28.8k • 50
  ggml-org/gemma-3-12b-it-GGUF Image-Text-to-Text • 12B • Updated May 21, 2025 • 3.63k • 31
GGUF LoRA adapters
Adapters extracted from fine-tuned models using mergekit-extract-lora
  ggml-org/LoRA-Llama-3-Instruct-abliteration-8B-F16-GGUF 88.1M • Updated Nov 1, 2024 • 21
  ggml-org/LoRA-Qwen2.5-1.5B-Instruct-abliterated-F16-GGUF 93.6M • Updated Jan 23, 2025 • 102 • 4
  ggml-org/LoRA-Qwen2.5-3B-Instruct-abliterated-F16-GGUF 0.1B • Updated Jan 9, 2025 • 20 • 1
  ggml-org/LoRA-Qwen2.5-7B-Instruct-abliterated-v3-F16-GGUF 90.9M • Updated Jan 8, 2025 • 43 • 3
Gemma 1.1 GGUFs
  ggml-org/gemma-1.1-2b-it-Q8_0-GGUF 3B • Updated Apr 5, 2024 • 195 • 1
  ggml-org/gemma-1.1-7b-it-Q8_0-GGUF 9B • Updated Apr 5, 2024 • 10
  ggml-org/gemma-1.1-7b-it-Q4_K_M-GGUF 9B • Updated Apr 5, 2024 • 245 • 4
Gemma 4
  ggml-org/gemma-4-E2B-it-GGUF 5B • Updated about 20 hours ago • 68.2k • 55
  ggml-org/gemma-4-E4B-it-GGUF 8B • Updated about 20 hours ago • 108k • 36
  ggml-org/gemma-4-26B-A4B-it-GGUF 25B • Updated 10 days ago • 139k • 49
  ggml-org/gemma-4-31B-it-GGUF 31B • Updated 10 days ago • 45.6k • 29
Devstral 2
Collection for Devstral-Small-2-24B-Instruct-2512 models
  ggml-org/Devstral-Small-2-24B-Instruct-2512-GGUF 24B • Updated Dec 18, 2025 • 683 • 6
  ggml-org/Devstral-2-123B-Instruct-2512-GGUF 125B • Updated Dec 19, 2025 • 174 • 2
Multimodal GGUFs
Vision and audio models compatible with llama-server and llama-mtmd-cli
  GLM-V Collection 4 items • Updated Dec 17, 2025 • 13
  Ministral 3 Collection 6 items • Updated Dec 16, 2025 • 3
  Gemma 3 Collection 10 items • Updated Dec 16, 2025 • 22
  Kimi-VL Collection 1 item • Updated Mar 2 • 1
Ministral 3
  ggml-org/Ministral-3-3B-Reasoning-2512-GGUF Image-Text-to-Text • 3B • Updated Dec 2, 2025 • 448 • 2
  ggml-org/Ministral-3-8B-Reasoning-2512-GGUF Image-Text-to-Text • 8B • Updated Dec 2, 2025 • 365 • 1
  ggml-org/Ministral-3-14B-Reasoning-2512-GGUF Image-Text-to-Text • 14B • Updated Dec 2, 2025 • 337 • 3
  ggml-org/Ministral-3-3B-Instruct-2512-GGUF Image-Text-to-Text • 3B • Updated Dec 2, 2025 • 708 • 3
Gemma 3-270m
Collection of models for Gemma 3-270m
  ggml-org/gemma-3-270m-GGUF 0.3B • Updated Aug 14, 2025 • 505 • 20
  ggml-org/gemma-3-270m-it-GGUF 0.3B • Updated Aug 15, 2025 • 1.93k • 22
  ggml-org/gemma-3-270m-qat-GGUF 0.3B • Updated Aug 14, 2025 • 11.7k • 9
  ggml-org/gemma-3-270m-it-qat-GGUF 0.3B • Updated Aug 15, 2025 • 3.44k • 12
VAD
Voice Activity Detection (VAD) models for whisper.cpp.
  ggml-org/whisper-vad Updated Nov 17, 2025 • 16
Qwen 2 VL and Qwen 2.5 VL
  ggml-org/Qwen2.5-VL-3B-Instruct-GGUF 3B • Updated Apr 30, 2025 • 5.46k • 6
  ggml-org/Qwen2.5-VL-7B-Instruct-GGUF 8B • Updated Apr 30, 2025 • 5.4k • 10
  ggml-org/Qwen2.5-VL-32B-Instruct-GGUF 33B • Updated May 15, 2025 • 294 • 5
  ggml-org/Qwen2-VL-2B-Instruct-GGUF 2B • Updated Apr 30, 2025 • 1.25k • 2
SmolVLM GGUF
  ggml-org/SmolVLM2-2.2B-Instruct-GGUF 2B • Updated Apr 30, 2025 • 23.4k • 31
  ggml-org/SmolVLM2-500M-Video-Instruct-GGUF 0.4B • Updated Apr 30, 2025 • 24.2k • 17
  ggml-org/SmolVLM2-256M-Video-Instruct-GGUF 0.2B • Updated Apr 30, 2025 • 12.2k • 9
  ggml-org/SmolVLM-Instruct-GGUF 2B • Updated Apr 30, 2025 • 15.5k • 9
llama.cpp presets
Models that are used for presets in llama.cpp.
  ggml-org/gte-small-Q8_0-GGUF Sentence Similarity • 33.2M • Updated Feb 6, 2025 • 43 • 2
  ggml-org/bge-small-en-v1.5-Q8_0-GGUF Feature Extraction • 33.2M • Updated Feb 6, 2025 • 347 • 6
  ggml-org/e5-small-v2-Q8_0-GGUF Sentence Similarity • 33.2M • Updated Feb 6, 2025 • 65
llama.vim
Recommended models for the llama.vim and llama.vscode plugins
  ggml-org/Qwen2.5-Coder-0.5B-Q8_0-GGUF Text Generation • 0.5B • Updated Jan 31, 2025 • 109k • 8
  ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF Text Generation • 2B • Updated Oct 28, 2024 • 4.09k • 17
  ggml-org/Qwen2.5-Coder-3B-Q8_0-GGUF Text Generation • 3B • Updated Nov 26, 2024 • 2.68k • 8
  ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF Text Generation • 8B • Updated Oct 28, 2024 • 3.16k • 9