Llama-3.2-3B-Instruct GGUF (Q4_K_M)

A personally quantised GGUF build of meta-llama/Llama-3.2-3B-Instruct, produced with llama.cpp.


🎓 About OpenMarker

This model was quantised primarily for use with OpenMarker, an open-source AI assistant that helps professors, lecturers, and other educators automatically grade and review student work.

OpenMarker uses local quantised models so that:

  • No student data is sent to external APIs
  • It runs fully offline and privately
  • Anyone can self-host it for free

👉 https://github.com/theDALEX/openmarker
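For local, offline use, a minimal sketch of running this quant with llama.cpp's command-line tool follows; the GGUF filename and the prompt are illustrative assumptions, not part of this repository:

```shell
# Sketch: run the Q4_K_M quant fully offline with llama.cpp's CLI.
# The model path and prompt below are assumptions -- substitute your own.
./llama-cli \
  -m ./Llama-3.2-3B-Instruct-Q4_K_M.gguf \
  -p "Summarise this student answer in two sentences." \
  -n 256
```

Nothing leaves the machine: llama.cpp loads the GGUF from disk and runs inference entirely on local hardware, which is what makes the privacy guarantees above possible.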


📦 Quant Details

Quant   | Size  | Description
Q4_K_M  | ~2 GB | Good quality; recommended for most use cases
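As a quick sanity check on the ~2 GB figure, the file size can be estimated from the parameter count; the ~4.85 bits/weight effective rate for Q4_K_M and the 3.21B parameter count used below are approximations, not exact specs:

```python
# Rough size estimate for a Q4_K_M GGUF (sketch; both constants are
# approximations: ~3.21e9 params for Llama-3.2-3B, ~4.85 effective
# bits/weight for Q4_K_M including quantisation metadata).
params = 3.21e9
bits_per_weight = 4.85
size_gib = params * bits_per_weight / 8 / 2**30  # bits -> bytes -> GiB
print(f"~{size_gib:.2f} GiB")  # roughly 1.8 GiB, consistent with ~2 GB
```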

Quantised using llama.cpp on Windows.
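The quantisation itself follows the standard two-step llama.cpp workflow (convert to a full-precision GGUF, then quantise); the exact file and directory names below are assumptions for illustration:

```shell
# Sketch of the usual llama.cpp quantisation pipeline (paths are assumptions).
# 1. Convert the original Hugging Face weights to an f16 GGUF.
python convert_hf_to_gguf.py ./Llama-3.2-3B-Instruct \
  --outfile Llama-3.2-3B-Instruct-f16.gguf --outtype f16
# 2. Quantise the f16 GGUF down to Q4_K_M.
./llama-quantize Llama-3.2-3B-Instruct-f16.gguf \
  Llama-3.2-3B-Instruct-Q4_K_M.gguf Q4_K_M
```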


🔗 Original Model

  • Original weights: meta-llama/Llama-3.2-3B-Instruct
  • Original license: Llama 3.2 Community License
  • Changes made: none; weights are unchanged beyond quantisation