# DeepSeek-Coder-V2-Lite-Instruct-GGUF

GGUF quantizations of DeepSeek-Coder-V2-Lite-Instruct with imatrix calibration for improved accuracy at lower bit depths.

## Available Quants

| Quant | Size | Use Case |
|--------|--------|---------------------------|
| Q6_K | ~13 GB | Best quality, high VRAM |
| Q5_K_M | ~11 GB | Recommended balance |
| Q4_K_M | ~9 GB | Low VRAM / fast inference |

All quants were generated with imatrix calibration data, which gives lower perplexity than standard quants at the same bit depth.

## Usage

Load with llama.cpp, Ollama, LM Studio, or any other GGUF-compatible runtime. A minimal Python example follows.
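
A minimal sketch using the llama-cpp-python bindings (`pip install llama-cpp-python`). The repo id and quant filename pattern are assumptions based on the quants listed above; adjust them to the actual file names in this repository.

```python
from llama_cpp import Llama

# Download a quant from the Hub and load it.
# repo_id and filename pattern are assumed -- check the repo's file list.
llm = Llama.from_pretrained(
    repo_id="mad-lab-ai/deepseek-coder-v2-lite-instruct-GGUF",
    filename="*Q4_K_M.gguf",   # pick the quant that fits your VRAM
    n_ctx=4096,                # context window; raise if you have the memory
    n_gpu_layers=-1,           # offload all layers to GPU; set to 0 for CPU-only
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Write a Python function that checks if a number is prime."}],
    max_tokens=256,
)
print(out["choices"][0]["message"]["content"])
```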

## Original Model

[deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct](https://huggingface.co/deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct)
