sn97-distilled-V9 - GGUF

Static quantizations.

Available Quantizations

Approximate BPW and file size in decimal GB, ordered from highest precision to lowest.

File Approx. BPW Approx. Size (GB)
sn97-distilled-V9-bf16.gguf 16.00 8.42
sn97-distilled-V9-q8_0.gguf 8.51 4.48
sn97-distilled-V9-q6_k.gguf 6.57 3.46
sn97-distilled-V9-q5_1.gguf 6.09 3.21
sn97-distilled-V9-q5_k_m.gguf 5.83 3.07
sn97-distilled-V9-q5_0.gguf 5.67 2.99
sn97-distilled-V9-q4_k_m.gguf 5.13 2.71
sn97-distilled-V9-q4_1.gguf 5.24 2.77
sn97-distilled-V9-q4_0.gguf 4.82 2.54
Downloads last month
740
GGUF
Model size
4B params
Architecture
qwen35
Hardware compatibility
Log In to add your hardware

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for RemySkye/sn97-distilled-V9-GGUF

Quantized
(1)
this model