n-Arno/Qwen3_4B-trained-GGUF


Q4_K_M quantized version of this model, produced with llama.cpp.

The importance matrix (iMatrix) file used for quantization comes from this repository.

Many thanks to Felldude for his work on this.
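
For readers who want to reproduce a similar quantization, here is a minimal sketch of how an iMatrix-guided Q4_K_M run with llama.cpp's `llama-quantize` tool could be assembled. The card does not state the exact command that was used, so this is an assumption. The file names (`model-f16.gguf`, `imatrix.dat`, `model-Q4_K_M.gguf`) are hypothetical placeholders. The helper only builds the argument list and does not execute anything.

```python
import shlex


def build_quantize_command(
    source_gguf: str,
    imatrix_file: str,
    output_gguf: str,
    quant_type: str = "Q4_K_M",
    binary: str = "llama-quantize",
) -> list[str]:
    """Return the argument list for an iMatrix-guided llama.cpp quantization.

    Assumed argument order: llama-quantize --imatrix FILE INPUT OUTPUT TYPE
    """
    return [
        binary,
        "--imatrix", imatrix_file,  # importance matrix guiding the quantization
        source_gguf,                # full-precision GGUF input
        output_gguf,                # quantized GGUF output
        quant_type,                 # target quantization type
    ]


if __name__ == "__main__":
    # All three file names below are hypothetical placeholders.
    cmd = build_quantize_command(
        "model-f16.gguf",
        "imatrix.dat",
        "model-Q4_K_M.gguf",
    )
    # Print the command for inspection instead of running it.
    print(shlex.join(cmd))
    # To run it for real (requires llama.cpp to be installed):
    #   import subprocess; subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string avoids shell-quoting problems if any of the paths contain spaces.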

Downloads last month: 22

Format: GGUF

Model size: 4B params

Architecture: qwen3

Quantization: 4-bit


Model tree for n-Arno/Qwen3_4B-trained-GGUF

Base model: Qwen/Qwen3-4B-Base

Finetuned

Quantized (210): this model