
Qwen3.5-0.8B GGUF

A Q4_K_M-quantized GGUF build of Qwen3.5-0.8B, for use with llama.cpp and compatible runtimes.

Model Details

  • Base Model: Qwen/Qwen3.5-0.8B
  • Architecture: qwen35
  • Quantization: Q4_K_M
  • Format: GGUF
  • Size: 504 MB
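Assuming the file is hosted on the Hugging Face Hub, it can be fetched with huggingface-cli before use; the repo id below is a placeholder, substitute the actual repository:

```shell
# Download the GGUF file into the current directory
# (<repo-id> is a placeholder for the actual Hub repository)
huggingface-cli download <repo-id> qwen3.5-0.8b.gguf --local-dir .
```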

Usage

./llama-cli -m qwen3.5-0.8b.gguf -p "Hello"
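Beyond the one-shot prompt above, llama.cpp can also serve the model over an OpenAI-compatible HTTP API via llama-server; a minimal sketch (port and context size are illustrative):

```shell
# Start an OpenAI-compatible server on port 8080 with a 4096-token context
./llama-server -m qwen3.5-0.8b.gguf -c 4096 --port 8080

# From another shell, send a chat completion request
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"messages": [{"role": "user", "content": "Hello"}]}'
```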

Note

This is a standard Q4_K_M quantization, not PreSINQ-optimized.

Source

Original quantization by diodel.
