Quantizations of trohrbaugh/Qwen3.5-9B-heretic-v2.

Add following parameters to llama.cpp to disable thinking.

  --jinja ^
  --chat-template-kwargs "{\"enable_thinking\": false}" ^
Downloads last month
1,568
GGUF
Model size
9B params
Architecture
qwen35
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for AIImageStudio/Qwen3.5-9b-heretic-v2-GGUF

Quantized
(8)
this model