This is just Qwen 3.5 397B heretic, quantized down to 2.46 BPW (bits per weight) using ubergarm's smol-IQ2_XS recipe (https://huggingface.co/ubergarm/Qwen3.5-397B-A17B-GGUF).
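
For a rough sense of what 2.46 BPW means on disk, here is a back-of-the-envelope sketch. It assumes the 396B parameter count reported on the model page and treats 2.46 as the average bits per weight across all tensors. The helper name `quantized_size_bytes` is made up for illustration. The result covers weights only, so GGUF metadata, KV cache, and runtime buffers come on top of it.

```python
def quantized_size_bytes(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage: parameters * bits per weight / 8 bits per byte."""
    return n_params * bits_per_weight / 8


N_PARAMS = 396e9  # assumption: parameter count as reported on the model page
BPW = 2.46        # average bits per weight stated in the description

size = quantized_size_bytes(N_PARAMS, BPW)
print(f"{size / 1e9:.1f} GB ({size / 2**30:.1f} GiB)")
```

Under these assumptions the weights come to roughly 122 GB, so the actual files in the repo should land somewhere near that figure.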

GGUF
Model size
396B params
Architecture
qwen35moe

2-bit


Model tree for tarruda/Qwen3.5-397B-A17B-heretic-smol-IQ2_XS-GGUF