Qwen
/

Qwen3.5-397B-A17B-GPTQ-Int4

Image-Text-to-Text

4-bit precision

Model card Files Files and versions

Resources

View closed (0)

fix chat template to avoid empty historical `<think>` blocks

#5 opened 13 days ago by

latent-variable

GPTQ-Int4 模型能力相比于 FP8 差多少呢？

#4 opened about 1 month ago by

Poor performance in vLLM

#3 opened about 1 month ago by

GPTQ vs Q4 GGUF

#2 opened about 2 months ago by

Benchmark numbers of this quant version

#1 opened about 2 months ago by