quant_method Question?
#2
by glrra30 - opened
You list this model as AWQ, but config.json shows `"quant_method": "compressed-tensors"`.
Was this quantized with vllm-project/llm-compressor?
I'm just trying to understand why the config does not report `"quant_method": "awq"`.
vLLM's current ROCm support detects AWQ by looking for the `"awq"` quant_method tag together with a `w_bit` or `bits` key in the model's quantization config.
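For context, this is roughly the `quantization_config` shape that AWQ-style checkpoints (e.g. from AutoAWQ) write out, which is what that detection path matches on. The specific field values here are illustrative, not taken from this model:

```json
{
  "quantization_config": {
    "quant_method": "awq",
    "bits": 4,
    "group_size": 128,
    "zero_point": true,
    "version": "gemm"
  }
}
```

A checkpoint produced by llm-compressor instead carries `"quant_method": "compressed-tensors"` with its own scheme description, which would explain the mismatch you're seeing.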