Compressed Model: MilyaShams/Qwen3-1.7B-PTQ_W8A8_ign

This model is a quantized version of Qwen/Qwen3-1.7B, compressed with the llmcompressor framework using the W8A8 scheme (8-bit weights and 8-bit activations).

Compression Details

  • Base Model: Qwen/Qwen3-1.7B
  • Experiment Name: PTQ_W8A8_ign
  • Recipe / Modifiers Applied:
    config_groups=None targets=['Linear'] ignore=['lm_head'] scheme='W8A8' kv_cache_scheme=None
    weight_observer=None input_observer=None output_observer=None observer=None
    bypass_divisibility_checks=False
    index=None group=None start=None end=None update=None
    initialized_=True finalized_=None started_=True ended_=True
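
To give a feel for what the 'W8A8' scheme means arithmetically, here is a minimal pure-Python sketch of an int8 linear layer. It assumes symmetric quantization with one scale per output channel for the weights and one scale per token for the activations; the card does not state these details, and this is not llmcompressor's implementation. The names `quantize_rows` and `int8_linear` are hypothetical and exist only for this illustration.

```python
def quantize_rows(rows):
    """Symmetric int8 quantization with one scale per row (assumed granularity)."""
    q_rows, scales = [], []
    for row in rows:
        max_abs = max(abs(v) for v in row)
        # Map the largest magnitude in the row to 127; avoid a zero scale.
        scale = max_abs / 127 if max_abs > 0 else 1.0
        q_rows.append([max(-128, min(127, round(v / scale))) for v in row])
        scales.append(scale)
    return q_rows, scales


def int8_linear(x, w):
    """Approximate y = x @ w.T with int8 operands and integer accumulation."""
    xq, x_scales = quantize_rows(x)  # one scale per token (row of x)
    wq, w_scales = quantize_rows(w)  # one scale per output channel (row of w)
    out = []
    for x_row, sx in zip(xq, x_scales):
        out.append([
            # Integer dot product, rescaled back to floating point.
            sum(a * b for a, b in zip(x_row, w_row)) * sx * sw
            for w_row, sw in zip(wq, w_scales)
        ])
    return out


x = [[1.0, -2.0]]                   # one token, two input features
w = [[0.5, 0.25], [-1.0, 1.0]]      # two output channels
y = int8_linear(x, w)               # close to the exact result [[0.0, -3.0]]
```

The result differs from the full-precision product only by a small rounding error, which is the trade-off that 8-bit schemes accept in exchange for smaller tensors. Layers listed under `ignore` (here `lm_head`) would skip this step and stay in their original precision.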

Note: This model card was automatically generated. All structural modifiers and parameters used during compression are logged above.
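
Because the parameters are logged as flat `key=value` pairs, they can be read back into a dictionary for inspection. The sketch below is one possible way to do that with the standard library only; `parse_logged_modifier` is a hypothetical helper, and it relies on the assumption that no logged value contains whitespace, which holds for the dump in this card.

```python
import ast

# The parameter dump copied from this card.
LOGGED = (
    "config_groups=None targets=['Linear'] ignore=['lm_head'] scheme='W8A8' "
    "kv_cache_scheme=None weight_observer=None input_observer=None "
    "output_observer=None observer=None bypass_divisibility_checks=False "
    "index=None group=None start=None end=None update=None "
    "initialized_=True finalized_=None started_=True ended_=True"
)


def parse_logged_modifier(text):
    """Turn whitespace-separated key=value pairs into a dict of Python values."""
    params = {}
    for token in text.split():
        key, _, value = token.partition("=")
        # Values are plain Python literals (None, True, strings, lists).
        params[key] = ast.literal_eval(value)
    return params


params = parse_logged_modifier(LOGGED)
# Keep only the settings that were explicitly set to something other than None.
explicit = {k: v for k, v in params.items() if v is not None}
print(explicit)
```

Filtering out the `None` entries leaves the settings that actually shape the compression: the targeted layer type, the ignored layers, and the scheme, plus a few lifecycle flags.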

Format: Safetensors. Model size: 2B params. Tensor types: F16, I8.