YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
alphaXiv/filter-2b-gptq-int4
This is the GPTQ int4 quantized release of alphaXiv/filter-2b.
Quantization
- Base model:
alphaXiv/filter-2b - Method: GPTQ
- Scheme:
W4A16 - Weight bits: 4
- Group size: 128
actorder:weightlm_headexcluded from compression
Files
model.safetensorsconfig.jsongeneration_config.json- tokenizer files
recipe.yaml
Notes
The quantization metadata is stored in config.json under quantization_config.
- Downloads last month
- 59
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support