3-bit HQQ-quantized version of Meta-Llama-3.1-405B (base version). Expect some quality degradation relative to the unquantized model, but it should still be usable. Quantization parameters:
nbits=3, group_size=128, quant_zero=True, quant_scale=True, axis=0
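
To give a feel for what nbits=3 and group_size=128 mean, here is a minimal sketch of group-wise asymmetric quantization in plain Python. It is only an illustration: it uses simple round-to-nearest rather than HQQ's actual optimization, it works on a flat list so it ignores axis, and it does not model quant_zero or quant_scale. The function names are hypothetical and are not part of the hqq library.

```python
def quantize_group(weights, nbits=3):
    """Round-to-nearest asymmetric quantization of one group (not HQQ's solver)."""
    levels = 2 ** nbits - 1                    # 7 for 3-bit, so codes lie in 0..7
    w_min, w_max = min(weights), max(weights)
    scale = (w_max - w_min) / levels or 1.0    # fall back to 1.0 for a constant group
    zero = -w_min / scale                      # offset that maps w_min to code 0
    codes = [min(levels, max(0, round(w / scale + zero))) for w in weights]
    return codes, scale, zero


def dequantize_group(codes, scale, zero):
    """Map integer codes back to approximate float weights."""
    return [(c - zero) * scale for c in codes]


def quantize(weights, nbits=3, group_size=128):
    """Split a flat weight list into groups, each with its own scale and zero."""
    groups = [weights[i:i + group_size] for i in range(0, len(weights), group_size)]
    return [quantize_group(g, nbits) for g in groups]
```

Each group of 128 weights shares one scale and one zero point, so the reconstruction error per weight in this sketch is at most half a quantization step for that group.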
The checkpoint has been split into shards with "split". To recombine them:
cat qmodel_shard* > qmodel.pt
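
On systems without cat, the same concatenation can be sketched in Python. This assumes the shards sit in one directory and that sorting their names alphabetically gives the right order, which holds for the default suffixes produced by split. The function name recombine_shards and its parameters are hypothetical.

```python
import shutil
from pathlib import Path


def recombine_shards(shard_dir=".", pattern="qmodel_shard*", output="qmodel.pt"):
    """Concatenate all shards matching `pattern` into one file, in name order."""
    shard_dir = Path(shard_dir)
    shards = sorted(shard_dir.glob(pattern))   # assumes name order == shard order
    if not shards:
        raise FileNotFoundError(f"no files matching {pattern!r} in {shard_dir}")
    out_path = shard_dir / output
    with open(out_path, "wb") as out:
        for shard in shards:
            with open(shard, "rb") as src:
                shutil.copyfileobj(src, out)   # streams in chunks, never loads a whole shard
    return out_path
```

Calling recombine_shards() in the download directory writes qmodel.pt next to the shards. The copy is streamed, so memory use stays small even though the shards are large.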