OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models
Paper • 2308.13137 • Published • 19
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
Paper: https://arxiv.org/abs/2308.13137
Code: https://github.com/OpenGVLab/OmniQuant
To run this model, refer https://github.com/OpenGVLab/OmniQuant/blob/main/runing_falcon180b_on_single_a100_80g.ipynb for more details.