Update README.md
Browse files
README.md
CHANGED
|
@@ -8,6 +8,8 @@ tags:
|
|
| 8 |
|
| 9 |
This repo contains all the required Learnable Weight Clipping for Omniquant https://arxiv.org/abs/2308.13137.
|
| 10 |
|
|
|
|
|
|
|
| 11 |
To use them please first run:
|
| 12 |
|
| 13 |
```
|
|
@@ -26,4 +28,8 @@ CUDA_VISIBLE_DEVICES=0 python OmniQuant/main.py \
|
|
| 26 |
--epochs 0 --output_dir ./log/test \
|
| 27 |
--eval_ppl --wbits 4 --abits 16 --group_size 128 --lwc \
|
| 28 |
--resume OmniQuant_LWC/NAME_OF_MODEL-w4a16g128.pth
|
| 29 |
-
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 8 |
|
| 9 |
This repo contains all the required Learnable Weight Clipping for Omniquant https://arxiv.org/abs/2308.13137.
|
| 10 |
|
| 11 |
+
# How to use it?
|
| 12 |
+
|
| 13 |
To use them please first run:
|
| 14 |
|
| 15 |
```
|
|
|
|
| 28 |
--epochs 0 --output_dir ./log/test \
|
| 29 |
--eval_ppl --wbits 4 --abits 16 --group_size 128 --lwc \
|
| 30 |
--resume OmniQuant_LWC/NAME_OF_MODEL-w4a16g128.pth
|
| 31 |
+
```
|
| 32 |
+
|
| 33 |
+
# Methodology
|
| 34 |
+
|
| 35 |
+
The weights were run using a fork of OmniQuant available at [calibration.ipynb](https://github.com/Tfloow/AndaQuant/blob/main/calibration.ipynb)
|