Update README.md
Browse files
README.md
CHANGED
|
@@ -44,6 +44,23 @@ It was created using [AutoRound](https://github.com/intel/auto-round) with a cus
|
|
| 44 |
|
| 45 |
* For more information, please check `quantize.py`.
|
| 46 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 47 |
## How to use
|
| 48 |
|
| 49 |
* This model is tested on the latest `docker.io/vllm/vllm-openai:cu130-nightly`.
|
|
|
|
| 44 |
|
| 45 |
* For more information, please check `quantize.py`.
|
| 46 |
|
| 47 |
+
### KLD Metrics
|
| 48 |
+
| Metric | Value | Description |
|
| 49 |
+
| :--- | :--- | :--- |
|
| 50 |
+
| **Median KLD** | 0.005592 | Median divergence |
|
| 51 |
+
| **P90 KLD** | 0.034514 | Divergence at the 90th percentile |
|
| 52 |
+
| **Mean KLD** | 0.046941 | Average divergence |
|
| 53 |
+
| **Mean Coverage** | 0.994750 | - |
|
| 54 |
+
|
| 55 |
+
### Evaluation Configuration
|
| 56 |
+
| Parameter | Value |
|
| 57 |
+
| :--- | :--- |
|
| 58 |
+
| **Calibration Dataset** | wikitext-2-raw-v1 (test) |
|
| 59 |
+
| **Sequence Length** | 2048 |
|
| 60 |
+
| **Num Samples** | 64 |
|
| 61 |
+
| **Total Positions** | 131,008 |
|
| 62 |
+
| **Top-K Reference** | 1000 |
|
| 63 |
+
|
| 64 |
## How to use
|
| 65 |
|
| 66 |
* This model is tested on the latest `docker.io/vllm/vllm-openai:cu130-nightly`.
|