drawais commited on
Commit
ed3bb83
·
verified ·
1 Parent(s): 58a1e26

Add bench score 100/100/100

Browse files
Files changed (1) hide show
  1. README.md +10 -4
README.md CHANGED
@@ -25,6 +25,16 @@ INT4 quantization of [`Qwen/Qwen2.5-Coder-14B-Instruct`](https://huggingface.co/
25
  | Quantized weights | ~9.4 GB on disk |
26
  | Inference VRAM (incl. KV cache @ 32K context) | ~16 GB |
27
 
 
 
 
 
 
 
 
 
 
 
28
  ## Quick start
29
 
30
  ```bash
@@ -37,10 +47,6 @@ tok = AutoTokenizer.from_pretrained("drawais/Qwen2.5-Coder-14B-Instruct-AWQ-INT4
37
  model = AutoModelForCausalLM.from_pretrained("drawais/Qwen2.5-Coder-14B-Instruct-AWQ-INT4", device_map="auto")
38
  ```
39
 
40
- ## Bench
41
-
42
- Leaderboard score on [`drawais/needle-1M-bench-mvp`](https://huggingface.co/datasets/drawais/needle-1M-bench-mvp) coming after upload.
43
-
44
  ## License
45
 
46
  Apache 2.0 (inherits from base model).
 
25
  | Quantized weights | ~9.4 GB on disk |
26
  | Inference VRAM (incl. KV cache @ 32K context) | ~16 GB |
27
 
28
+ ## Bench
29
+
30
+ Scored on [`drawais/needle-1M-bench-mvp`](https://huggingface.co/datasets/drawais/needle-1M-bench-mvp) (50K-token haystack, real arxiv text):
31
+
32
+ | Metric | Score |
33
+ |---|---|
34
+ | Overall recall | **100.0%** |
35
+ | Paper-anchored | 100.0% |
36
+ | Synthetic codes | 100.0% |
37
+
38
  ## Quick start
39
 
40
  ```bash
 
47
  model = AutoModelForCausalLM.from_pretrained("drawais/Qwen2.5-Coder-14B-Instruct-AWQ-INT4", device_map="auto")
48
  ```
49
 
 
 
 
 
50
  ## License
51
 
52
  Apache 2.0 (inherits from base model).