Improve model card and add metadata #1
opened by nielsr (HF Staff)
README.md
CHANGED
@@ -1,8 +1,10 @@
 ---
-
+base_model: Qwen/Qwen3-8B
 language:
 - en
-
+license: apache-2.0
+library_name: transformers
+pipeline_tag: text-generation
 tags:
 - speculative-decoding
 - specblock
@@ -13,18 +15,25 @@ tags:
 
 SpecBlock draft model for speculative decoding, trained against the target model [`Qwen/Qwen3-8B`](https://huggingface.co/Qwen/Qwen3-8B).
 
+This model was introduced in the paper [SpecBlock: Block-Iterative Speculative Decoding with Dynamic Tree Drafting](https://huggingface.co/papers/2605.07243).
+
 ## Method
 
 SpecBlock — multi-block test-time training with cross-slot hidden injection between decoder layers and dynamic tree drafting.
 
 ## Usage
 
-End-to-end training and inference code: https://github.com/shiweijiezero/SpecBlock
+End-to-end training and inference code can be found in the official repository: https://github.com/shiweijiezero/SpecBlock
 
 Quick eval with the HF backend:
 
 ```bash
-python benchmarks_hf/run_eval.py
+python benchmarks_hf/run_eval.py \
+--algorithm specblock \
+--model-path Qwen/Qwen3-8B \
+--draft-model-path <local-clone-of-this-repo> \
+--benchmark-list mtbench:80 humaneval:164 gsm8k:200 \
+--output ./hf_results/specblock_qwen3.jsonl
 ```
 
 ## Citation
@@ -39,4 +48,4 @@ python benchmarks_hf/run_eval.py --algorithm specblock --model-path Qwen
 primaryClass={cs.CL},
 url={https://arxiv.org/abs/2605.07243}
 }
-```
+```
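
A note for anyone trying the updated quick-eval command: `--draft-model-path` expects a local copy of this model repo, so it may help to check that the directory exists before launching a long run. The sketch below is only an illustration under stated assumptions. `build_eval_cmd` is a hypothetical helper, not part of the SpecBlock repository. It prints the command from the new README instead of running it, and the flags are copied verbatim from the diff, not verified against `run_eval.py`. The repo id in the clone comment is a placeholder because this diff does not state it.

```bash
# Hypothetical helper (not part of the SpecBlock repo): print the quick-eval
# command from the README for a given local draft-model directory.
# Assumption: the flags below are exactly those shown in the updated README.
build_eval_cmd() {
  local draft_dir="$1"

  # Fail early if the local clone of the draft model is missing.
  if [ ! -d "$draft_dir" ]; then
    echo "draft model directory not found: $draft_dir" >&2
    return 1
  fi

  local cmd=(
    python benchmarks_hf/run_eval.py
    --algorithm specblock
    --model-path Qwen/Qwen3-8B
    --draft-model-path "$draft_dir"
    --benchmark-list mtbench:80 humaneval:164 gsm8k:200
    --output ./hf_results/specblock_qwen3.jsonl
  )

  # Dry run: print the command joined by spaces; nothing is executed here.
  echo "${cmd[*]}"
}

# Example (placeholder repo id, not given in this diff):
#   git clone https://huggingface.co/<this-repo-id> ./specblock-draft
#   build_eval_cmd ./specblock-draft
```

Printing the command keeps the sketch safe to try without the target model downloaded. To actually run the eval, paste the printed line into a shell from the root of a SpecBlock checkout.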