Improve model card and add metadata

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +14 -5
README.md CHANGED
@@ -1,8 +1,10 @@
1
  ---
2
- license: apache-2.0
3
  language:
4
  - en
5
- base_model: Qwen/Qwen3-8B
 
 
6
  tags:
7
  - speculative-decoding
8
  - specblock
@@ -13,18 +15,25 @@ tags:
13
 
14
  SpecBlock draft model for speculative decoding, trained against the target model [`Qwen/Qwen3-8B`](https://huggingface.co/Qwen/Qwen3-8B).
15
 
 
 
16
  ## Method
17
 
18
  SpecBlock — multi-block test-time training with cross-slot hidden injection between decoder layers and dynamic tree drafting.
19
 
20
  ## Usage
21
 
22
- End-to-end training and inference code: https://github.com/shiweijiezero/SpecBlock
23
 
24
  Quick eval with the HF backend:
25
 
26
  ```bash
27
- python benchmarks_hf/run_eval.py --algorithm specblock --model-path Qwen/Qwen3-8B --draft-model-path <local-clone-of-this-repo> --benchmark-list mtbench:80 humaneval:164 gsm8k:200 --output ./hf_results/specblock_qwen3.jsonl
 
 
 
 
 
28
  ```
29
 
30
  ## Citation
@@ -39,4 +48,4 @@ python benchmarks_hf/run_eval.py --algorithm specblock --model-path Qwen
39
  primaryClass={cs.CL},
40
  url={https://arxiv.org/abs/2605.07243}
41
  }
42
- ```
 
1
  ---
2
+ base_model: Qwen/Qwen3-8B
3
  language:
4
  - en
5
+ license: apache-2.0
6
+ library_name: transformers
7
+ pipeline_tag: text-generation
8
  tags:
9
  - speculative-decoding
10
  - specblock
 
15
 
16
  SpecBlock draft model for speculative decoding, trained against the target model [`Qwen/Qwen3-8B`](https://huggingface.co/Qwen/Qwen3-8B).
17
 
18
+ This model was introduced in the paper [SpecBlock: Block-Iterative Speculative Decoding with Dynamic Tree Drafting](https://huggingface.co/papers/2605.07243).
19
+
20
  ## Method
21
 
22
  SpecBlock — multi-block test-time training with cross-slot hidden injection between decoder layers and dynamic tree drafting.
23
 
24
  ## Usage
25
 
26
+ End-to-end training and inference code can be found in the official repository: https://github.com/shiweijiezero/SpecBlock
27
 
28
  Quick eval with the HF backend:
29
 
30
  ```bash
31
+ python benchmarks_hf/run_eval.py \
32
+ --algorithm specblock \
33
+ --model-path Qwen/Qwen3-8B \
34
+ --draft-model-path <local-clone-of-this-repo> \
35
+ --benchmark-list mtbench:80 humaneval:164 gsm8k:200 \
36
+ --output ./hf_results/specblock_qwen3.jsonl
37
  ```
38
 
39
  ## Citation
 
48
  primaryClass={cs.CL},
49
  url={https://arxiv.org/abs/2605.07243}
50
  }
51
+ ```