hsaest committed on
Commit
ac2cd89
·
verified ·
1 Parent(s): ce9f9a8

README: clarify training stage; remove unattributed benchmark numbers

Files changed (1)
  1. README.md +8 -16
README.md CHANGED
@@ -9,23 +9,17 @@ tags:
 
  # QUEST-30B-SFT
 
- QUEST **30B** vanilla SFT checkpoint (Qwen3-30B-A3B base, dense).
+ QUEST **30B** vanilla SFT checkpoint (Qwen3-30B-A3B base, dense). This is an intermediate artifact in the QUEST 30B training pipeline (MT → **SFT** → RL).
 
- Quantitative results for QUEST-30B (full MT+SFT+RL pipeline) are shown below for reference. SFT-only scores are not separately reported on the project page.
+ Benchmark results for this SFT-only checkpoint are not separately reported on the project page. For the full pipeline results, see [QUEST-30B-RL](https://huggingface.co/osunlp/QUEST-30B-RL).
 
- ## QUEST-30B benchmark results (MT+SFT+RL)
+ ## Training stage
 
- | Benchmark | Metric | Score |
- | --- | --- | ---: |
- | BrowseComp | avg@3 | 37.0 |
- | Mind2Web 2 | avg@3 | 28.6 |
- | HLE | avg@3 | 24.6 |
- | DeepResearch Bench | avg@3 | 45.3 |
- | BrowseComp-Plus | avg@3 | 48.2 |
- | WideSearch | Item F1 avg@4 | 54.2 |
- | GAIA | avg@3 | 69.0 |
- | LiveResearchBench | avg@3 | 74.1 |
- > Results from the full QUEST-30B (MT+SFT+RL) run, not this SFT-only checkpoint.
+ | Stage | Applied |
+ | --- | :---: |
+ | Mid-training (MT) | |
+ | Supervised fine-tuning (SFT) | |
+ | Reinforcement learning (RL) | |
 
  ## Quick start
 
@@ -39,8 +33,6 @@ model = AutoModelForCausalLM.from_pretrained(
  )
  ```
 
- Apply the model's chat template with `tokenizer.apply_chat_template(...)` before passing prompts.
-
  ## License
 
  Released under the **Apache License 2.0**.