hsaest committed on
Commit 6ec668c · verified · 1 Parent(s): aedaa8a

README: clarify training stage; remove unattributed benchmark numbers

Files changed (1)
  1. README.md +12 -13
README.md CHANGED
@@ -9,20 +9,21 @@ tags:
 
  # QUEST-35B-MT
 
- QUEST **35B-class MoE** checkpoint after **mid-training + SFT** (Qwen3.5-35B-A3B base). This is the +MT stage in the training progression (Vanilla SFT → **MT+SFT** RL).
+ This is the **mid-training only** checkpoint of QUEST-35B-A3B (`Qwen3_5MoeForConditionalGeneration`). It has **not** been fine-tuned with SFT or RL and therefore **does not** have the instruction-following or tool-use capabilities required to complete deep research tasks.
+
+ It is released as an intermediate artifact for research purposes (e.g., initializing further fine-tuning experiments or studying the effect of mid-training). If you need a usable deep research agent, please use [QUEST-35B-MT+SFT](https://huggingface.co/osunlp/QUEST-35B-MT-Plus-SFT) or [QUEST-35B-RL](https://huggingface.co/osunlp/QUEST-35B-RL) instead.
+
+ ## Training stage
+
+ | Stage | Applied |
+ | --- | :---: |
+ | Mid-training (MT) | ✓ |
+ | Supervised fine-tuning (SFT) | ✗ |
+ | Reinforcement learning (RL) | ✗ |
 
  ## Benchmark results
 
- | Benchmark | Metric | Score |
- | --- | --- | ---: |
- | BrowseComp | avg@3 | 45.5 |
- | Mind2Web 2 | avg@3 | 29.9 |
- | HLE | avg@3 | 39.74 |
- | DeepResearch Bench | avg@3 | 39.72 |
- | BrowseComp-Plus | avg@3 | 58.6 |
- | WideSearch | Item F1 avg@4 | 62.5 |
- | GAIA | avg@3 | 83.17 |
- | LiveResearchBench | avg@3 | 65.47 |
+ Not reported: this model lacks task-completion capability without SFT.
 
  ## Quick start
 
@@ -36,8 +37,6 @@ model = AutoModelForCausalLM.from_pretrained(
  )
  ```
 
- Apply the model's chat template with `tokenizer.apply_chat_template(...)` before passing prompts.
-
  ## License
 
  Released under the **Apache License 2.0**.