hsaest commited on
Commit
aedaa8a
·
verified ·
1 Parent(s): 6212079

README: add benchmark result table from project page

Browse files
Files changed (1) hide show
  1. README.md +17 -6
README.md CHANGED
@@ -9,7 +9,20 @@ tags:
9
 
10
  # QUEST-35B-MT
11
 
12
- **Qwen3.5 MoE**-style **35B-class** checkpoint after **mid-training** (`Qwen3_5MoeForConditionalGeneration`). Expect higher VRAM / expert routing behavior than dense 30B models.
 
 
 
 
 
 
 
 
 
 
 
 
 
13
 
14
  ## Quick start
15
 
@@ -19,14 +32,12 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
19
  model_id = "osunlp/QUEST-35B-MT"
20
  tokenizer = AutoTokenizer.from_pretrained(model_id)
21
  model = AutoModelForCausalLM.from_pretrained(
22
- model_id,
23
- device_map="auto",
24
- torch_dtype="auto",
25
  )
26
  ```
27
 
28
- Use the chat template and `tokenizer.apply_chat_template(...)` when available. VRAM and dtype requirements depend on model size and MoE vs dense architecture; see `config.json` (`model_type`, `architectures`).
29
 
30
  ## License
31
 
32
- This model is released under the **Apache License 2.0** (`apache-2.0`).
 
9
 
10
  # QUEST-35B-MT
11
 
12
+ QUEST **35B-class MoE** checkpoint after **mid-training + SFT** (Qwen3.5-35B-A3B base). This is the +MT stage in the training progression (Vanilla → SFT → **MT+SFT** → RL).
13
+
14
+ ## Benchmark results
15
+
16
+ | Benchmark | Metric | Score |
17
+ | --- | --- | ---: |
18
+ | BrowseComp | avg@3 | 45.5 |
19
+ | Mind2Web 2 | avg@3 | 29.9 |
20
+ | HLE | avg@3 | 39.74 |
21
+ | DeepResearch Bench | avg@3 | 39.72 |
22
+ | BrowseComp-Plus | avg@3 | 58.6 |
23
+ | WideSearch | Item F1 avg@4 | 62.5 |
24
+ | GAIA | avg@3 | 83.17 |
25
+ | LiveResearchBench | avg@3 | 65.47 |
26
 
27
  ## Quick start
28
 
 
32
  model_id = "osunlp/QUEST-35B-MT"
33
  tokenizer = AutoTokenizer.from_pretrained(model_id)
34
  model = AutoModelForCausalLM.from_pretrained(
35
+ model_id, device_map="auto", torch_dtype="auto",
 
 
36
  )
37
  ```
38
 
39
+ Apply the model's chat template with `tokenizer.apply_chat_template(...)` before passing prompts.
40
 
41
  ## License
42
 
43
+ Released under the **Apache License 2.0**.