hsaest commited on
Commit
ce9f9a8
·
verified ·
1 Parent(s): 5686633

README: add benchmark result table from project page

Browse files
Files changed (1) hide show
  1. README.md +20 -6
README.md CHANGED
@@ -9,7 +9,23 @@ tags:
9
 
10
  # QUEST-30B-SFT
11
 
12
- **30B** **vanilla SFT** dense model (QUEST line).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
 
14
  ## Quick start
15
 
@@ -19,14 +35,12 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
19
  model_id = "osunlp/QUEST-30B-SFT"
20
  tokenizer = AutoTokenizer.from_pretrained(model_id)
21
  model = AutoModelForCausalLM.from_pretrained(
22
- model_id,
23
- device_map="auto",
24
- torch_dtype="auto",
25
  )
26
  ```
27
 
28
- Use the chat template and `tokenizer.apply_chat_template(...)` when available. VRAM and dtype requirements depend on model size and MoE vs dense architecture; see `config.json` (`model_type`, `architectures`).
29
 
30
  ## License
31
 
32
- This model is released under the **Apache License 2.0** (`apache-2.0`).
 
9
 
10
  # QUEST-30B-SFT
11
 
12
+ QUEST **30B** vanilla SFT checkpoint (Qwen3-30B-A3B base, dense).
13
+
14
+ Quantitative results for QUEST-30B (full MT+SFT+RL pipeline) are shown below for reference. SFT-only scores are not separately reported on the project page.
15
+
16
+ ## QUEST-30B benchmark results (MT+SFT+RL)
17
+
18
+ | Benchmark | Metric | Score |
19
+ | --- | --- | ---: |
20
+ | BrowseComp | avg@3 | 37.0 |
21
+ | Mind2Web 2 | avg@3 | 28.6 |
22
+ | HLE | avg@3 | 24.6 |
23
+ | DeepResearch Bench | avg@3 | 45.3 |
24
+ | BrowseComp-Plus | avg@3 | 48.2 |
25
+ | WideSearch | Item F1 avg@4 | 54.2 |
26
+ | GAIA | avg@3 | 69.0 |
27
+ | LiveResearchBench | avg@3 | 74.1 |
28
+ > Results from the full QUEST-30B (MT+SFT+RL) run, not this SFT-only checkpoint.
29
 
30
  ## Quick start
31
 
 
35
  model_id = "osunlp/QUEST-30B-SFT"
36
  tokenizer = AutoTokenizer.from_pretrained(model_id)
37
  model = AutoModelForCausalLM.from_pretrained(
38
+ model_id, device_map="auto", torch_dtype="auto",
 
 
39
  )
40
  ```
41
 
42
+ Apply the model's chat template with `tokenizer.apply_chat_template(...)` before passing prompts.
43
 
44
  ## License
45
 
46
+ Released under the **Apache License 2.0**.