hsaest commited on
Commit
c803831
·
verified ·
1 Parent(s): bf9d923

Update QUEST family links and citation

Browse files
Files changed (1) hide show
  1. README.md +28 -0
README.md CHANGED
@@ -24,6 +24,20 @@ QUEST **30B** full model after **mid-training → SFT → RL** (Qwen3-30B-A3B ba
24
  | GAIA | avg@3 | 69.0 |
25
  | LiveResearchBench | avg@3 | 74.1 |
26
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
27
  ## Quick start
28
 
29
  ```python
@@ -38,6 +52,20 @@ model = AutoModelForCausalLM.from_pretrained(
38
 
39
  Apply the model's chat template with `tokenizer.apply_chat_template(...)` before passing prompts.
40
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
41
  ## License
42
 
43
  Released under the **Apache License 2.0**.
 
24
  | GAIA | avg@3 | 69.0 |
25
  | LiveResearchBench | avg@3 | 74.1 |
26
 
27
+ ## QUEST Family
28
+
29
+ | Type | Resources |
30
+ | --- | --- |
31
+ | 35B checkpoints | [RL](https://huggingface.co/osunlp/QUEST-35B-RL), [MT+SFT](https://huggingface.co/osunlp/QUEST-35B-MT-Plus-SFT), [MT](https://huggingface.co/osunlp/QUEST-35B-MT), [SFT](https://huggingface.co/osunlp/QUEST-35B-SFT) |
32
+ | 30B checkpoints | [RL](https://huggingface.co/osunlp/QUEST-30B-RL), [MT+SFT](https://huggingface.co/osunlp/QUEST-30B-MT-Plus-SFT), [SFT](https://huggingface.co/osunlp/QUEST-30B-SFT) |
33
+ | Smaller checkpoints | [9B](https://huggingface.co/osunlp/QUEST-9B), [4B](https://huggingface.co/osunlp/QUEST-4B), [2B](https://huggingface.co/osunlp/QUEST-2B) |
34
+ | Training data | [RL data](https://huggingface.co/datasets/osunlp/QUEST-RL-Data), [SFT objective data](https://huggingface.co/datasets/osunlp/QUEST-SFT-Data-Objective), [SFT open-ended data](https://huggingface.co/datasets/osunlp/QUEST-SFT-Data-Open-ended) |
35
+
36
+ Model selection note: if you only need to evaluate objective tasks and do not
37
+ need open-ended task evaluation, we recommend the MT+SFT checkpoints because
38
+ they perform better on reasoning-heavy objective benchmarks. For a more comprehensive evaluation
39
+ across both objective and open-ended tasks, we recommend the RL checkpoints.
40
+
41
  ## Quick start
42
 
43
  ```python
 
52
 
53
  Apply the model's chat template with `tokenizer.apply_chat_template(...)` before passing prompts.
54
 
55
+ ## Citation
56
+
57
+ If our paper or related resources prove valuable to your research, we kindly ask
58
+ for a citation.
59
+
60
+ ```bibtex
61
+ @misc{xie2026quest,
62
+ title={QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks},
63
+ author={Xie, Jian and Lin, Tianhe and Wang, Zilu and Ning, Yuting and Yao, Yuekun and Xue, Tianci and Zhang, Zhehao and Li, Zhongyang and Zhang, Kai and Wu, Yufan and Chen, Shijie and Gou, Boyu and Han, Mingzhe and Wang, Yifei and Lee, Vint and Wei, Xinpeng and Wang, Xiangjun and Su, Yu and Sun, Huan},
64
+ journal={arXiv preprint arXiv:2605.24218},
65
+ year={2026}
66
+ }
67
+ ```
68
+
69
  ## License
70
 
71
  Released under the **Apache License 2.0**.