hsaest commited on
Commit
40d411e
·
verified ·
1 Parent(s): 96f1666

Update QUEST family links and citation

Browse files
Files changed (1) hide show
  1. README.md +28 -0
README.md CHANGED
@@ -24,6 +24,20 @@ QUEST **35B-class MoE** SFT-only checkpoint (Qwen3.5-35B-A3B base, `Qwen3_5MoeFo
24
  | GAIA | avg@3 | 83.5 |
25
  | LiveResearchBench | avg@3 | 64.69 |
26
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
27
  ## Quick start
28
 
29
  ```python
@@ -38,6 +52,20 @@ model = AutoModelForCausalLM.from_pretrained(
38
 
39
  Apply the model's chat template with `tokenizer.apply_chat_template(...)` before passing prompts.
40
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
41
  ## License
42
 
43
  Released under the **Apache License 2.0**.
 
24
  | GAIA | avg@3 | 83.5 |
25
  | LiveResearchBench | avg@3 | 64.69 |
26
 
27
+ ## QUEST Family
28
+
29
+ | Type | Resources |
30
+ | --- | --- |
31
+ | 35B checkpoints | [RL](https://huggingface.co/osunlp/QUEST-35B-RL), [MT+SFT](https://huggingface.co/osunlp/QUEST-35B-MT-Plus-SFT), [MT](https://huggingface.co/osunlp/QUEST-35B-MT), [SFT](https://huggingface.co/osunlp/QUEST-35B-SFT) |
32
+ | 30B checkpoints | [RL](https://huggingface.co/osunlp/QUEST-30B-RL), [MT+SFT](https://huggingface.co/osunlp/QUEST-30B-MT-Plus-SFT), [SFT](https://huggingface.co/osunlp/QUEST-30B-SFT) |
33
+ | Smaller checkpoints | [9B](https://huggingface.co/osunlp/QUEST-9B), [4B](https://huggingface.co/osunlp/QUEST-4B), [2B](https://huggingface.co/osunlp/QUEST-2B) |
34
+ | Training data | [RL data](https://huggingface.co/datasets/osunlp/QUEST-RL-Data), [SFT objective data](https://huggingface.co/datasets/osunlp/QUEST-SFT-Data-Objective), [SFT open-ended data](https://huggingface.co/datasets/osunlp/QUEST-SFT-Data-Open-ended) |
35
+
36
+ Model selection note: if you only need to evaluate objective tasks and do not
37
+ need open-ended task evaluation, we recommend the MT+SFT checkpoints because
38
+ they perform better on reasoning-heavy objective benchmarks. For a more comprehensive evaluation
39
+ across both objective and open-ended tasks, we recommend the RL checkpoints.
40
+
41
  ## Quick start
42
 
43
  ```python
 
52
 
53
  Apply the model's chat template with `tokenizer.apply_chat_template(...)` before passing prompts.
54
 
55
+ ## Citation
56
+
57
+ If our paper or related resources prove valuable to your research, we kindly ask
58
+ for a citation.
59
+
60
+ ```bibtex
61
+ @misc{xie2026quest,
62
+ title={QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks},
63
+ author={Xie, Jian and Lin, Tianhe and Wang, Zilu and Ning, Yuting and Yao, Yuekun and Xue, Tianci and Zhang, Zhehao and Li, Zhongyang and Zhang, Kai and Wu, Yufan and Chen, Shijie and Gou, Boyu and Han, Mingzhe and Wang, Yifei and Lee, Vint and Wei, Xinpeng and Wang, Xiangjun and Su, Yu and Sun, Huan},
64
+ journal={arXiv preprint arXiv:2605.24218},
65
+ year={2026}
66
+ }
67
+ ```
68
+
69
  ## License
70
 
71
  Released under the **Apache License 2.0**.