hsaest committed on
Commit 795e148 · verified · 1 Parent(s): e43b4e9

Update QUEST family links and citation

  1. README.md +28 -0
README.md CHANGED
@@ -21,6 +21,20 @@ We did not run benchmark evaluations on this checkpoint. For full pipeline resul
@@ -33,6 +47,20 @@ model = AutoModelForCausalLM.from_pretrained(
  | Supervised fine-tuning (SFT) | ✓ |
  | Reinforcement learning (RL) | ✗ |
 
+ ## QUEST Family
+
+ | Type | Resources |
+ | --- | --- |
+ | 35B checkpoints | [RL](https://huggingface.co/osunlp/QUEST-35B-RL), [MT+SFT](https://huggingface.co/osunlp/QUEST-35B-MT-Plus-SFT), [MT](https://huggingface.co/osunlp/QUEST-35B-MT), [SFT](https://huggingface.co/osunlp/QUEST-35B-SFT) |
+ | 30B checkpoints | [RL](https://huggingface.co/osunlp/QUEST-30B-RL), [MT+SFT](https://huggingface.co/osunlp/QUEST-30B-MT-Plus-SFT), [SFT](https://huggingface.co/osunlp/QUEST-30B-SFT) |
+ | Smaller checkpoints | [9B](https://huggingface.co/osunlp/QUEST-9B), [4B](https://huggingface.co/osunlp/QUEST-4B), [2B](https://huggingface.co/osunlp/QUEST-2B) |
+ | Training data | [RL data](https://huggingface.co/datasets/osunlp/QUEST-RL-Data), [SFT objective data](https://huggingface.co/datasets/osunlp/QUEST-SFT-Data-Objective), [SFT open-ended data](https://huggingface.co/datasets/osunlp/QUEST-SFT-Data-Open-ended) |
+
+ Model selection note: if you only need to evaluate objective tasks and do not
+ need open-ended task evaluation, we recommend the MT+SFT checkpoints because
+ they perform better on reasoning-heavy objective benchmarks. For a more comprehensive evaluation
+ across both objective and open-ended tasks, we recommend the RL checkpoints.
+
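The model selection note added in this hunk amounts to a two-way choice, which could be sketched roughly as follows. The repo IDs are copied from the QUEST Family table; the `QUEST_CHECKPOINTS` mapping and the `pick_quest_checkpoint` helper are hypothetical names introduced only for illustration and are not part of the README or of any QUEST tooling.

```python
# Hypothetical helper illustrating the model selection note.
# Repo IDs come from the QUEST Family table; nothing else here is official.
QUEST_CHECKPOINTS = {
    ("35B", "RL"): "osunlp/QUEST-35B-RL",
    ("35B", "MT+SFT"): "osunlp/QUEST-35B-MT-Plus-SFT",
    ("30B", "RL"): "osunlp/QUEST-30B-RL",
    ("30B", "MT+SFT"): "osunlp/QUEST-30B-MT-Plus-SFT",
}


def pick_quest_checkpoint(size: str, needs_open_ended: bool) -> str:
    """Return the repo ID the note recommends for a given evaluation scope.

    Objective-only evaluation -> MT+SFT checkpoint.
    Objective plus open-ended evaluation -> RL checkpoint.
    """
    variant = "RL" if needs_open_ended else "MT+SFT"
    try:
        return QUEST_CHECKPOINTS[(size, variant)]
    except KeyError:
        # Only the 35B and 30B rows list both RL and MT+SFT variants.
        raise ValueError(f"no {variant} checkpoint listed for size {size!r}") from None


if __name__ == "__main__":
    print(pick_quest_checkpoint("35B", needs_open_ended=False))
    print(pick_quest_checkpoint("30B", needs_open_ended=True))
```

The returned string could then be passed as the repo ID when loading the model in the Quick start snippet, assuming that snippet takes a Hub repo ID as its first argument.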
  ## Quick start
 
  ```python
 
  )
  ```
 
+ ## Citation
+
+ If our paper or related resources prove valuable to your research, we kindly ask
+ for a citation.
+
+ ```bibtex
+ @misc{xie2026quest,
+ title={QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks},
+ author={Xie, Jian and Lin, Tianhe and Wang, Zilu and Ning, Yuting and Yao, Yuekun and Xue, Tianci and Zhang, Zhehao and Li, Zhongyang and Zhang, Kai and Wu, Yufan and Chen, Shijie and Gou, Boyu and Han, Mingzhe and Wang, Yifei and Lee, Vint and Wei, Xinpeng and Wang, Xiangjun and Su, Yu and Sun, Huan},
+ journal={arXiv preprint arXiv:2605.24218},
+ year={2026}
+ }
+ ```
+
  ## License
 
  Released under the **Apache License 2.0**.