hsaest commited on
Commit
36838f4
·
verified ·
1 Parent(s): 48b2179

Update QUEST family links and citation

Browse files
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -30,7 +30,7 @@ QUEST **35B-class MoE** full model after **mid-training → SFT → RL** (Qwen3.
30
  | 35B checkpoints | [RL](https://huggingface.co/osunlp/QUEST-35B-RL), [MT+SFT](https://huggingface.co/osunlp/QUEST-35B-MT-Plus-SFT), [MT](https://huggingface.co/osunlp/QUEST-35B-MT), [SFT](https://huggingface.co/osunlp/QUEST-35B-SFT) |
31
  | 30B checkpoints | [RL](https://huggingface.co/osunlp/QUEST-30B-RL), [MT+SFT](https://huggingface.co/osunlp/QUEST-30B-MT-Plus-SFT), [SFT](https://huggingface.co/osunlp/QUEST-30B-SFT) |
32
  | Smaller checkpoints | [9B](https://huggingface.co/osunlp/QUEST-9B), [4B](https://huggingface.co/osunlp/QUEST-4B), [2B](https://huggingface.co/osunlp/QUEST-2B) |
33
- | Training data | [RL data](https://huggingface.co/datasets/osunlp/QUEST-RL-Data), [SFT objective data](https://huggingface.co/datasets/osunlp/QUEST-SFT-Data-Objective), [SFT open-ended data](https://huggingface.co/datasets/osunlp/QUEST-SFT-Data-Open-ended) |
34
 
35
  Model selection note: if you only need to evaluate objective tasks and do not
36
  need open-ended task evaluation, we recommend the MT+SFT checkpoints because
@@ -51,6 +51,10 @@ model = AutoModelForCausalLM.from_pretrained(
51
 
52
  Apply the model's chat template with `tokenizer.apply_chat_template(...)` before passing prompts.
53
 
 
 
 
 
54
  ## Citation
55
 
56
  If our paper or related resources prove valuable to your research, we kindly ask
@@ -64,7 +68,3 @@ for a citation.
64
  year={2026}
65
  }
66
  ```
67
-
68
- ## License
69
-
70
- Released under the **Apache License 2.0**.
 
30
  | 35B checkpoints | [RL](https://huggingface.co/osunlp/QUEST-35B-RL), [MT+SFT](https://huggingface.co/osunlp/QUEST-35B-MT-Plus-SFT), [MT](https://huggingface.co/osunlp/QUEST-35B-MT), [SFT](https://huggingface.co/osunlp/QUEST-35B-SFT) |
31
  | 30B checkpoints | [RL](https://huggingface.co/osunlp/QUEST-30B-RL), [MT+SFT](https://huggingface.co/osunlp/QUEST-30B-MT-Plus-SFT), [SFT](https://huggingface.co/osunlp/QUEST-30B-SFT) |
32
  | Smaller checkpoints | [9B](https://huggingface.co/osunlp/QUEST-9B), [4B](https://huggingface.co/osunlp/QUEST-4B), [2B](https://huggingface.co/osunlp/QUEST-2B) |
33
+ | Training data | [RL data](https://huggingface.co/datasets/osunlp/QUEST-RL-Data), [SFT objective data](https://huggingface.co/datasets/osunlp/QUEST-SFT-Data-Objective), [SFT open-ended data](https://huggingface.co/datasets/osunlp/Quest-SFT-Data-Open-ended), [Mid-training data](https://huggingface.co/datasets/osunlp/QUEST-Mid-Training-Data) |
34
 
35
  Model selection note: if you only need to evaluate objective tasks and do not
36
  need open-ended task evaluation, we recommend the MT+SFT checkpoints because
 
51
 
52
  Apply the model's chat template with `tokenizer.apply_chat_template(...)` before passing prompts.
53
 
54
+ ## License
55
+
56
+ Released under the **Apache License 2.0**.
57
+
58
  ## Citation
59
 
60
  If our paper or related resources prove valuable to your research, we kindly ask
 
68
  year={2026}
69
  }
70
  ```