hsaest committed
Commit ccbc79a · verified · 1 Parent(s): c803831

Update QUEST family links and citation

Files changed (1)
  1. README.md +5 -5
README.md CHANGED
@@ -31,7 +31,7 @@ QUEST **30B** full model after **mid-training → SFT → RL** (Qwen3-30B-A3B ba
  | 35B checkpoints | [RL](https://huggingface.co/osunlp/QUEST-35B-RL), [MT+SFT](https://huggingface.co/osunlp/QUEST-35B-MT-Plus-SFT), [MT](https://huggingface.co/osunlp/QUEST-35B-MT), [SFT](https://huggingface.co/osunlp/QUEST-35B-SFT) |
  | 30B checkpoints | [RL](https://huggingface.co/osunlp/QUEST-30B-RL), [MT+SFT](https://huggingface.co/osunlp/QUEST-30B-MT-Plus-SFT), [SFT](https://huggingface.co/osunlp/QUEST-30B-SFT) |
  | Smaller checkpoints | [9B](https://huggingface.co/osunlp/QUEST-9B), [4B](https://huggingface.co/osunlp/QUEST-4B), [2B](https://huggingface.co/osunlp/QUEST-2B) |
- | Training data | [RL data](https://huggingface.co/datasets/osunlp/QUEST-RL-Data), [SFT objective data](https://huggingface.co/datasets/osunlp/QUEST-SFT-Data-Objective), [SFT open-ended data](https://huggingface.co/datasets/osunlp/QUEST-SFT-Data-Open-ended) |
+ | Training data | [RL data](https://huggingface.co/datasets/osunlp/QUEST-RL-Data), [SFT objective data](https://huggingface.co/datasets/osunlp/QUEST-SFT-Data-Objective), [SFT open-ended data](https://huggingface.co/datasets/osunlp/Quest-SFT-Data-Open-ended), [Mid-training data](https://huggingface.co/datasets/osunlp/QUEST-Mid-Training-Data) |
 
  Model selection note: if you only need to evaluate objective tasks and do not
  need open-ended task evaluation, we recommend the MT+SFT checkpoints because
@@ -52,6 +52,10 @@ model = AutoModelForCausalLM.from_pretrained(
 
  Apply the model's chat template with `tokenizer.apply_chat_template(...)` before passing prompts.
 
+ ## License
+
+ Released under the **Apache License 2.0**.
+
  ## Citation
 
  If our paper or related resources prove valuable to your research, we kindly ask
@@ -65,7 +69,3 @@ for a citation.
  year={2026}
  }
  ```
-
- ## License
-
- Released under the **Apache License 2.0**.