Safetensors
qwen3_moe
xiaoxuanzi commited on
Commit
f48fe94
·
verified ·
1 Parent(s): d378d01

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -0
README.md CHANGED
@@ -1,3 +1,17 @@
1
  ---
2
  license: mit
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: mit
3
  ---
4
+ # ✨ GoLongRL-30B-A3B
5
+
6
+ We present GoLongRL, a fully open-source, capability-oriented post-training recipe for long-context reinforcement learning with verifiable rewards (RLVR).
7
+
8
+ | Resource | Link |
9
+ |---|---|
10
+ | 📝 Preprints | [Paper](https://arxiv.org/abs/2605.19577) |
11
+ | 🤗 Daily Paper | [Paper]() |
12
+ | 🤗 Model Hub | [GoLongRL-4B](https://huggingface.co/Kwai-Klear/GoLongRL-4B) |
13
+ | 🤗 Model Hub | [GoLongRL-30B-A3B](https://huggingface.co/Kwai-Klear/GoLongRL-30B-A3B) |
14
+ | 🤗 Dataset Hub | [Code RL](https://huggingface.co/datasets/Kwai-Klear/GoLongRL) |
15
+ | 📧 Contact | xiao_xuan_zi_666@163.com & suzhenpeng13@163.com |
16
+
17
+