Kwai-Klear
/

GoLongRL-30B-A3B

Model card Files Files and versions

xiaoxuanzi commited on about 19 hours ago

Commit

f48fe94

·

verified ·

1 Parent(s): d378d01

Update README.md

Files changed (1) hide show

README.md +14 -0

README.md CHANGED Viewed

@@ -1,3 +1,17 @@
 ---
 license: mit
 ---

 ---
 license: mit
 ---
+# ✨ GoLongRL-30B-A3B
+We present GoLongRL, a fully open-source, capability-oriented post-training recipe for long-context reinforcement learning with verifiable rewards (RLVR).
+| Resource | Link |
+|---|---|
+| 📝 Preprints | [Paper](https://arxiv.org/abs/2605.19577) |
+| 🤗 Daily Paper | [Paper]() |
+| 🤗 Model Hub | [GoLongRL-4B](https://huggingface.co/Kwai-Klear/GoLongRL-4B) |
+| 🤗 Model Hub | [GoLongRL-30B-A3B](https://huggingface.co/Kwai-Klear/GoLongRL-30B-A3B) |
+| 🤗 Dataset Hub | [Code RL](https://huggingface.co/datasets/Kwai-Klear/GoLongRL) |
+| 📧 Contact | xiao_xuan_zi_666@163.com & suzhenpeng13@163.com |