Update README.md
README.md
CHANGED
@@ -1,3 +1,17 @@
 ---
 license: mit
 ---
+# ✨ GoLongRL-30B-A3B
+
+We present GoLongRL, a fully open-source, capability-oriented post-training recipe for long-context reinforcement learning with verifiable rewards (RLVR).
+
+| Resource | Link |
+|---|---|
+| 📝 Preprints | [Paper](https://arxiv.org/abs/2605.19577) |
+| 🤗 Daily Paper | [Paper]() |
+| 🤗 Model Hub | [GoLongRL-4B](https://huggingface.co/Kwai-Klear/GoLongRL-4B) |
+| 🤗 Model Hub | [GoLongRL-30B-A3B](https://huggingface.co/Kwai-Klear/GoLongRL-30B-A3B) |
+| 🤗 Dataset Hub | [Code RL](https://huggingface.co/datasets/Kwai-Klear/GoLongRL) |
+| 📧 Contact | xiao_xuan_zi_666@163.com & suzhenpeng13@163.com |
+
+