Update README.md
Browse files
README.md
CHANGED
|
@@ -2,4 +2,18 @@
|
|
| 2 |
|
| 3 |
This is the pre-trained policy model weights for our novel method, Constrained Initial Representations (CIR).
|
| 4 |
|
| 5 |
-
The results are grouped into each environment and seed. One can directly load the policy weights from the corresponding directory
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
|
| 3 |
This is the pre-trained policy model weights for our novel method, Constrained Initial Representations (CIR).
|
| 4 |
|
| 5 |
+
The results are grouped into each environment and seed. One can directly load the policy weights from the corresponding directory
|
| 6 |
+
|
| 7 |
+
One can find our papers [here](https://arxiv.org/pdf/2602.11800)
|
| 8 |
+
|
| 9 |
+
# Citation
|
| 10 |
+
|
| 11 |
+
If you find our work interesting or use our work in your paper, please consider citing our paper:
|
| 12 |
+
```
|
| 13 |
+
@article{lyu2026temporal,
|
| 14 |
+
title={Temporal Difference Learning with Constrained Initial Representations},
|
| 15 |
+
author={Lyu, Jiafei and Yang, Jingwen and Qiao, Zhongjian and Liu, Runze and Liu, Zeyuan and Ye, Deheng and Lu, Zongqing and Li, Xiu},
|
| 16 |
+
journal={arXiv preprint arXiv:2602.11800},
|
| 17 |
+
year={2026}
|
| 18 |
+
}
|
| 19 |
+
```
|