---
license: mit
language:
- en
base_model:
- meta-llama/CodeLlama-7b-Instruct-hf
---

# Citation
If you find our work helpful, please consider citing it:
```
@misc{wu2026securecodegenerationonline,
  title={Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model},
  author={Tianyi Wu and Mingzhe Du and Yue Liu and Chengran Yang and Terry Yue Zhuo and Jiaheng Zhang and See-Kiong Ng},
  year={2026},
  eprint={2602.07422},
  archivePrefix={arXiv},
  primaryClass={cs.CR},
  url={https://arxiv.org/abs/2602.07422},
}
```