REARANK: Reasoning Re-ranking Agent via Reinforcement Learning
Paper: arXiv:2505.20046
This is a reasoning reranking agent built on Qwen-2.5-7B for the paper REARANK: Reasoning Re-ranking Agent via Reinforcement Learning. The model is trained with GRPO (Group Relative Policy Optimization) on a reranking dataset built from only 179 queries. The codebase is at https://github.com/lezhang7/Rearank
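As a rough illustration of the group-relative idea behind GRPO, the sketch below normalizes the rewards of several rerankings sampled for the same query against that group's mean and standard deviation. It is a minimal sketch and not the training code from the repository: the function name `group_relative_advantages`, the `eps` term, the use of the population standard deviation, and the reward values are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Score each sampled output relative to its own group.

    `rewards` holds one scalar reward per output sampled for a single
    query. `eps` (an assumption) avoids division by zero when every
    output in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an illustrative choice
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four rerankings sampled for one query.
rewards = [0.2, 0.5, 0.5, 0.8]
advantages = group_relative_advantages(rewards)
```

Outputs that score above the group mean get a positive advantage and those below get a negative one, so no separately learned value model is needed to provide a baseline.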