Training-Free Dynamic Upcycling of Expert Language Models Paper • 2603.29765 • Published 13 days ago • 7
Hail to the Thief: Exploring Attacks and Defenses in Decentralised GRPO Paper • 2511.09780 • Published Nov 12, 2025 • 29
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing Paper • 2509.08721 • Published Sep 10, 2025 • 665
steevg/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-tough_endangered_cat Text Generation • 0.5B • Updated May 28, 2025 • 3