OSINT / .tmp_compare
siddeshwar-kagatikar
fix(rewards): never crash GRPO on malformed completions
d814291