vaibhav12332112312's picture
train: batched parallel rollouts on Qwen2.5-3B + parser hardening
a6b8df0