v3: multi-turn env, thinking tokens, cross-family Qwen->Llama, multi-step GRPO 67509ac Don Rishabh Claude Opus 4.7 (1M context) committed 13 days ago
Initial commit: Prompt Golf environment for OpenEnv 6850dad Don Rishabh Claude Opus 4.7 (1M context) committed 15 days ago