Pre-launch fixes: disable Qwen3 thinking, strip think blocks, degenerate-short guard 5abc867 Don Rishabh Claude Opus 4.7 (1M context) commited on 14 days ago
v2 stack: Qwen3.5-2B agent/target, Qwen3.5-9B judge, hard tasks, additive reward 3889513 Don Rishabh Claude Opus 4.7 (1M context) commited on 14 days ago
Initial commit: Prompt Golf environment for OpenEnv 6850dad Don Rishabh Claude Opus 4.7 (1M context) commited on 15 days ago