Nuclear clamp: every reward source in the codebase now returns (0.05, 0.95) 719c147 ar9avg committed 3 days ago
Clamp all remaining score leak paths: /state, step_rewards, demo SSE e99d0aa ar9avg committed 3 days ago
Bulletproof _safe_score for all bad inputs (None, NaN, strings, bool) 2014920 ar9avg committed 4 days ago
Fix task scores to be strictly in (0, 1) exclusive per OpenEnv spec d2d92b8 ar9avg committed 4 days ago
Refactor README.md by removing metadata and updating content 805743c unverified ar9avg committed 4 days ago
Reset SQL on new attempt so attempts don't concatenate in the same box b15235a ar9avg committed 4 days ago
Fix UnboundLocalError: remove duplicate local import of REPAIR_ACTION_BY_NAME c2894a4 ar9avg committed 4 days ago
Surface GEPA optimization: prompt history, live banner, smart retry aa3ae1f ar9avg committed 4 days ago
Fix IndexError: skip empty-choices chunks from HF Router streaming 8c8093f ar9avg committed 4 days ago
Add LLM diagnostics: /api/test-llm endpoint + startup/error logging 63cbec3 ar9avg committed 4 days ago
Fix chat SSE events to match frontend protocol (result+done instead of success) f4110fc ar9avg committed 4 days ago
fix: GEPA current_generation, task_id mapping, Connect DB button, remove difficulty from header f0b682f ar9avg committed 4 days ago
fix: default LLM to HF Router + Qwen2.5-72B, no custom Space variables needed 68ebe84 ar9avg committed 4 days ago
feat: demo mode with reward chart, github diff, single-difficulty rounds, no loop 2d33bcd ar9avg committed 4 days ago
fix: align rl-state API shape with frontend, add error boundary ce1c471 ar9avg committed 4 days ago
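
Note on the score-clamping commits (719c147, e99d0aa, 2014920, d2d92b8): the messages describe coercing bad inputs (None, NaN, strings, bool) and keeping every score strictly inside (0, 1), with 0.05 and 0.95 as the bounds. The repository's actual `_safe_score` is not shown in this log, so the following is only a minimal sketch of that idea. The function name `safe_score`, the `default` parameter, the fallback value, and the choice to reject booleans outright are all assumptions made for illustration.

```python
import math

# Bounds taken from the commit message "(0.05, 0.95)"; treating them as
# inclusive clamp limits is an assumption of this sketch.
LOW, HIGH = 0.05, 0.95


def safe_score(value, default=LOW):
    """Hypothetical helper: coerce any input to a float in [LOW, HIGH]."""
    # bool is a subclass of int in Python, so check it before float().
    # Rejecting it (instead of mapping True/False to 1/0) is an assumption.
    if value is None or isinstance(value, bool):
        return default
    try:
        score = float(value)  # accepts ints, floats, numeric strings
    except (TypeError, ValueError):
        return default  # non-numeric strings, lists, dicts, ...
    if math.isnan(score):
        return default
    # Clamping also handles +/- infinity and out-of-range values.
    return min(HIGH, max(LOW, score))
```

Under these assumptions, every return value lies in [0.05, 0.95], which is strictly inside (0, 1) as the d2d92b8 message requires. Applying one helper at every output path (the log mentions /state, step_rewards, and the demo SSE stream) is one way to avoid the "leak paths" the second commit refers to.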