Commit History

Nuclear clamp: every reward source in the codebase now returns (0.05, 0.95)
719c147

ar9avg committed on

Clamp all remaining score leak paths: /state, step_rewards, demo SSE
e99d0aa

ar9avg committed on

Bulletproof _safe_score for all bad inputs (None, NaN, strings, bool)
2014920

ar9avg committed on

Clamp every grader return value strictly inside (0, 1)
98b87b7

ar9avg committed on

Defensive score clamping at all emission points
263261a

ar9avg committed on

Widen score epsilon to 0.01 so :.3f formatting stays in (0, 1)
ba69b5f

ar9avg committed on

Fix reward.value to always be task_score in (0,1) exclusive
b86d426

ar9avg committed on

Fix task scores to be strictly in (0, 1) exclusive per OpenEnv spec
d2d92b8

ar9avg committed on

fix
63d67f0

ar9avg committed on

fix
f32a4be

ar9avg committed on

fix
c293dc3

ar9avg committed on

fix
140731d

ar9avg committed on

Update project origin description in README
b799708
unverified

ar9avg committed on

Clean up README.md by removing metadata
c8952f6
unverified

ar9avg committed on

fix
9f7dd14

ar9avg committed on

Remove origin details from README
965a112
unverified

ar9avg committed on

Refactor README.md by removing metadata and updating content
805743c
unverified

ar9avg committed on

fix
92cc088

ar9avg committed on

fix
55f54ec

ar9avg committed on

fix
44ef33f

ar9avg committed on

fix
2f89522

ar9avg committed on

fix
17e7bd7

ar9avg committed on

fix
24ef2cf

ar9avg committed on

fix
ed79e58

ar9avg committed on

fix
4ec680a

ar9avg committed on

fix
b00a200

ar9avg committed on

fix
c07b98d

ar9avg committed on

Reset SQL on new attempt so attempts don't concatenate in the same box
b15235a

ar9avg committed on

Fix UnboundLocalError: remove duplicate local import of REPAIR_ACTION_BY_NAME
c2894a4

ar9avg committed on

Surface GEPA optimization: prompt history, live banner, smart retry
aa3ae1f

ar9avg committed on

Fix IndexError: skip empty-choices chunks from HF Router streaming
8c8093f

ar9avg committed on

Add LLM diagnostics: /api/test-llm endpoint + startup/error logging
63cbec3

ar9avg committed on

Surface real error messages and fix LLM exception formatting
e9bea1b

ar9avg committed on

Fix chat SSE events to match frontend protocol (result+done instead of success)
f4110fc

ar9avg committed on

Fix chat query failures and benchmark ID mismatches
8ae8e0b

ar9avg committed on

fix: GEPA current_generation, task_id mapping, Connect DB button, remove difficulty from header
f0b682f

ar9avg committed on

fix: default LLM to HF Router + Qwen2.5-72B, no custom Space variables needed
68ebe84

ar9avg committed on

fix: remove duplicate YAML frontmatter in README
cb9cfe8

ar9avg committed on

fix: light mode contrast, collapse queries on demo completion
cc67cd2

ar9avg committed on

feat: demo mode with reward chart, github diff, single-difficulty rounds, no loop
2d33bcd

ar9avg committed on

feat: add Demo button with scripted autoplay showcase
d0e0cf7

ar9avg committed on

fix: align rl-state API shape with frontend, add error boundary
ce1c471

ar9avg committed on

Initial submission: SQL Agent OpenEnv for Meta+HF hackathon
3c665d2

ar9avg committed on

initial commit
d796343
verified

ar9av committed on