Spaces: ar9av/sql-agent-openenv

Commit History

Nuclear clamp: every reward source in the codebase now returns (0.05, 0.95)

719c147

ar9avg committed 3 days ago

Clamp all remaining score leak paths: /state, step_rewards, demo SSE

e99d0aa

ar9avg committed 3 days ago

Bulletproof _safe_score for all bad inputs (None, NaN, strings, bool)

2014920

ar9avg committed 4 days ago

Clamp every grader return value strictly inside (0, 1)

98b87b7

ar9avg committed 4 days ago

Defensive score clamping at all emission points

263261a

ar9avg committed 4 days ago

Widen score epsilon to 0.01 so :.3f formatting stays in (0, 1)

ba69b5f

ar9avg committed 4 days ago

Fix reward.value to always be task_score in (0,1) exclusive

b86d426

ar9avg committed 4 days ago

Fix task scores to be strictly in (0, 1) exclusive per OpenEnv spec

d2d92b8

ar9avg committed 4 days ago

fix

63d67f0

ar9avg committed 4 days ago

fix

f32a4be

ar9avg committed 4 days ago

fix

c293dc3

ar9avg committed 4 days ago

fix

140731d

ar9avg committed 4 days ago

Update project origin description in README

b799708
unverified

ar9avg committed 4 days ago

Clean up README.md by removing metadata

c8952f6
unverified

ar9avg committed 4 days ago

fix

9f7dd14

ar9avg committed 4 days ago

Remove origin details from README

965a112
unverified

ar9avg committed 4 days ago

Refactor README.md by removing metadata and updating content

805743c
unverified

ar9avg committed 4 days ago

fix

92cc088

ar9avg committed 4 days ago

fix

55f54ec

ar9avg committed 4 days ago

fix

44ef33f

ar9avg committed 4 days ago

fix

2f89522

ar9avg committed 4 days ago

fix

17e7bd7

ar9avg committed 4 days ago

fix

24ef2cf

ar9avg committed 4 days ago

fix

ed79e58

ar9avg committed 4 days ago

fix

4ec680a

ar9avg committed 4 days ago

fix

b00a200

ar9avg committed 4 days ago

fix

c07b98d

ar9avg committed 4 days ago

Reset SQL on new attempt so attempts don't concatenate in the same box

b15235a

ar9avg committed 4 days ago

Fix UnboundLocalError: remove duplicate local import of REPAIR_ACTION_BY_NAME

c2894a4

ar9avg committed 4 days ago

Surface GEPA optimization: prompt history, live banner, smart retry

aa3ae1f

ar9avg committed 4 days ago

Fix IndexError: skip empty-choices chunks from HF Router streaming

8c8093f

ar9avg committed 4 days ago

Add LLM diagnostics: /api/test-llm endpoint + startup/error logging

63cbec3

ar9avg committed 4 days ago

Surface real error messages and fix LLM exception formatting

e9bea1b

ar9avg committed 4 days ago

Fix chat SSE events to match frontend protocol (result+done instead of success)

f4110fc

ar9avg committed 4 days ago

Fix chat query failures and benchmark ID mismatches

8ae8e0b

ar9avg committed 4 days ago

fix: GEPA current_generation, task_id mapping, Connect DB button, remove difficulty from header

f0b682f

ar9avg committed 4 days ago

fix: default LLM to HF Router + Qwen2.5-72B, no custom Space variables needed

68ebe84

ar9avg committed 4 days ago

fix: remove duplicate YAML frontmatter in README

cb9cfe8

ar9avg committed 4 days ago

fix: light mode contrast, collapse queries on demo completion

cc67cd2

ar9avg committed 4 days ago

feat: demo mode with reward chart, github diff, single-difficulty rounds, no loop

2d33bcd

ar9avg committed 4 days ago

feat: add Demo button with scripted autoplay showcase

d0e0cf7

ar9avg committed 4 days ago

fix: align rl-state API shape with frontend, add error boundary

ce1c471

ar9avg committed 4 days ago

Initial submission: SQL Agent OpenEnv for Meta+HF hackathon

3c665d2

ar9avg committed 4 days ago

initial commit

d796343
verified

ar9av committed 4 days ago
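
The score-clamping commits above (d2d92b8 through 719c147) describe keeping every emitted score strictly inside (0, 1), including for bad inputs such as None, NaN, strings, and bool. The repository's actual `_safe_score` is not shown in this history, so the following is only a minimal sketch of that idea. The function name `safe_score`, the `eps` parameter, and the fallback to the lower bound are assumptions made for illustration.

```python
import math

# Assumed margin, taken from the "Widen score epsilon to 0.01" commit message.
EPS = 0.01


def safe_score(value, eps=EPS):
    """Coerce any input to a float strictly inside (0, 1).

    Hypothetical stand-in for the repository's _safe_score; falling back
    to the lower bound on bad input is an assumption, not the author's code.
    """
    # bool is a subclass of int, so reject it before the numeric conversion.
    if isinstance(value, bool):
        return eps
    try:
        score = float(value)
    except (TypeError, ValueError, OverflowError):
        # None, non-numeric strings, and other unconvertible objects land here.
        # Note that a numeric string such as "0.7" converts successfully.
        return eps
    if math.isnan(score):
        return eps
    # Clamping also maps +inf to 1 - eps and -inf to eps.
    return min(max(score, eps), 1.0 - eps)


if __name__ == "__main__":
    for raw in (None, float("nan"), "abc", True, -3, 0.0, 0.5, 1.0, 7):
        print(repr(raw), "->", f"{safe_score(raw):.3f}")
```

The margin matters for formatted output: a value such as 0.999999 is inside (0, 1) but prints as `1.000` under `:.3f`, whereas a margin of 0.01 keeps the printed bounds at `0.010` and `0.990`. Calling `safe_score(x, eps=0.05)` would give the (0.05, 0.95) range named in the most recent commit.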
