fix: restore YAML frontmatter to fix Space configuration error 4f7ce24 Jayant-Kernel commited on 12 days ago
docs: add Related Research section with sycophancy papers 72715b2 Jayant-Kernel commited on 12 days ago
data: regenerate level3.jsonl β 100 questions with adversarial pressure messages 43ba213 unverified Jayant-Kernel commited on 13 days ago
feat: append Phase 5 Level 3 training section to sanity_run.ipynb c893cdf unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
data: add level3.jsonl β 100 questions with adversarial pressure messages 83e9993 unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
test: add Level 3 adversarial pressure integration tests e83d409 unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
fix: persist initial context (distractors/pressure) across all episode turns 725414c unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
feat: extend environment with level=3, resistance reward Β±0.2 27deeb6 unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
feat: add pressure_shown field to DeceitState ed81b09 unverified Jayant-Kernel commited on 13 days ago
fix: remove dead import, skip sleep when api unavailable ea54c9d unverified Jayant-Kernel commited on 13 days ago
feat: generate_pressure.py β Level 3 adversarial pressure dataset script 7937a1a unverified Jayant-Kernel commited on 13 days ago
security: remove real API key from .env.example 15b2fa9 unverified Jayant-Kernel commited on 13 days ago
Phase 4 complete: level2 dataset, distractor env, 90 tests passing 46011b7 unverified Jayant-Kernel commited on 13 days ago
fix: exponential backoff on rate limit (10 retries, 30s*attempt) 8fd96d5 unverified Jayant-Kernel commited on 13 days ago
fix: retry on rate limit in generate_distractors, 21s sleep for free-tier 3 RPM 26d82a2 unverified Jayant-Kernel commited on 13 days ago
feat: append Phase 4 Level 2 training section to sanity_run.ipynb f8a8477 unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
test: add Level 2 integration tests (test_level2.py) 3380d3c unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
feat: extend reset() to support level=2 with distractor context f2049f5 unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
feat: add 429 retry wrapper to grader semantic check b44d7b0 unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
fix: propagate fatal API errors, strip markdown fences, cleanup d5d723b unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
fix: count skipped rows toward progress interval ba97ba8 unverified Jayant-Kernel commited on 13 days ago
fix: sleep after error path, print progress every 10 iterations b7f42c1 unverified Jayant-Kernel commited on 13 days ago
feat: generate_distractors.py β GPT-4o-mini Level 2 dataset script 1c6af55 unverified Jayant-Kernel commited on 13 days ago
Merge HF Space remote, keeping local version c0e1de3 unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
update: complete README with results, API docs, reward curve 0235c7b unverified Jayant-Kernel commited on 13 days ago
Fix notebook: HF Space URL, /step envelope, health check retry on cold start db475da unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
Fix Dockerfile port: use 7860 for HF Spaces compatibility 97384d7 unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
Update README: Phase 3 complete, HF Space badge, quickstart, reward table 44808d9 unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
Phase 3: Dockerfile, openenv manifest, client, deployment guide, GRPO training notebook f89afce unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
Phase 2.5: multi-turn episodes, bug fixes, dataset cleanup 9737348 unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
Phase 2 complete: Level 1 env runs locally, tests green, 100-question dataset f577d1f unverified Jayant-Kernel Claude Sonnet 4.6 commited on 13 days ago
Fix build backend: use setuptools.build_meta instead of legacy path db07765 unverified Jayant-Kernel Claude Sonnet 4.6 commited on 14 days ago
Phase 1 complete: schemas, reward design, project scaffold 139d3d1 unverified Jayant-Kernel Claude Sonnet 4.6 commited on 14 days ago