Parlay / training / generate_data.py

Commit History

Add pre-training audit scripts, OpenEnv manifest, and tune Parlay training/env (GRPO 1.5B default, min-reward filters, weighted data gen, hiring ZOPA+drift, veteran/opponent prompts, Docker/docs)
df724f2

Committed by sh4shv4t

feat: flash-lite for data-gen and flash for UI; remove training page; card tests; --quiet data gen; data/ inspect path; random baseline; GRPO env wrapper; reward fixes (buyer ZOPA, ToM signals); drift + Brier metrics; Bayesian ToM module
15976d0

Committed by sh4shv4t

feat: backup existing data + per-episode progress tracking + gemini live-call verification
48756ef

Committed by sh4shv4t

fix: normalise reward terms for acquisition_term_sheet scale mismatch
5c7939a

Committed by sh4shv4t

fix: fixed sys.path issues on running generate_data.py
3791108

Committed by sh4shv4t

feat: backup pre-2.5 data + add --inspect flag for quality diagnostic run
7ad35af

Committed by sh4shv4t

feat: streamline parlay for demo mode and add spectator negotiation mechanics
2568517

Committed by sh4shv4t

feat: split Gemini 2.5 Flash (demo) and Flash-Lite (data), SFT threshold 0.3, favicon + check_gemini
9d82eed

Committed by sh4shv4t

feat: project setup
698f4d8

Committed by sh4shv4t