Add OpenEnv client, compat layer, manifest, scripts, GRPO plot hook, and README 81b4b70 sh4shv4t commited on 13 days ago
Add pre-training audit scripts, OpenEnv manifest, and tune Parlay training/env (GRPO 1.5B default, min-reward filters, weighted data gen, hiring ZOPA+drift, veteran/opponent prompts, Docker/docs) df724f2 sh4shv4t commited on 13 days ago
feat: flash-lite for data-gen and flash for UI; remove training page; card tests; --quiet data gen; data/ inspect path; random baseline; GRPO env wrapper; reward fixes (buyer ZOPA, ToM signals); drift + Brier metrics; Bayesian ToM module 15976d0 sh4shv4t commited on 13 days ago
fix: normalise reward terms for acquisition_term_sheet scale mismatch 5c7939a sh4shv4t commited on 13 days ago
feat: streamline parlay for demo mode and add spectator negotiation mechanics 2568517 sh4shv4t commited on 14 days ago
build: add Windows PowerShell setup scripts and fix venv paths for Windows development 7183e08 sh4shv4t commited on 16 days ago