Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
sh4shv4t
/
Parlay
like
0
Paused
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
Parlay
/
agent
66 kB
Ctrl+K
Ctrl+K
1 contributor
History:
12 commits
sh4shv4t
feat: training results page + SFT Colab notebook
108bc34
12 days ago
__init__.py
Safe
379 Bytes
feat: split Gemini 2.5 Flash (demo) and Flash-Lite (data), SFT threshold 0.3, favicon + check_gemini
14 days ago
gemini_client.py
Safe
20.5 kB
Add pre-training audit scripts, OpenEnv manifest, and tune Parlay training/env (GRPO 1.5B default, min-reward filters, weighted data gen, hiring ZOPA+drift, veteran/opponent prompts, Docker/docs)
13 days ago
hf_opponent.py
Safe
5.51 kB
feat: training results page + SFT Colab notebook
12 days ago
personas.py
Safe
6.54 kB
Add pre-training audit scripts, OpenEnv manifest, and tune Parlay training/env (GRPO 1.5B default, min-reward filters, weighted data gen, hiring ZOPA+drift, veteran/opponent prompts, Docker/docs)
13 days ago
runner.py
Safe
12 kB
Add pre-training audit scripts, OpenEnv manifest, and tune Parlay training/env (GRPO 1.5B default, min-reward filters, weighted data gen, hiring ZOPA+drift, veteran/opponent prompts, Docker/docs)
13 days ago
tom_tracker.py
Safe
7.97 kB
Add pre-training audit scripts, OpenEnv manifest, and tune Parlay training/env (GRPO 1.5B default, min-reward filters, weighted data gen, hiring ZOPA+drift, veteran/opponent prompts, Docker/docs)
13 days ago
tom_tracker_bayesian.py
Safe
13.1 kB
feat: flash-lite for data-gen and flash for UI; remove training page; card tests; --quiet data gen; data/ inspect path; random baseline; GRPO env wrapper; reward fixes (buyer ZOPA, ToM signals); drift + Brier metrics; Bayesian ToM module
13 days ago