final-iteration/server/viraltest_environment.py

Commit History

added more scenarios
1a2a407

anuragredbus committed on

train(grpo): unified hint prompt, no-history chat, positive-advantage filter
3326716

vaibhav12332112312 committed on

fix: align notebook with 15-day horizon, drop unused replies field
f7b5241

vaibhav12332112312 committed on

Set TASK_HORIZON to 15 days and align graders, UI, and training prompts.
99717c2

anuragredbus committed on

fix(env): tolerate malformed predict_engagement scheduled_actions
4bfe286

vaibhav12332112312 committed on

train: shrink to weekly horizon + bounded steps
abe4587

vaibhav12332112312 committed on

train: per-step credit + drop replies + larger batches
9ee7a09

vaibhav12332112312 committed on

reduced steps to fit our free tier
fcfbc38

anuragredbus committed on

Viraltest OpenEnv: deploy to HF Space
28dd5a4

anuragredbus committed on