Spaces:

ycwhencpp
/

final-iteration

Paused

App Files Files Community

final-iteration / server /viraltest_environment.py

Commit History

train(grpo): unified hint prompt, no-history chat, positive-advantage filter

3326716

vaibhav12332112312 commited on about 1 month ago

fix: align notebook with 15-day horizon, drop unused replies field

f7b5241

vaibhav12332112312 commited on about 1 month ago

update

5459ec8

vaibhav12332112312 commited on about 1 month ago

Set TASK_HORIZON to 15 days and align graders, UI, and training prompts.

99717c2

anuragredbus commited on about 1 month ago

update

f9880dd

vaibhav12332112312 commited on about 1 month ago

fix(env): tolerate malformed predict_engagement scheduled_actions

4bfe286

vaibhav12332112312 commited on about 1 month ago

train: shrink to weekly horizon + bounded steps

abe4587

vaibhav12332112312 commited on about 1 month ago

train: per-step credit + drop replies + larger batches

9ee7a09

vaibhav12332112312 commited on about 1 month ago

update

97ee7e7

vaibhav12332112312 commited on about 1 month ago

la la la --123

e2c547b

anuragredbus commited on about 1 month ago

firstiteration

fc3950d

vaibhav12332112312 commited on about 1 month ago

reduced steps to fit out free tier

fcfbc38

anuragredbus commited on Apr 8

Viraltest OpenEnv: deploy to HF Space

28dd5a4

anuragredbus commited on Apr 8