Commit History

Inject peak hours + history + post-mandate, run SFT every round
30614d3

vaibhav12332112312 committed on

Upload folder using huggingface_hub
1dc66ef
verified

vaibhavkhandare committed on

ReAct two-pass per day so model sees current-day tool results
b1c1732

vaibhav12332112312 committed on

Strip leaked peak-hour info from observation, force tool discovery
afbf541

vaibhav12332112312 committed on

Mandate tool calls in system prompt to debug zero-tool collapse
4299c91

vaibhav12332112312 committed on

Upload folder using huggingface_hub
9fac734
verified

vaibhavkhandare committed on

Match eval sampling to training, log all I/O, single round
271bf42

vaibhav12332112312 committed on

Upload folder using huggingface_hub
ad5d3b3
verified

vaibhavkhandare committed on

train(grpo): unified hint prompt, no-history chat, positive-advantage filter
3326716

vaibhav12332112312 committed on

Upload folder using huggingface_hub
e955a2d
verified

vaibhavkhandare committed on

fix: align notebook with 15-day horizon, drop unused replies field
f7b5241

vaibhav12332112312 committed on

Merge branch 'main' of https://huggingface.co/spaces/vaibhavkhandare/train-bhai-train
21edd7d

vaibhav12332112312 committed on

train: batched parallel rollouts on Qwen2.5-3B + parser hardening
a6b8df0

vaibhav12332112312 committed on

Stop tracking plots/*.png with Git LFS; use small inline PNGs for HF Hub.
81cdb34

anuragredbus committed on

Default repo clone branch to main for training notebooks and HF script.
ad48770

anuragredbus committed on

Set TASK_HORIZON to 15 days and align graders, UI, and training prompts.
99717c2

anuragredbus committed on

Upload folder using huggingface_hub
302be2b
verified

vaibhavkhandare committed on

fix(env): tolerate malformed predict_engagement scheduled_actions
4bfe286

vaibhav12332112312 committed on

train: shrink to weekly horizon + bounded steps
abe4587

vaibhav12332112312 committed on

train: default HF Job flavor l4x1 -> l40sx1 (48GB VRAM)
76b19bd

vaibhav12332112312 committed on

train: per-step credit + drop replies + larger batches
9ee7a09

vaibhav12332112312 committed on

fix(notebook): py3.11 f-string backslash error in format_obs
56f70b1

vaibhav12332112312 committed on

Merge branch 'main' of https://huggingface.co/spaces/vaibhavkhandare/train-bhai-train
383294c

vaibhav12332112312 committed on

fix(notebook): pin typing_extensions>=4.13.0 to fix pydantic Sentinel ImportError
b1bd9cc

vaibhav12332112312 committed on

fix: add missing metadata on display_data outputs
6573551

vaibhav12332112312 committed on

fix: add missing 'name' to stream outputs
4e45ebc

vaibhav12332112312 committed on

add: viraltest code (server, models, inference)
9c3eab8

vaibhav12332112312 committed on

fix: restore parse_model_output exception parity with original bare except
aeedd8d

anuragredbus committed on

chore: align train_grpo.ipynb with smoke/syntax patterns for Colab
0587f05

anuragredbus committed on

add training/syntax_only.ipynb — kernel + Python syntax only (no project logic)
0e50d91

anuragredbus committed on

add train_grpo_smoke notebook; quote pip versions in train_grpo
b55c1ff

anuragredbus committed on

fix: notebook loads Qwen without bitsandbytes on Mac; optional training deps
eb1d764

anuragredbus committed on

fix: robust notebook setup (no magic shell) + local CWD auto-detect
8d09986

anuragredbus committed on

Merge branch 'hack1' of github.com:VaibhavKhandare/viral-posts-env into hack1
6c01076

vaibhav12332112312 committed on

fix: rewrite training notebook for real LoRA fine-tuning on Colab
4a29e22

anuragredbus committed on

reduced steps to fit out free tier
fcfbc38

anuragredbus committed on

reduced steps to fit out free tier
571f8a4

anuragredbus committed on

Viraltest OpenEnv: deploy to HF Space
28dd5a4

anuragredbus committed on