Commit History

readme updations
aa2673f

muskan singh commited on

updated readme
1c4a544

muskan singh commited on

adding
f5c84e7

muskan singh commited on

plots, results, readme updations
8d77f52

muskan singh commited on

fix: stable GRPO notebook — pin TRL<=0.24, multi-step reward, Drive checkpoints every 30 steps
5ebb26b

muskan singh Claude Opus 4.7 commited on

better workflows
03d30a6

Taniieeee83 commited on

Merge remote changes
05d8492

srishtichugh commited on

animate ui
c06d0c3

srishtichugh commited on

Fix missing matplotlib dependency
134c2ae

muskan singh commited on

training script
869f731

muskan singh commited on

training notebook with training logs
e22f664

muskan singh commited on

training notebook
9e29238

muskan singh commited on

gemma fix
a35bcd0

muskan singh commited on

quick fix
ef4ebed

muskan singh commited on

bug fixes, better generalization
dd5113f

muskan singh commited on

changed reward scores
1519439

Taniieeee83 commited on

made changes to the workflows
105c5c9

Taniieeee83 commited on

Merge: combine upgrade ui + working pipelines with proper baselines
934f824

srishtichugh commited on

upgrade ui
d05687c

srishtichugh commited on

working pipelines with proper baselineees
a5d93ec

muskan singh commited on

inference fix
ae350b5

muskan singh commited on

updated readme, requirements.txt
da84c63

Taniieeee83 commited on

added minimal ui and all 4 apps+workflows
2305b9f

Taniieeee83 commited on

changed till step 8
4719066

Taniieeee83 commited on