claims-env / training

Commit History

notebook: embed executed GRPO run outputs + plot
92cb33b
verified

akhiilll commited on

deploy: full Space sync — app + server + training + GRPO run artifacts
7d84d3d
verified

akhiilll commited on

make GRPOConfig kwargs version-tolerant
e893ade
verified

akhiilll commited on

add HF Jobs GRPO training script
eed849b
verified

akhiilll commited on

wire Colab notebook to TRL GRPOTrainer (real LoRA weight updates)
43372d5
verified

akhiilll commited on

hackathon submission: theme-aligned README, refreshed training, Qwen vs random plots
9fed01d
verified

akhiilll commited on

Add CUDA backend (transformers) for HF Jobs A10G runs
3b45a42
verified

akhiilll commited on

Deploy ClaimSense adjudication gym
1cfeb15
verified

akhiilll commited on