deploy: full Space sync — app + server + training + GRPO run artifacts 7d84d3d verified akhiilll committed 15 days ago
wire Colab notebook to TRL GRPOTrainer (real LoRA weight updates) 43372d5 verified akhiilll committed 15 days ago
hackathon submission: theme-aligned README, refreshed training, Qwen vs random plots 9fed01d verified akhiilll committed 15 days ago
Add CUDA backend (transformers) for HF Jobs A10G runs 3b45a42 verified akhiilll committed 15 days ago