README: restructure to hackathon template (demo/results/quickstart) 0e89831 verified akhiilll commited on 12 days ago
mini-blog: reorder (problem→story→market), remove file metrics a4aeb37 verified akhiilll commited on 12 days ago
README: link mini-blog markdown for submission form f6829dd verified akhiilll commited on 12 days ago
mini-blog: add storytelling + simple end-to-end architecture e347811 verified akhiilll commited on 12 days ago
fix: auto-reset on first step + dashboard auto-reset on load (no more 500 on click query_policy) 9e60416 verified akhiilll commited on 13 days ago
deploy: full Space sync — app + server + training + GRPO run artifacts 7d84d3d verified akhiilll commited on 13 days ago
README: embed GRPO training curves + reference adapter run f1d1901 verified akhiilll commited on 13 days ago
wire Colab notebook to TRL GRPOTrainer (real LoRA weight updates) 43372d5 verified akhiilll commited on 13 days ago
add runs/20260425-215059/qwen_vs_random.png (re-upload) 2581673 verified akhiilll commited on 13 days ago
add runs/20260425-215059/reward_curves.png (re-upload) 5aab172 verified akhiilll commited on 13 days ago
hackathon submission: theme-aligned README, refreshed training, Qwen vs random plots 9fed01d verified akhiilll commited on 13 days ago
Add CUDA backend (transformers) for HF Jobs A10G runs 3b45a42 verified akhiilll commited on 13 days ago