revert: restore server/app.py to last known working state 6d34172 verified Vikaspandey582003 commited on 12 days ago
Deploy HTML landing page at root / (replaces JSON response) 75e6c2c verified Vikaspandey582003 commited on 12 days ago
Add direct links to training script, log CSV, and plots in README 1d69094 verified Vikaspandey582003 commited on 12 days ago
revert: restore requirements.txt to last working state 47d0f5b verified Vikaspandey582003 commited on 12 days ago
revert: restore server/app.py to last working state 09024bf verified Vikaspandey582003 commited on 12 days ago
fix: remove openenv.core dependency β pure FastAPI, graceful fallback 26ba066 verified Vikaspandey582003 commited on 12 days ago
fix: remove openenv.core dependency β pure FastAPI, graceful fallback ee53375 verified Vikaspandey582003 commited on 12 days ago
fix: add openenv>=0.1.13 to requirements.txt 7497306 verified Vikaspandey582003 commited on 12 days ago
fix: redirect root / to /ui so judges see Gradio UI not raw JSON 4c9a59a verified Vikaspandey582003 commited on 12 days ago
add: complete HF blog post content for hackathon submission 053ded9 verified Vikaspandey582003 commited on 12 days ago
story: add qualitative reward examples, blog+video links, before/after proof 5acb852 verified Vikaspandey582003 commited on 12 days ago
results: update with real v3 eval data β ECE -86.5% on GPQA-Lite, reward 0.15β0.75 4d67629 verified Vikaspandey582003 commited on 12 days ago
update README: real measured eval β ECE 0.069β0.048, 100Q, 7 domains 5ec5406 verified Vikaspandey582003 commited on 12 days ago
update README: real results table + embed training_curves and baseline_vs_trained plots 62b8a17 verified Vikaspandey582003 commited on 12 days ago
update README with live Space URL, adapter link, real training status ea4745b verified Vikaspandey582003 commited on 12 days ago
feat: A10G-optimised GRPO config β 256 tokens, bf16, 300 samples ce66956 verified Vikaspandey582003 commited on 13 days ago
feat: sanity check cell, format-enforcing system prompt, auto HF Hub push after training 1bacd77 verified Vikaspandey582003 commited on 13 days ago
data: add 7534 real tasks cache (GSM8K, TriviaQA, ARC, SciQ, MedMCQA) 5be6f0f verified Vikaspandey582003 commited on 13 days ago
fix: remove unsloth/trl/bitsandbytes from Space requirements (training only, use Colab) 093e7af verified Vikaspandey582003 commited on 13 days ago
fix: pure FastAPI on port 7860 β all OpenEnv endpoints live + Gradio at /ui fc58aef verified Vikaspandey582003 commited on 13 days ago