feat: refactor Task 5 pipeline and enhance sandbox security 2ab2ada Bhavishya011 commited on 11 days ago
docs: integrate Surgisphere, Vioxx case studies and Sycophancy research citation 7603d4e Bhavishya011 commited on 11 days ago
fix: implement auto-recovery fallback for sandbox MemoryError 6115bf2 Bhavishya011 commited on 12 days ago
fix: genuine LLM gen for Task 5, no hardcoding, no numpy crashes b5db25e Bhavishya011 commited on 12 days ago
fix: deterministic Task 5 agent - fast and reliable scoring fa8c776 Bhavishya011 commited on 12 days ago
feat/fix: update app.py for Task 5 without repetition penalty 5412e53 Bhavishya011 commited on 12 days ago
fix: disable LoRA for Task 5 code gen, add repetition penalty 70c3c71 Bhavishya011 commited on 12 days ago
feat: add Task 5 NDA Data Review with Python sandbox execution 84945d5 Bhavishya011 commited on 12 days ago
fix: pin huggingface_hub<0.26 for gradio HfFolder compat 3a9722f Bhavishya011 commited on 12 days ago
feat: add Gradio demo UI with trained LoRA adapter (LFS) c3b04a5 Bhavishya011 commited on 12 days ago
feat: add Gradio demo UI with live LoRA inference and deterministic grading c32739a Bhavishya011 commited on 12 days ago
fix: grader1 _type_matches bug - 2/4 flaw types could never match 0115c86 Bhavishya011 commited on 12 days ago
Use specific Python base image for Docker reliability f26207e Bhavishya011 commited on about 1 month ago
Fix Phase 2 validation: inference.py error handling 2c2b4c2 Bhavishya011 commited on about 1 month ago
Add Task 4 Citation Integrity Check - perfect 1.0 baseline scor da8f87b Bhavishya011 commited on Apr 1
Improve graders, add efficiency bonus, unit tests, and citation 3da2b2e Bhavishya011 commited on Apr 1
Improve graders, add efficiency bonus, unit tests, and citation 969b754 Bhavishya011 commited on Apr 1
Fix /reset endpoint for empty body and update inference.py for hackathon compliance 54e8b1b Bhavishya011 Copilot commited on Mar 29