Commit History

docs: final structural reorganization
eff945c

Bhavishya011 committed on

docs: mention LoRA weights availability
a14d278

Bhavishya011 committed on

docs: add training logs and update baseline source
6ffb67d

Bhavishya011 committed on

docs: add real agent trace and sandbox image
39c1ef8

Bhavishya011 committed on

feat: refactor Task 5 pipeline and enhance sandbox security
2ab2ada

Bhavishya011 committed on

Update docs/HFBlogPost.md
487ceef
verified

Nexus18 committed on

Update docs/HFBlogPost.md
4df966d
verified

Nexus18 committed on

docs: add testing and verification section
420fc43

Bhavishya011 committed on

feat: enhance sandbox security and update docs
00be1e9

Bhavishya011 committed on

docs: integrate Surgisphere, Vioxx case studies and Sycophancy research citation
7603d4e

Bhavishya011 committed on

docs: sync updated architecture diagram
513f538

Bhavishya011 committed on

docs: add concrete statistics and cite arXiv:2601.19100
9d64412

Bhavishya011 committed on

docs: sync new combined reward and loss plots
696431a

Bhavishya011 committed on

docs: sync updated reward and loss curves
504dfd6

Bhavishya011 committed on

docs: add baseline vs trained comparison plot
35aa364

Bhavishya011 committed on

fix: restore HF Space YAML config block
7d70dd2

Bhavishya011 committed on

docs: sync readme, blog, and images to hf
ae4205e

Bhavishya011 committed on

fix: implement auto-recovery fallback for sandbox MemoryError
6115bf2

Bhavishya011 committed on

fix: enforce csv module and robust keyword extraction
ce756dd

Bhavishya011 committed on

fix: genuine LLM gen for Task 5, no hardcoding, no numpy crashes
b5db25e

Bhavishya011 committed on

fix: deterministic Task 5 agent - fast and reliable scoring
fa8c776

Bhavishya011 committed on

feat: add session audit history table to Gradio UI
04240b8

Bhavishya011 committed on

feat/fix: update app.py for Task 5 without repetition penalty
5412e53

Bhavishya011 committed on

fix: disable LoRA for Task 5 code gen, add repetition penalty
70c3c71

Bhavishya011 committed on

feat: add Task 5 NDA Data Review with Python sandbox execution
84945d5

Bhavishya011 committed on

fix: change fastapi root to /api so gradio can use /
aedaafb

Bhavishya011 committed on

fix: pin huggingface_hub<0.26 for gradio HfFolder compat
3a9722f

Bhavishya011 committed on

fix: upgrade gradio to 5.x for huggingface_hub compat
7203323

Bhavishya011 committed on

feat: add Gradio demo UI with trained LoRA adapter (LFS)
c3b04a5

Bhavishya011 committed on

feat: add Gradio demo UI with live LoRA inference and deterministic grading
c32739a

Bhavishya011 committed on

fix: grader1 _type_matches bug - 2/4 flaw types could never match
0115c86

Bhavishya011 committed on

refactor: PeerGuard clinical trial verification system
8122ba9

Bhavishya011 committed on

Revert to standard python:3.11-slim
7c93d4f

Bhavishya011 committed on

Use specific Python base image for Docker reliability
f26207e

Bhavishya011 committed on

Fix grader scores to be strictly between 0 and 1
660af4f

Bhavishya011 committed on

Fix structured output format with brackets
9898283

Bhavishya011 committed on

Fix OpenAI client proxy parameter conflict
ba38857

Bhavishya011 committed on

Fix Phase 2 validation: inference.py error handling
2c2b4c2

Bhavishya011 committed on

Final OpenEnv submission - all requirements
7f1c570

Bhavishya011 committed on

Final OpenEnv submission - all requirements met
3dccef3

Bhavishya011 committed on

Ready for submission: HF Router + OpenEnv validated
c0f7fc8

Bhavishya011 committed on

Add Task 4 Citation Integrity Check - perfect 1.0 baseline score
da8f87b

Bhavishya011 committed on

Improve graders, add efficiency bonus, unit tests, and citation
3da2b2e

Bhavishya011 committed on

Improve graders, add efficiency bonus, unit tests, and citation
969b754

Bhavishya011 committed on

Add OpenEnv multi-mode deployment requirements
8c393a0

Bhavishya011 committed on

Add pyproject.toml for OpenEnv multi-mode deployment
1703402

Bhavishya011 committed on

Fix /reset endpoint for empty body and update inference.py for hackathon compliance
54e8b1b

Bhavishya011 Copilot committed on

fix: use Request to properly handle empty/missing POST body
b836ee9

Bhavishya011 committed on

fix: properly handle empty POST body on reset
cc700fe

Bhavishya011 committed on

add root endpoint for better UX
29a2e8e

Bhavishya011 committed on