Spaces:

agentDebugger
/

AgentDebugger-training-v3

Running

App Files Files Community

AgentDebugger-training-v3

Commit History

Optimize for Kaggle P100: float16, batch=1, grad_accum=8, num_gen=4, max_completion=256, lora_r=8

73f957d

shank commited on 13 days ago

Fix GRPOConfig: rename max_new_tokens to max_completion_length for trl==0.14.0

8b16369

shank commited on 13 days ago

Update: Added testing

a5c67b3

shank commited on 13 days ago

Align gradio version with Hugging Face Space builder2

633a3b7

shank commited on 13 days ago

Add dockerignore to reduce Space build context

c945597

shank commited on 13 days ago

Stabilize Space runtime: pin ML deps and disable runtime package drift

663b8db

shank commited on 13 days ago

Pin torch to cu121 build + use model.device instead of hardcoded cuda string

8f291e0

shank commited on 13 days ago

Replace unsloth with bitsandbytes+peft: fixes CUDA driver incompatibility on HF A100

c325ad7

shank commited on 13 days ago

Fix Gradio 4.x every= deprecation: use gr.Timer for auto-refresh

5eea2dd

shank commited on 13 days ago

Reduce training to 500 steps with tightened curriculum for A10G budget

ba8df98

shank commited on 13 days ago

Add Gradio training monitor and fix subprocess python path

b37b2eb

shank commited on 13 days ago

Fix eval device selection with CUDA-safe fallback

dc8001b

shank commited on 13 days ago

Add interactive Gradio demo at /demo in env Space

4bac574

shank commited on 13 days ago

Optimize for A100 80GB: 8 generations, batch 4, lr 2e-5, dense logging

2b1fbf3

shank commited on 13 days ago

Restore full 1000-step training with original curriculum

1128de1

shank commited on 13 days ago

Reduce training to 500 steps with tightened curriculum for A10G budget

3152fa9

shank commited on 13 days ago

"Update: Minor fixes"

755a07d

shank commited on 13 days ago

Add Gradio training monitor and fix subprocess python path

b92ad01

shank commited on 13 days ago

Curated the bugs dataset

85f14d3

PulipatiPranav commited on 13 days ago

Resolved README Merge conflicts

a693c08

PulipatiPranav commited on 13 days ago

Update: Started making changes for the hackathon

a55c81d

shank commited on 13 days ago

Updated README.md

a849e43

Pranav Pulipati commited on 25 days ago

Update: Refined and validated values

6cca39d

shank commited on 29 days ago

Update: Even more Final README.md update

5c507c3

shank commited on 29 days ago

Update: Final README.md update

3548cd0

shank commited on 29 days ago

Update: Final README.md update

4057375

shank commited on 29 days ago

Fix: Changed environment variables and added validator

e93446d

shank commited on 29 days ago

Fix: Final submission cleanup, unified identity and integrity markers

8807d25

shank commited on about 1 month ago

Update: Made refinements to the project

159a5fa

shank commited on about 1 month ago

Fix: Revise README for improved clarity and detail

1c8aca2

Shashaank commited on about 1 month ago

Update: README

e4f09cc

Shashaank commited on about 1 month ago

Fix: Changed scores range

0769caa

shank commited on Apr 7

Fix: Changed dependencies

ea7105c

shank commited on Apr 7

Fix: Change in gitignore

cd3a400

shank commited on Apr 7

Fix: Remove gitignore from commits

62a3a90

shank commited on Apr 7

Fix: Formatted the output

0181886

shank commited on Apr 7

Update: Dockerfile and inference.py

212d2d9

shank commited on Apr 7

Fix: precaution to prevent infinite loop

cd968e7

shank commited on Apr 7

Fix: Fixed exception handling in inference.py

2a482a5

shank commited on Apr 7

docs: updated the readme and added a license

a2ff803

shank commited on Apr 6

docs: updated the readme and added a license

22cb7e7

shank commited on Apr 6

fix: score floor for medium grader, add root and tasks endpoints

b658e10

shank commited on Apr 6

Cleaner code and logic improvement

ee08016

shank commited on Apr 6

docs: final professional polish and code sanitization

9940e16

shank commited on Apr 6

Add server/app.py entry point for OpenEnv validation

ade347f

shank commited on Apr 6

chore: include lockfile for reproducible deployment

8b39a8b

shank commited on Apr 6

Update pyproject.toml for OpenEnv validation

ca23d0f

shank commited on Apr 6

Add pyproject.toml for OpenEnv validation

d765986

shank commited on Apr 6

made changes to server.py

f2ee2fc

shank commited on Apr 6

deleted implementation plan

e766743

shank commited on Apr 6

Commit History

Optimize for Kaggle P100: float16, batch=1, grad_accum=8, num_gen=4, max_completion=256, lora_r=8 73f957d

Fix GRPOConfig: rename max_new_tokens to max_completion_length for trl==0.14.0 8b16369

Update: Added testing a5c67b3

Align gradio version with Hugging Face Space builder2 633a3b7

Add dockerignore to reduce Space build context c945597

Stabilize Space runtime: pin ML deps and disable runtime package drift 663b8db

Pin torch to cu121 build + use model.device instead of hardcoded cuda string 8f291e0

Replace unsloth with bitsandbytes+peft: fixes CUDA driver incompatibility on HF A100 c325ad7

Fix Gradio 4.x every= deprecation: use gr.Timer for auto-refresh 5eea2dd

Reduce training to 500 steps with tightened curriculum for A10G budget ba8df98

Add Gradio training monitor and fix subprocess python path b37b2eb

Fix eval device selection with CUDA-safe fallback dc8001b

Add interactive Gradio demo at /demo in env Space 4bac574

Optimize for A100 80GB: 8 generations, batch 4, lr 2e-5, dense logging 2b1fbf3

Restore full 1000-step training with original curriculum 1128de1

Reduce training to 500 steps with tightened curriculum for A10G budget 3152fa9

"Update: Minor fixes" 755a07d

Add Gradio training monitor and fix subprocess python path b92ad01

Curated the bugs dataset 85f14d3

Resolved README Merge conflicts a693c08

Update: Started making changes for the hackathon a55c81d

Updated README.md a849e43

Update: Refined and validated values 6cca39d

Update: Even more Final README.md update 5c507c3

Update: Final README.md update 3548cd0

Update: Final README.md update 4057375

Fix: Changed environment variables and added validator e93446d

Fix: Final submission cleanup, unified identity and integrity markers 8807d25

Update: Made refinements to the project 159a5fa

Fix: Revise README for improved clarity and detail 1c8aca2

Update: README e4f09cc

Fix: Changed scores range 0769caa

Fix: Changed dependencies ea7105c

Fix: Change in gitignore cd3a400

Fix: Remove gitignore from commits 62a3a90

Fix: Formatted the output 0181886

Update: Dockerfile and inference.py 212d2d9

Fix: precaution to prevent infinite loop cd968e7

Fix: Fixed exception handling in inference.py 2a482a5

docs: updated the readme and added a license a2ff803

docs: updated the readme and added a license 22cb7e7

fix: score floor for medium grader, add root and tasks endpoints b658e10

Cleaner code and logic improvement ee08016

docs: final professional polish and code sanitization 9940e16

Add server/app.py entry point for OpenEnv validation ade347f

chore: include lockfile for reproducible deployment 8b39a8b

Update pyproject.toml for OpenEnv validation ca23d0f

Add pyproject.toml for OpenEnv validation d765986

made changes to server.py f2ee2fc

deleted implementation plan e766743

Optimize for Kaggle P100: float16, batch=1, grad_accum=8, num_gen=4, max_completion=256, lora_r=8

73f957d

Fix GRPOConfig: rename max_new_tokens to max_completion_length for trl==0.14.0

8b16369

Update: Added testing

a5c67b3

Align gradio version with Hugging Face Space builder2

633a3b7

Add dockerignore to reduce Space build context

c945597

Stabilize Space runtime: pin ML deps and disable runtime package drift

663b8db

Pin torch to cu121 build + use model.device instead of hardcoded cuda string

8f291e0

Replace unsloth with bitsandbytes+peft: fixes CUDA driver incompatibility on HF A100

c325ad7

Fix Gradio 4.x every= deprecation: use gr.Timer for auto-refresh

5eea2dd

Reduce training to 500 steps with tightened curriculum for A10G budget

ba8df98

Add Gradio training monitor and fix subprocess python path

b37b2eb

Fix eval device selection with CUDA-safe fallback

dc8001b

Add interactive Gradio demo at /demo in env Space

4bac574

Optimize for A100 80GB: 8 generations, batch 4, lr 2e-5, dense logging

2b1fbf3

Restore full 1000-step training with original curriculum

1128de1

Reduce training to 500 steps with tightened curriculum for A10G budget

3152fa9

"Update: Minor fixes"

755a07d

Add Gradio training monitor and fix subprocess python path

b92ad01

Curated the bugs dataset

85f14d3

Resolved README Merge conflicts

a693c08

Update: Started making changes for the hackathon

a55c81d

Updated README.md

a849e43

Update: Refined and validated values

6cca39d

Update: Even more Final README.md update

5c507c3

Update: Final README.md update

3548cd0

Update: Final README.md update

4057375

Fix: Changed environment variables and added validator

e93446d

Fix: Final submission cleanup, unified identity and integrity markers

8807d25

Update: Made refinements to the project

159a5fa

Fix: Revise README for improved clarity and detail

1c8aca2

Update: README

e4f09cc

Fix: Changed scores range

0769caa

Fix: Changed dependencies

ea7105c

Fix: Change in gitignore

cd3a400

Fix: Remove gitignore from commits

62a3a90

Fix: Formatted the output

0181886

Update: Dockerfile and inference.py

212d2d9

Fix: precaution to prevent infinite loop

cd968e7

Fix: Fixed exception handling in inference.py

2a482a5

docs: updated the readme and added a license

a2ff803

docs: updated the readme and added a license

22cb7e7

fix: score floor for medium grader, add root and tasks endpoints

b658e10

Cleaner code and logic improvement

ee08016

docs: final professional polish and code sanitization

9940e16

Add server/app.py entry point for OpenEnv validation

ade347f

chore: include lockfile for reproducible deployment

8b39a8b

Update pyproject.toml for OpenEnv validation

ca23d0f

Add pyproject.toml for OpenEnv validation

d765986

made changes to server.py

f2ee2fc

deleted implementation plan

e766743