Space: anugrahhu/cernenv-trainer

Commit History

sft+reward-fix: space/training/app.py

c2c4674
verified

anugrahhu committed 13 days ago

sft+reward-fix: training/training_script.py

2b97998
verified

anugrahhu committed 13 days ago

sft+reward-fix: training/sft_warmstart.py

a8d4d87
verified

anugrahhu committed 13 days ago

sft+reward-fix: tests/test_reward_hacking.py

a7acc5f
verified

anugrahhu committed 13 days ago

sft+reward-fix: server/environment.py

70b06db
verified

anugrahhu committed 13 days ago

sft+reward-fix: server/rewards/reward_function.py

d91fe20
verified

anugrahhu committed 13 days ago

fix: coerce beam_energy to str so CollisionObservation pydantic check accepts numeric LLM outputs

7df4308
verified

anugrahhu committed 13 days ago

vanilla GRPO: backport EvidenceCallback for live evidence/*.csv + plots

11307a1
verified

anugrahhu committed 13 days ago

dashboard: synthesize PNGs on demand + cache-bust + pass --evidence_dir to vanilla

3080a66
verified

anugrahhu committed 13 days ago

fix: peft 0.18 -> 0.13.2 to match transformers 4.51.3 (vanilla path)

eb2a494
verified

anugrahhu committed 13 days ago

fix(deps): peft 0.18.0 -> 0.13.2 to match transformers 4.51.3

3495767
verified

anugrahhu committed 13 days ago

fix: switch trainer Space to vanilla GRPO path

c92f127
verified

anugrahhu committed 13 days ago

fix: switch trainer Space to vanilla GRPO path

30adf48
verified

anugrahhu committed 13 days ago

fix: disable fast_inference (vLLM not installed) in training/evaluate.py

8f805e2
verified

anugrahhu committed 13 days ago

fix: disable fast_inference (vLLM not installed) in training/training_unsloth.py

f82f913
verified

anugrahhu committed 13 days ago

Update CERNenv Space

0a6c641
verified

anugrahhu committed 13 days ago

initial commit

b60c252
verified

anugrahhu committed 13 days ago
