Spaces:

anugrah55
/

opensleuth-training-gemini-cli

Paused

App Files Files Community

opensleuth-training-gemini-cli / __pycache__

11.7 kB

Ctrl+K

Ctrl+K

1 contributor

History: 1 commit

anugrah55's picture

Overhaul trainer: TRL GRPO with env-backed reward, Qwen2.5-0.5B 4bit+LoRA, slim PyTorch CUDA base, heartbeat HTTP for HF Spaces health probe

d597642 verified 13 days ago

train.cpython-313.pyc

11.7 kB
Overhaul trainer: TRL GRPO with env-backed reward, Qwen2.5-0.5B 4bit+LoRA, slim PyTorch CUDA base, heartbeat HTTP for HF Spaces health probe 13 days ago