Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
Humanlearning
/
Cyber_analyst-round1
Sleeping

App Files Files Community
Fetching metadata from the HF Docker repository...
Cyber_analyst-round1 / training
12.7 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
Humanlearning's picture
Humanlearning
feat: integrate Trackio for experiment tracking and add Modal training infrastructure with environment and test utilities.
4e663d8 13 days ago
  • configs
    feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts. 13 days ago
  • eval_before_after.py
    2.01 kB
    feat: integrate Trackio for experiment tracking and add Modal training infrastructure with environment and test utilities. 13 days ago
  • reward_funcs.py
    711 Bytes
    feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts. 13 days ago
  • rollout.py
    3.32 kB
    feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts. 13 days ago
  • trackio_utils.py
    4.79 kB
    feat: integrate Trackio for experiment tracking and add Modal training infrastructure with environment and test utilities. 13 days ago
  • train_grpo.py
    1.63 kB
    feat: integrate Trackio for experiment tracking and add Modal training infrastructure with environment and test utilities. 13 days ago