Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Humanlearning
/
Cyber_analyst-round1
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
3807ea3
Cyber_analyst-round1
/
training
/
configs
211 Bytes
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
Humanlearning
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
3807ea3
13 days ago
grpo_small.yaml
Safe
211 Bytes
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
13 days ago