Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Humanlearning
/
Cyber_analyst-round1
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
Cyber_analyst-round1
/
training
/
configs
11.4 kB
Ctrl+K
Ctrl+K
1 contributor
History:
7 commits
Humanlearning
feat: introduce GRPO GPU fallback support, enhance training script with warmstart tagging, and add learning rate parameter for improved training flexibility
1b6d30b
12 days ago
reward_ablations
feat: introduce reward ablation configurations for enhanced training flexibility, implement YAML loading with extends support, and add reward variant tracking in training scripts
12 days ago
grpo_small.yaml
Safe
5.21 kB
feat: update README with GPU-utilization tuning instructions, enhance modal training script with run name parameter, and modify GRPO configuration for trace logging and vLLM settings
12 days ago
sft_warmstart_fast.yaml
Safe
3.72 kB
feat: introduce GRPO GPU fallback support, enhance training script with warmstart tagging, and add learning rate parameter for improved training flexibility
12 days ago