Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Humanlearning
/
Cyber_analyst-round1
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
2eada22
Cyber_analyst-round1
/
tests
33.7 kB
Ctrl+K
Ctrl+K
1 contributor
History:
6 commits
Humanlearning
feat: add episode trace fingerprinting for improved trace logging and update reward penalties in GRPO configuration
2eada22
16 days ago
__init__.py
Safe
44 Bytes
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
16 days ago
helpers.py
Safe
2.57 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
16 days ago
test_anti_cheat.py
Safe
535 Bytes
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
16 days ago
test_closed_loop_runtime.py
3.49 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
16 days ago
test_invalid_actions.py
Safe
1.73 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
16 days ago
test_modal_scenario_cache_static.py
1.84 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
16 days ago
test_models.py
Safe
498 Bytes
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
16 days ago
test_reset_step_state.py
Safe
782 Bytes
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
16 days ago
test_reward_config.py
1.55 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
16 days ago
test_rewards.py
Safe
5.41 kB
feat: add episode trace fingerprinting for improved trace logging and update reward penalties in GRPO configuration
16 days ago
test_rollouts.py
Safe
898 Bytes
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
16 days ago
test_scenario_authoring_config.py
Safe
2.83 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
16 days ago
test_scenario_cache.py
Safe
5.36 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
16 days ago
test_seed_reproducibility.py
Safe
419 Bytes
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
16 days ago
test_trackio_utils.py
4.35 kB
feat: add episode trace fingerprinting for improved trace logging and update reward penalties in GRPO configuration
16 days ago
test_web_interface.py
Safe
1.43 kB
feat: implement RL environment server with training infrastructure and Modal integration
16 days ago