Spaces: Humanlearning/Cyber_analyst-round1
Cyber_analyst-round1
1.27 MB
1 contributor
History: 13 commits
Humanlearning
feat: add episode trace fingerprinting for improved trace logging and update reward penalties in GRPO configuration
2eada22
13 days ago
.agents
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
.codex
feat: integrate Trackio for experiment tracking, add GRPO training support, and deploy web-based monitoring tools
13 days ago
assets
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
configs
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
scripts
feat: add episode trace fingerprinting for improved trace logging and update reward penalties in GRPO configuration
13 days ago
server
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
tests
feat: add episode trace fingerprinting for improved trace logging and update reward penalties in GRPO configuration
13 days ago
training
feat: add episode trace fingerprinting for improved trace logging and update reward penalties in GRPO configuration
13 days ago
.dockerignore
Safe
100 Bytes
feat: implement core RL training infrastructure and architecture documentation
13 days ago
.gitignore
Safe
82 Bytes
feat: integrate Trackio for experiment tracking and add Modal training infrastructure with environment and test utilities.
13 days ago
.hfignore
Safe
163 Bytes
feat: implement core RL training infrastructure and architecture documentation
13 days ago
00_PROJECT_BRIEF.md
Safe
6.97 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
01_ARCHITECTURE.md
Safe
20 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
AGENTS.md
36.3 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
Dockerfile
Safe
927 Bytes
feat: integrate Trackio for experiment tracking and add Modal training infrastructure with environment and test utilities.
13 days ago
README.md
11.7 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
__init__.py
Safe
594 Bytes
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
14 days ago
bug_mutator.py
Safe
692 Bytes
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
14 days ago
client.py
Safe
1.26 kB
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
14 days ago
config.py
Safe
7.72 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
evals.py
Safe
2.91 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
fixture_generator.py
Safe
537 Bytes
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
14 days ago
models.py
3.85 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
openenv.yaml
Safe
103 Bytes
Initial commit
14 days ago
policy_graph.py
Safe
3.48 kB
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
14 days ago
pyproject.toml
Safe
1.78 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
reward_config.py
3.99 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
rewards.py
Safe
15.7 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
safety.py
Safe
420 Bytes
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
14 days ago
scenario_compiler.py
Safe
680 Bytes
feat: implement RL environment server with training infrastructure and Modal integration
13 days ago
template_renderer.py
Safe
2.86 kB
feat: implement core RL training infrastructure, including GRPO training, evaluation utilities, custom environments, and Modal-based execution scripts.
14 days ago
uv.lock
Safe
817 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago
validators.py
Safe
10.7 kB
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment
13 days ago