feat: enhance reward configuration management with new logging functions, add parallel Modal training guidelines to documentation, and improve reward config hashing for deterministic behavior 0e7f59c Humanlearning commited on 12 days ago
feat: enhance scenario authoring and caching mechanisms, update action submission terminology, and improve reward configuration for CyberSecurity_OWASP environment be8eade Humanlearning commited on 12 days ago
feat: add cybersecurity-owasp-trainer skill with reference notes and update AGENTS.md documentation 28685f3 Humanlearning commited on 13 days ago
docs: add initial project brief and architecture documentation for CyberSecurity_OWASP environment 06bfd31 Humanlearning commited on 13 days ago