Humanlearning commited on
Commit
b2ee80d
·
1 Parent(s): 735ebfa

docs: add architecture and RL training flow diagrams to README and blog for improved clarity on system design

Browse files
Files changed (2) hide show
  1. README.md +2 -0
  2. blog/blog.md +8 -0
README.md CHANGED
@@ -27,6 +27,8 @@ The current implementation includes a functional closed-loop MVP scenario: an in
27
 
28
  ## Diagrams
29
 
 
 
30
  ![CyberSecurity_OWASP architecture](assets/architecture_diagram.svg)
31
 
32
  ![CyberSecurity_OWASP RL training flow](assets/env_rl_training_flow_diagram.svg)
 
27
 
28
  ## Diagrams
29
 
30
+ [Architecture diagram](assets/architecture_diagram.svg) | [RL training flow diagram](assets/env_rl_training_flow_diagram.svg)
31
+
32
  ![CyberSecurity_OWASP architecture](assets/architecture_diagram.svg)
33
 
34
  ![CyberSecurity_OWASP RL training flow](assets/env_rl_training_flow_diagram.svg)
blog/blog.md CHANGED
@@ -52,6 +52,14 @@ The current MVP focuses on generated FastAPI-style invoice applications with inj
52
 
53
  This is not a static multiple-choice benchmark. It is an interactive environment with tools, state, hidden checks, and reward feedback.
54
 
 
 
 
 
 
 
 
 
55
  The agent can use tools such as:
56
 
57
  ```text
 
52
 
53
  This is not a static multiple-choice benchmark. It is an interactive environment with tools, state, hidden checks, and reward feedback.
54
 
55
+ ## Architecture and Training Flow
56
+
57
+ [Architecture diagram](../assets/architecture_diagram.svg) | [RL training flow diagram](../assets/env_rl_training_flow_diagram.svg)
58
+
59
+ ![CyberSecurity_OWASP architecture](../assets/architecture_diagram.svg)
60
+
61
+ ![CyberSecurity_OWASP RL training flow](../assets/env_rl_training_flow_diagram.svg)
62
+
63
  The agent can use tools such as:
64
 
65
  ```text