Spaces:
Sleeping
Sleeping
Commit ·
b2ee80d
1
Parent(s): 735ebfa
docs: add architecture and RL training flow diagrams to README and blog for improved clarity on system design
Browse files- README.md +2 -0
- blog/blog.md +8 -0
README.md
CHANGED
|
@@ -27,6 +27,8 @@ The current implementation includes a functional closed-loop MVP scenario: an in
|
|
| 27 |
|
| 28 |
## Diagrams
|
| 29 |
|
|
|
|
|
|
|
| 30 |

|
| 31 |
|
| 32 |

|
|
|
|
| 27 |
|
| 28 |
## Diagrams
|
| 29 |
|
| 30 |
+
[Architecture diagram](assets/architecture_diagram.svg) | [RL training flow diagram](assets/env_rl_training_flow_diagram.svg)
|
| 31 |
+
|
| 32 |

|
| 33 |
|
| 34 |

|
blog/blog.md
CHANGED
|
@@ -52,6 +52,14 @@ The current MVP focuses on generated FastAPI-style invoice applications with inj
|
|
| 52 |
|
| 53 |
This is not a static multiple-choice benchmark. It is an interactive environment with tools, state, hidden checks, and reward feedback.
|
| 54 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 55 |
The agent can use tools such as:
|
| 56 |
|
| 57 |
```text
|
|
|
|
| 52 |
|
| 53 |
This is not a static multiple-choice benchmark. It is an interactive environment with tools, state, hidden checks, and reward feedback.
|
| 54 |
|
| 55 |
+
## Architecture and Training Flow
|
| 56 |
+
|
| 57 |
+
[Architecture diagram](../assets/architecture_diagram.svg) | [RL training flow diagram](../assets/env_rl_training_flow_diagram.svg)
|
| 58 |
+
|
| 59 |
+

|
| 60 |
+
|
| 61 |
+

|
| 62 |
+
|
| 63 |
The agent can use tools such as:
|
| 64 |
|
| 65 |
```text
|