Spaces:
Sleeping
Sleeping
Tanvi Bisht commited on
Fix image paths in README.md
Browse filesUpdated image paths in README to reflect new directory structure.
README.md
CHANGED
|
@@ -375,13 +375,13 @@ The reward function combines 5 components: workflow completion (0.30), rule comp
|
|
| 375 |
| C — Churn Risk Alert | 0.25 | ~0.48 | +0.23 |
|
| 376 |
| **Average** | **0.50** | **~0.68** | **+0.18** |
|
| 377 |
|
| 378 |
-

|
| 379 |
*Reward per training step*
|
| 380 |
|
| 381 |
-

|
| 382 |
*Per-workflow score comparison*
|
| 383 |
|
| 384 |
-

|
| 385 |
*Episode score distribution before and after GRPO*
|
| 386 |
|
| 387 |
---
|
|
|
|
| 375 |
| C — Churn Risk Alert | 0.25 | ~0.48 | +0.23 |
|
| 376 |
| **Average** | **0.50** | **~0.68** | **+0.18** |
|
| 377 |
|
| 378 |
+

|
| 379 |
*Reward per training step*
|
| 380 |
|
| 381 |
+

|
| 382 |
*Per-workflow score comparison*
|
| 383 |
|
| 384 |
+

|
| 385 |
*Episode score distribution before and after GRPO*
|
| 386 |
|
| 387 |
---
|