docs: updating readme with state changes and test
README.md
CHANGED
@@ -308,6 +308,18 @@ Stdout will stream `[START]` / `[STEP]` / `[END]` lines per episode. Internal lo
 
 ---
 
+### 6. Verify environment state (optional)
+
+To confirm the state endpoint is working correctly - episode tracking, inspection order, required sources:
+
+```bash
+uv run python test_state.py
+```
+
+Runs a single hard episode and calls `state()` after every action. Prints `OK` or `FAIL` per checkpoint.
+
+---
+
 ### Local Agent - No API Key Required
 
 To smoke test the full pipeline without calling an external LLM, you can run inference with a local model via [Ollama](https://ollama.com/).
@@ -346,8 +358,9 @@ USE_LOCAL=true uv run python inference.py
 ```
 WhyDidItFail/
 ├── inference.py       # Baseline inference script
+├── test_state.py      # State endpoint verification script
 ├── client.py          # WhyDidItFailEnv client (WebSocket)
-├── models.py          # Action and
+├── models.py          # Action, Observation, and State Pydantic models
 ├── openenv.yaml       # OpenEnv manifest
 ├── Dockerfile         # Container image
 ├── server/
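The check described in the added README section (call `state()` after every action and print `OK` or `FAIL` per checkpoint) could look roughly like the sketch below. This is not the contents of `test_state.py`, which the diff does not show. `FakeEnv`, `FakeState`, the field names, and the source names `logs`, `config`, and `gradients` are all hypothetical stand-ins, assumed only so the example runs without the real server or client.

```python
from dataclasses import dataclass, field


@dataclass
class FakeState:
    """Hypothetical state snapshot; the real State model in models.py may differ."""
    episode_id: int = 0
    step_count: int = 0
    inspection_order: list[str] = field(default_factory=list)
    # Assumed source names, for illustration only.
    required_sources: tuple[str, ...] = ("logs", "config", "gradients")


class FakeEnv:
    """In-memory stand-in for the environment client (not the real WhyDidItFailEnv)."""

    def __init__(self) -> None:
        self._state = FakeState()

    def reset(self) -> None:
        # Start a new episode with a fresh state and the next episode id.
        self._state = FakeState(episode_id=self._state.episode_id + 1)

    def step(self, source: str) -> None:
        # Record one inspection action.
        self._state.step_count += 1
        self._state.inspection_order.append(source)

    def state(self) -> FakeState:
        return self._state


def check(label: str, condition: bool) -> bool:
    """Print OK or FAIL for one checkpoint and return the result."""
    print(f"{'OK' if condition else 'FAIL'}  {label}")
    return condition


def verify_state(env: FakeEnv, actions: list[str]) -> bool:
    """Run one episode, checking the reported state after every action."""
    env.reset()
    results = [check("episode started", env.state().step_count == 0)]
    for i, source in enumerate(actions, start=1):
        env.step(source)
        s = env.state()
        results.append(check(f"step {i}: step count", s.step_count == i))
        results.append(
            check(f"step {i}: inspection order", s.inspection_order == actions[:i])
        )
    missing = set(env.state().required_sources) - set(actions)
    results.append(check("all required sources inspected", not missing))
    return all(results)


if __name__ == "__main__":
    verify_state(FakeEnv(), ["logs", "config", "gradients"])
```

Checking after each action, not only at the end of the episode, is what would catch a state endpoint that lags one step behind or reorders inspections.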