File size: 512 Bytes
9e64e71 | 1 2 3 4 | # Learnings - Testing
- Add a focused trajectory-shape unit test that asserts the final two turns are `tool` then content-only `assistant` (no `tool_calls`) and keep a separate low-reward test proving incorrect trajectories are still filtered to `None`. *(F014)*
- Keep notebook and pipeline smoke tests aligned by asserting stable training-flow anchors (for example `before_rollouts = sample_random_baseline` appears before `run_training_with_metrics(trainer)`) whenever notebook cells are refactored. *(F015)*
|