Spaces:

hjerpe
/

sql_env

Running

Upload folder using huggingface_hub

9e64e71 verified 4 days ago

512 Bytes

Learnings - Testing

Add a focused trajectory-shape unit test that asserts the final two turns are tool then content-only assistant (no tool_calls) and keep a separate low-reward test proving incorrect trajectories are still filtered to None. (F014)
Keep notebook and pipeline smoke tests aligned by asserting stable training-flow anchors (for example before_rollouts = sample_random_baseline appears before run_training_with_metrics(trainer)) whenever notebook cells are refactored. (F015)