adithya9903 committed on
Commit
90eae1e
·
verified ·
1 Parent(s): 8044cb6

Upload PolyGuard artifact: log command_complete

Files changed (1)
  1. outputs/logs/hf_training.log +2 -0
outputs/logs/hf_training.log CHANGED
@@ -20107,3 +20107,5 @@ Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00
20107
  Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 3.17it/s]
20108
  The following generation flags are not valid and may be ignored: ['temperature', 'top_p', 'top_k']. Set `TRANSFORMERS_VERBOSITY=info` for more details.
20109
  postsave_inference_ok
20110
+ $ python scripts/evaluate_policy_ablations.py --episodes 8 --checkpoint-dir checkpoints/sweeps/qwen-qwen2-5-3b-instruct --output outputs/reports/sweeps/qwen-qwen2-5-3b-instruct/grpo_ablation_report.json
20111
+ policy_ablations_done