rhythm_env / training /inference_eval.py

Commit History

Fix prompt truncation in inference_eval.py: max_seq_length 768 -> 2048
1217c1d

InosLihka commited on

Fix max_new_tokens for CoT format + add eval-only HF Jobs script
b9c9b8f

InosLihka commited on

Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline
ece0bbe

InosLihka commited on

env: meta-RL refactor (continuous profiles, action+belief, adaptation grader)
ecbe0d8

InosLihka Claude Opus 4.7 (1M context) commited on

Rebuild as Life Simulator: 5 meters, 3 hidden profiles, GRPO training pipeline
cc6473a

InosLihka Claude Sonnet 4.6 commited on