Learnings - Integrations
- Keep the `training` optional dependency pinned to `trl>=0.14.0,<0.15.0` while the core runtime uses `torch==2.2.2`, because newer TRL releases can break import-time compatibility (for example, `FSDPModule` import failures). (F006)
- Rollout generation must handle both callable HF tokenizers and non-callable notebook/test tokenizers by branching the tokenization flow and falling back to shape-agnostic decode extraction from `model.generate` outputs. (F015)
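
The tokenizer branching and shape-agnostic extraction described in F015 could look roughly like the sketch below. It is a minimal illustration, not the project's actual code: the helper names (`encode_prompt`, `first_sequence`, `decode_completion`) and the two stub tokenizer classes are hypothetical, and it assumes a non-callable tokenizer exposes `encode` and `decode` methods and that the generated ids begin with the prompt ids (as with decoder-only models).

```python
from typing import Any, List


def first_sequence(output: Any) -> List[int]:
    """Extract the first token-id sequence from a generate-style output.

    Accepts an object with a `.sequences` attribute, a tensor/array
    (anything with `.tolist()`), or plain nested or flat lists.
    """
    if hasattr(output, "sequences"):
        output = output.sequences
    data = output.tolist() if hasattr(output, "tolist") else output
    # Peel off leading batch dimensions until we reach a flat list of ids.
    while data and isinstance(data[0], (list, tuple)):
        data = data[0]
    return [int(t) for t in data]


def encode_prompt(tokenizer: Any, prompt: str) -> List[int]:
    """Tokenize a prompt with either a callable or a non-callable tokenizer."""
    if callable(tokenizer):
        # HF-style path: calling the tokenizer returns a mapping with "input_ids".
        ids = tokenizer(prompt)["input_ids"]
    else:
        # Assumption: notebook/test tokenizers only provide .encode(text).
        ids = tokenizer.encode(prompt)
    return first_sequence(ids)


def decode_completion(tokenizer: Any, output: Any, prompt_len: int) -> str:
    """Decode only the newly generated tokens, whatever the output shape."""
    ids = first_sequence(output)
    # Assumption: the output echoes the prompt, so strip it when present.
    completion = ids[prompt_len:] if len(ids) > prompt_len else ids
    return tokenizer.decode(completion)


class StubTokenizer:
    """Hypothetical non-callable tokenizer, as might appear in a test."""

    def encode(self, text: str) -> List[int]:
        return [ord(c) for c in text]

    def decode(self, ids: List[int]) -> str:
        return "".join(chr(i) for i in ids)


class CallableStubTokenizer(StubTokenizer):
    """Hypothetical callable tokenizer returning a batched "input_ids" mapping."""

    def __call__(self, text: str) -> dict:
        return {"input_ids": [self.encode(text)]}
```

Routing both the tokenizer result and the `model.generate` result through the same extraction helper keeps the callable and non-callable paths identical after tokenization, so a batched tensor, a flat list, or an output object with `.sequences` all decode the same way.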