Post-deadline: full eval results + bigger plots via Git LFS d64efa6 InosLihka committed 11 days ago
results.md: tighten language, present results without internal-process noise d51061f InosLihka committed 12 days ago
Fix max_new_tokens for CoT format + add eval-only HF Jobs script b9c9b8f InosLihka committed 12 days ago
tooling: scripts/analyze_iter.py + docs/results.md template d6d9e31 InosLihka Claude Opus 4.7 (1M context) committed 12 days ago