Lost in Backpropagation: The LM Head is a Gradient Bottleneck • Paper 2603.10145 • Published Mar 10
AI Paper of the Day • Collection • A collection of papers that I think are interesting, one added each day • 636 items • Updated 6 days ago