LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws • Paper • 2605.23901 • Published 4 days ago • 8
Rethinking Cross-Layer Information Routing in Diffusion Transformers • Paper • 2605.20708 • Published 6 days ago • 92