Model weights of paper "Uncovering the Redundancy in Transformers via a Unified Study of Layer Dropping (TMLR)".
AI & ML interests
Efficient and adaptive foundation models across language and multimodal intelligence.
Recent Activity
Papers
Demystifying When Pruning Works via Representation Hierarchies
Understanding and Harnessing Sparsity in Unified Multimodal Models