TransMLA: Multi-head Latent Attention Is All You Need Paper • 2502.07864 • Published Feb 11, 2025 • 69
HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention Paper • 2603.28458 • Published 15 days ago • 42
Meta Pruning via Graph Metanetworks: A Meta Learning Framework for Network Pruning Paper • 2506.12041 • Published May 24, 2025 • 2
SHINE: A Scalable In-Context Hypernetwork for Mapping Context to LoRA in a Single Pass Paper • 2602.06358 • Published Feb 6 • 1
SHINE Collection Models and datasets for paper "SHINE: A Scalable In-Context Hypernetwork for Mapping Context to LoRA in a Single Pass" • 6 items • Updated Feb 5 • 1