Submitted by akhaliq 134 MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases · 12 authors 1.42k 13
Submitted by akhaliq 20 ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition · 4 authors 84 6
Submitted by akhaliq 19 Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models · 3 authors 56 6
Submitted by akhaliq 18 AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning · 16 authors 3
Submitted by akhaliq 15 API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs · 10 authors 22 3
Submitted by akhaliq 14 Seamless Human Motion Composition with Blended Positional Encodings · 3 authors 273 1