Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering Paper • 2604.08224 • Published 8 days ago • 49
Do Audio-Visual Large Language Models Really See and Hear? Paper • 2604.02605 • Published 14 days ago • 7
Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models Paper • 2603.25750 • Published 27 days ago • 36