Vision-Centric Activation and Coordination for Multimodal Large Language Models Paper • 2510.14349 • Published Oct 16, 2025
Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation Paper • 2511.05516 • Published Oct 26, 2025 • 12
TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders Paper • 2604.07340 • Published 9 days ago • 16