Collections
Discover the best community collections!
Collections including paper arxiv:2601.15621
-
Qwen3-TTS Technical Report
Paper • 2601.15621 • Published • 74 -
PaperBanana: Automating Academic Illustration for AI Scientists
Paper • 2601.23265 • Published • 224 -
Moonshine: Speech Recognition for Live Transcription and Voice Commands
Paper • 2410.15608 • Published • 12 -
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper • 2512.11253 • Published • 40
-
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience
Paper • 2601.15876 • Published • 92 -
LLM-in-Sandbox Elicits General Agentic Intelligence
Paper • 2601.16206 • Published • 86 -
Qwen3-TTS Technical Report
Paper • 2601.15621 • Published • 74 -
Learning to Discover at Test Time
Paper • 2601.16175 • Published • 44
-
Qwen3-TTS Technical Report
Paper • 2601.15621 • Published • 74 -
MemOS: A Memory OS for AI System
Paper • 2507.03724 • Published • 166 -
Self-Supervised Prompt Optimization
Paper • 2502.06855 • Published • 18 -
A decoder-only foundation model for time-series forecasting
Paper • 2310.10688 • Published • 28
-
Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities
Paper • 2503.04721 • Published • 4 -
Qwen3-TTS Technical Report
Paper • 2601.15621 • Published • 74 -
Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice
Text-to-Speech • 2B • Updated • 1.53M • 1.43k -
openbmb/AgentCPM-Report
Text Generation • 8B • Updated • 384 • 298
-
Qwen3-TTS Technical Report
Paper • 2601.15621 • Published • 74 -
MemOS: A Memory OS for AI System
Paper • 2507.03724 • Published • 166 -
Self-Supervised Prompt Optimization
Paper • 2502.06855 • Published • 18 -
A decoder-only foundation model for time-series forecasting
Paper • 2310.10688 • Published • 28
-
Qwen3-TTS Technical Report
Paper • 2601.15621 • Published • 74 -
PaperBanana: Automating Academic Illustration for AI Scientists
Paper • 2601.23265 • Published • 224 -
Moonshine: Speech Recognition for Live Transcription and Voice Commands
Paper • 2410.15608 • Published • 12 -
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper • 2512.11253 • Published • 40
-
Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities
Paper • 2503.04721 • Published • 4 -
Qwen3-TTS Technical Report
Paper • 2601.15621 • Published • 74 -
Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice
Text-to-Speech • 2B • Updated • 1.53M • 1.43k -
openbmb/AgentCPM-Report
Text Generation • 8B • Updated • 384 • 298
-
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience
Paper • 2601.15876 • Published • 92 -
LLM-in-Sandbox Elicits General Agentic Intelligence
Paper • 2601.16206 • Published • 86 -
Qwen3-TTS Technical Report
Paper • 2601.15621 • Published • 74 -
Learning to Discover at Test Time
Paper • 2601.16175 • Published • 44