- MuLan: A Joint Embedding of Music Audio and Natural Language
  Paper • 2208.12415 • Published
- CoCa: Contrastive Captioners are Image-Text Foundation Models
  Paper • 2205.01917 • Published • 3
- SoundStream: An End-to-End Neural Audio Codec
  Paper • 2107.03312 • Published
- Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music
  Paper • 2604.10905 • Published • 27
Collections
Collections including paper arxiv:2205.01917
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
  Paper • 2403.05135 • Published • 45
- DeepSeek-VL: Towards Real-World Vision-Language Understanding
  Paper • 2403.05525 • Published • 49
- CoCa: Contrastive Captioners are Image-Text Foundation Models
  Paper • 2205.01917 • Published • 3

- Measuring the Effects of Data Parallelism on Neural Network Training
  Paper • 1811.03600 • Published • 2
- Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
  Paper • 1804.04235 • Published • 2
- EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
  Paper • 1905.11946 • Published • 4
- Yi: Open Foundation Models by 01.AI
  Paper • 2403.04652 • Published • 65