-
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities
Paper • 2402.01831 • Published • 17 -
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities
Paper • 2503.03983 • Published • 28 -
Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models
Paper • 2507.08128 • Published • 14 -
Jamendo-QA: A Large-Scale Music Question Answering Dataset
Paper • 2509.15662 • Published • 1
Collections
Discover the best community collections!
Collections including paper arxiv:2402.01831
-
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling
Paper • 2311.00430 • Published • 56 -
MSTRE-Net: Multistreaming Acoustic Modeling for Automatic Lyrics Transcription
Paper • 2108.02625 • Published • 1 -
FLAP: Fast Language-Audio Pre-training
Paper • 2311.01615 • Published • 16 -
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities
Paper • 2402.01831 • Published • 17
-
NExT-GPT: Any-to-Any Multimodal LLM
Paper • 2309.05519 • Published • 79 -
Large Language Model for Science: A Study on P vs. NP
Paper • 2309.05689 • Published • 22 -
AstroLLaMA: Towards Specialized Foundation Models in Astronomy
Paper • 2309.06126 • Published • 18 -
Large Language Models for Compiler Optimization
Paper • 2309.07062 • Published • 25
-
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities
Paper • 2402.01831 • Published • 17 -
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities
Paper • 2503.03983 • Published • 28 -
Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models
Paper • 2507.08128 • Published • 14 -
Jamendo-QA: A Large-Scale Music Question Answering Dataset
Paper • 2509.15662 • Published • 1
-
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling
Paper • 2311.00430 • Published • 56 -
MSTRE-Net: Multistreaming Acoustic Modeling for Automatic Lyrics Transcription
Paper • 2108.02625 • Published • 1 -
FLAP: Fast Language-Audio Pre-training
Paper • 2311.01615 • Published • 16 -
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities
Paper • 2402.01831 • Published • 17
-
NExT-GPT: Any-to-Any Multimodal LLM
Paper • 2309.05519 • Published • 79 -
Large Language Model for Science: A Study on P vs. NP
Paper • 2309.05689 • Published • 22 -
AstroLLaMA: Towards Specialized Foundation Models in Astronomy
Paper • 2309.06126 • Published • 18 -
Large Language Models for Compiler Optimization
Paper • 2309.07062 • Published • 25