facebook/metaclip-h14-fullcc2.5b Zero-Shot Image Classification • 1.0B • Updated Jan 11, 2024 • 9.47k • 49
Audio-Reasoner: Improving Reasoning Capability in Large Audio Language Models Paper • 2503.02318 • Published Mar 4, 2025 • 2
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities Paper • 2503.03983 • Published Mar 6, 2025 • 27