Beyond Transcripts: A Renewed Perspective on Audio Chaptering Paper • 2602.08979 • Published Feb 9
Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024 Paper • 2406.16777 • Published Jun 24, 2024
KIT's Offline Speech Translation and Instruction Following Submission for IWSLT 2025 Paper • 2505.13036 • Published May 19, 2025
ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition Paper • 2506.04635 • Published Jun 5, 2025