Polyglot Teachers: Evaluating Language Models for Multilingual Synthetic Data Generation Paper • 2604.11290 • Published 6 days ago • 2
Polyglot Teachers: Evaluating Language Models for Multilingual Synthetic Data Generation Paper • 2604.11290 • Published 6 days ago • 2
M-RewardBench (ACL 2025) Collection Evaluating Reward Models in Multilingual Settings • 2 items • Updated Feb 10
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge Paper • 2411.19799 • Published Nov 29, 2024 • 17
Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation Paper • 2504.07072 • Published Apr 9, 2025 • 9
Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA Paper • 2505.16293 • Published May 22, 2025 • 3
Behind Maya: Building a Multilingual Vision Language Model Paper • 2505.08910 • Published May 13, 2025 • 2
Robust and Fine-Grained Detection of AI Generated Texts Paper • 2504.11952 • Published Apr 16, 2025 • 12
Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation Paper • 2504.07072 • Published Apr 9, 2025 • 9
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 48
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published Mar 10, 2025 • 101
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published Jan 14, 2025 • 62
Bridging the Data Provenance Gap Across Text, Speech and Video Paper • 2412.17847 • Published Dec 19, 2024 • 12