Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models Paper β’ 2505.22232 β’ Published May 28, 2025 β’ 18