This is not correct! Negation-aware Evaluation of Language Generation Systems Paper • 2307.13989 • Published Jul 26, 2023
Explainable Semantic Textual Similarity via Dissimilar Span Detection Paper • 2603.21174 • Published 28 days ago
Mask and You Shall Receive: Optimizing Masked Language Modeling For Pretraining BabyLMs Paper • 2510.20475 • Published Oct 23, 2025 • 1
EXECUTE: A Multilingual Benchmark for LLM Token Understanding Paper • 2505.17784 • Published May 23, 2025
Subword-Delimited Downsampling for Better Character-Level Translation Paper • 2212.01304 • Published Dec 2, 2022
German4All Collection A collection of datasets and models for paraphrasing German texts to different complexity levels. • 4 items • Updated Aug 29, 2025
RoD-TAL: A Benchmark for Answering Questions in Romanian Driving License Exams Paper • 2507.19666 • Published Jul 25, 2025