Natural Language Processing @ TUM

university

AI & ML interests

NLP, XAI, Summarization, Hate Speech Detection, LegalNLP

Recent Activity

dmlls authored a paper 4 days ago

This is not correct! Negation-aware Evaluation of Language Generation Systems

dmlls authored a paper 4 days ago

Explainable Semantic Textual Similarity via Dissimilar Span Detection

dmlls published a dataset about 2 months ago

tum-nlp/span-similarity-dataset

View all activity

authored 2 papers 4 days ago

This is not correct! Negation-aware Evaluation of Language Generation Systems

Paper • 2307.13989 • Published Jul 26, 2023

Explainable Semantic Textual Similarity via Dissimilar Span Detection

Paper • 2603.21174 • Published 28 days ago

published a dataset about 2 months ago

tum-nlp/span-similarity-dataset

Viewer • Updated Mar 1 • 1k • 190

updated a dataset about 2 months ago

tum-nlp/span-similarity-dataset

Viewer • Updated Mar 1 • 1k • 190

updated a dataset 5 months ago

tum-nlp/German4All-Corpus

Preview • Updated Nov 25, 2025 • 111 • 1

updated a model 5 months ago

tum-nlp/German4all-paraphrasing-xl

Text Generation • Updated Nov 25, 2025 • 36 • 1

updated a Space 6 months ago

README

updated a dataset 6 months ago

tum-nlp/cognitive-biases-in-llms

Viewer • Updated Oct 30, 2025 • 30k • 63

updated a dataset 6 months ago

tum-nlp/cognitive-biases-in-llms

Viewer • Updated Oct 30, 2025 • 30k • 63

published a dataset 6 months ago

tum-nlp/cognitive-biases-in-llms

Viewer • Updated Oct 30, 2025 • 30k • 63

authored 4 papers 6 months ago

Mask and You Shall Receive: Optimizing Masked Language Modeling For Pretraining BabyLMs

Paper • 2510.20475 • Published Oct 23, 2025 • 1

EXECUTE: A Multilingual Benchmark for LLM Token Understanding

Paper • 2505.17784 • Published May 23, 2025

Are BabyLMs Second Language Learners?

Paper • 2410.21254 • Published Oct 28, 2024

Subword-Delimited Downsampling for Better Character-Level Translation

Paper • 2212.01304 • Published Dec 2, 2022

updated a dataset 6 months ago

tum-nlp/cannot-dataset

Viewer • Updated Oct 23, 2025 • 77.4k • 30

in tum-nlp/German4All-Corpus 8 months ago

Update dataset card: Add paper/code links, detailed citation, and relevant tags

#2 opened 8 months ago by

updated a model 8 months ago

tum-nlp/German4all-paraphrasing-xl

Text Generation • Updated Nov 25, 2025 • 36 • 1

published a dataset 8 months ago

tum-nlp/German4All-Corpus

Preview • Updated Nov 25, 2025 • 111 • 1

updated a collection 8 months ago

German4All

A collection of datasets and models for paraphrasing German texts to different complexity levels. • 4 items • Updated Aug 29, 2025

authored a paper 8 months ago

RoD-TAL: A Benchmark for Answering Questions in Romanian Driving License Exams

Paper • 2507.19666 • Published Jul 25, 2025