Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation Paper • 2604.05083 • Published 12 days ago
What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time? Paper • 2603.19017 • Published 29 days ago • 3
What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time? Paper • 2603.19017 • Published 29 days ago • 3
Simba Speech Series Collection Simba bridges the digital divide with a unified suite for African AI: the largest open-source speech benchmark and models covering 61 languages • 13 items • Updated Feb 12 • 1