ASCAT: An Arabic Scientific Corpus and Benchmark for Advanced Translation Evaluation • Paper • 2604.00015 • Published Mar 10
Abjad-Kids: An Arabic Speech Classification Dataset for Primary Education • Paper • 2603.20255 • Published Mar 11
SyriSign: A Parallel Corpus for Arabic Text to Syrian Arabic Sign Language Translation • Paper • 2603.29219 • Published 18 days ago
VECTOR: Velocity-Enhanced GRU Neural Network for Real-Time 3D UAV Trajectory Prediction • Paper • 2410.23305 • Published Oct 24, 2024
ARCADE: A City-Scale Corpus for Fine-Grained Arabic Dialect Tagging • Paper • 2601.02209 • Published Jan 5 • 3
Alexandria: A Multi-Domain Dialectal Arabic Machine Translation Dataset for Culturally Inclusive and Linguistically Diverse LLMs • Paper • 2601.13099 • Published Jan 19
Arabic LLM Security & Prompt Guarding Collection This collection brings together a set of Arabic-focused models, datasets, and tools designed to improve the security and safety of LLMs • 4 items • Updated Mar 16 • 2
AraModernBERT: Transtokenized Initialization and Long-Context Encoder Modeling for Arabic • Paper • 2603.09982 • Published Feb 10