Amir Hossein Kargaran's picture

Amir Hossein Kargaran

kargaranamir

·

https://kargaranamir.github.io

AI & ML interests

#NLP, checkout https://huggingface.co/cis-lmu

Recent Activity

updated a dataset about 10 hours ago

kargaranamir/coercion

updated a collection about 16 hours ago

updated a collection about 16 hours ago

View all activity

Organizations

upvoted a collection 1 day ago

Ovis2

Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated Mar 25, 2025 • 66

upvoted 2 papers 2 days ago

GlotOCR Bench: OCR Models Still Struggle Beyond a Handful of Unicode Scripts

Paper • 2604.12978 • Published 3 days ago • 5

Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations

Paper • 2402.17152 • Published Feb 27, 2024 • 6

upvoted an article 4 days ago

Article

Falcon Perception

16 days ago

•

58

upvoted a collection 5 days ago

GlotSuite

GlotSuite: Paving the Way for Bringing Generative AI to Underserved Communities • 17 items • Updated 2 days ago • 3

upvoted a paper 8 days ago

The Role of Language Imbalance in Cross-lingual Generalisation: Insights from Cloned Language Experiments

Paper • 2404.07982 • Published Apr 11, 2024 • 1

upvoted a paper 9 days ago

Challenging the Evaluator: LLM Sycophancy Under User Rebuttal

Paper • 2509.16533 • Published Sep 20, 2025 • 1

upvoted a paper 29 days ago

Omnilingual MT: Machine Translation for 1,600 Languages

Paper • 2603.16309 • Published Mar 17 • 21

upvoted a paper 30 days ago

Qianfan-OCR: A Unified End-to-End Model for Document Intelligence

Paper • 2603.13398 • Published Mar 11 • 153

upvoted a paper about 1 month ago

Can Vision-Language Models Solve the Shell Game?

Paper • 2603.08436 • Published Mar 9 • 39

upvoted a collection about 1 month ago

OCR

31 items • Updated about 16 hours ago • 1

upvoted a collection about 2 months ago

Languages identification

a variety of pre-trained language identification models • 9 items • Updated Jul 31, 2025 • 2

upvoted a collection 5 months ago

OLDI and friends

This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task. • 5 items • Updated 23 days ago • 5

upvoted an article 5 months ago

Article

Continuous batching from first principles

+1

Nov 25, 2025

•

359

upvoted a paper 5 months ago

Insights from the ICLR Peer Review and Rebuttal Process

Paper • 2511.15462 • Published Nov 19, 2025 • 7

upvoted a collection 6 months ago

mmBERT: a modern multilingual encoder

mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance • 16 items • Updated Sep 9, 2025 • 53

upvoted a paper 6 months ago

CoBia: Constructed Conversations Can Trigger Otherwise Concealed Societal Biases in LLMs

Paper • 2510.09871 • Published Oct 10, 2025 • 3

upvoted a paper 8 months ago

Multi-Turn Puzzles: Evaluating Interactive Reasoning and Strategic Dialogue in LLMs

Paper • 2508.10142 • Published Aug 13, 2025 • 3

upvoted a changelog 8 months ago

Hugging Face Changelog

Connect Your MCP Client to the Hugging Face Hub

Jun 6, 2025

• 114

upvoted a collection 9 months ago

llm-urls-neurips

56 items • Updated Mar 2 • 2