10 21

Liam Duignan

Lduignan1

Lduignan1

AI & ML interests

NLP/Named Entity Recognition/LLMs

Recent Activity

liked a dataset 6 days ago

juletxara/mgsm

liked a dataset 8 days ago

HuggingFaceH4/ultrachat_200k

upvoted a paper 9 days ago

VLMs Need Words: Vision Language Models Ignore Visual Detail In Favor of Semantic Anchors

View all activity

Organizations

upvoted a paper 9 days ago

VLMs Need Words: Vision Language Models Ignore Visual Detail In Favor of Semantic Anchors

Paper • 2604.02486 • Published 14 days ago • 9

upvoted a paper 13 days ago

Lost in Cultural Translation: Do LLMs Struggle with Math Across Cultural Contexts?

Paper • 2503.18018 • Published Mar 23, 2025 • 7

upvoted a paper 20 days ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 191

upvoted a collection 21 days ago

Pensez-LLM

Collection

French-English reasoning model • 4 items • Updated Mar 2 • 4

upvoted a paper about 1 month ago

Reasoning Models Struggle to Control their Chains of Thought

Paper • 2603.05706 • Published Mar 5 • 37

upvoted an article 3 months ago

Article

TextQuests: How Good are LLMs at Text-Based Video Games?

Aug 12, 2025

•

upvoted an article 5 months ago

Article

Integrating benchmarks into LM Evaluation Harness

Jul 21, 2025

•

upvoted an article 6 months ago

Article

Supercharge your OCR Pipelines with Open Models

Oct 21, 2025

•

308

upvoted a collection over 1 year ago

FrenchBench Evaluation datasets

Collection

These datasets are used to evaluate models on French performance using: https://github.com/EleutherAI/lm-evaluation-harness (from CroissantLLM paper) • 11 items • Updated Jun 7, 2024 • 8

upvoted a paper almost 2 years ago

QLoRA: Efficient Finetuning of Quantized LLMs

Paper • 2305.14314 • Published May 23, 2023 • 61

Liam Duignan

AI & ML interests

Recent Activity

Organizations

Lduignan1's activity

TextQuests: How Good are LLMs at Text-Based Video Games?

Integrating benchmarks into LM Evaluation Harness

Supercharge your OCR Pipelines with Open Models