16 9 6

Le Thien Phuc Nguyen

plnguyen2908

https://plnguyen2908.github.io/

plnguyen2908

AI & ML interests

Computer Vision, NLP, Applied AI

Recent Activity

upvoted a paper 29 minutes ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

published a dataset 5 days ago

plnguyen2908/AVHBench_clone

upvoted a collection 15 days ago

VideoLLaMA2

View all activity

Organizations

upvoted a paper 29 minutes ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

Paper • 2604.13151 • Published 3 days ago • 13

published a dataset 5 days ago

plnguyen2908/AVHBench_clone

Updated 5 days ago • 19

upvoted a collection 15 days ago

VideoLLaMA2

Collection

Optimized VideoLLaMA with improved spatial-temporal modeling and better audio understanding capability • 13 items • Updated Sep 2, 2025 • 20

liked a model 2 months ago

mozilla-ai/gemma-3-4b-it-llamafile

Text Generation • Updated Mar 31, 2025 • 218 • 6

liked a Space 3 months ago

AI Deadlines

⚡

723

Find upcoming AI conference deadlines instantly

upvoted a collection 3 months ago

VisionLM

Collection

1884 items • Updated Jan 12 • 146

upvoted an article 4 months ago

Article

Vision Language Model Alignment in TRL ⚡️

Aug 7, 2025

•

109

updated a dataset 4 months ago

plnguyen2908/AV-SpeakerBench

Viewer • Updated Dec 15, 2025 • 3.21k • 3.24k • 2

authored 2 papers 4 months ago

See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models

Paper • 2512.02231 • Published Dec 1, 2025 • 9

LASER: Lip Landmark Assisted Speaker Detection for Robustness

Paper • 2501.11899 • Published Jan 21, 2025

commented a paper 4 months ago

See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models

Paper • 2512.02231 • Published Dec 1, 2025 • 9 •

liked a Space 4 months ago

paper-central

⚡

228

Explore, filter, and chat with research papers

submitted a paper to Daily Papers 4 months ago

See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models

Paper • 2512.02231 • Published Dec 1, 2025 • 9

upvoted 2 papers 4 months ago

See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models

Paper • 2512.02231 • Published Dec 1, 2025 • 9

Relational Visual Similarity

Paper • 2512.07833 • Published Dec 8, 2025 • 25

liked a dataset 5 months ago

plnguyen2908/LASER-bench

Viewer • Updated Nov 22, 2025 • 5.32k • 121 • 1

upvoted an article 5 months ago

Article

The Annotated Diffusion Model

Jun 7, 2022

•

341

liked a dataset 5 months ago

plnguyen2908/AV-SpeakerBench

Viewer • Updated Dec 15, 2025 • 3.21k • 3.24k • 2

updated a dataset 5 months ago

plnguyen2908/LASER-bench

Viewer • Updated Nov 22, 2025 • 5.32k • 121 • 1

published a dataset 5 months ago

plnguyen2908/LASER-bench

Viewer • Updated Nov 22, 2025 • 5.32k • 121 • 1

Le Thien Phuc Nguyen

AI & ML interests

Recent Activity

Organizations

plnguyen2908's activity

AI Deadlines

Vision Language Model Alignment in TRL ⚡️

paper-central

The Annotated Diffusion Model