Pouya Esmaeili's picture

8 4

Pouya Esmaeili

Pouyae

·

https://pouyae.xyz

AI & ML interests

RAG/LLM/Agents

Organizations

None yet

upvoted a collection 8 months ago

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 8 days ago • 104

upvoted 3 articles 11 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

+2

Dec 9, 2022

•

407

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

293

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Apr 16, 2025

•

69

upvoted 2 collections 11 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 666

Phi-4

Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10, 2025 • 207

upvoted a paper about 2 years ago

Grandmaster-Level Chess Without Search

Paper • 2402.04494 • Published Feb 7, 2024 • 69

upvoted a paper over 2 years ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 264