Keran 's picture

2 2

Keran

Keeera

·

AI & ML interests

None yet

Organizations

None yet

upvoted an article 12 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

286

upvoted a collection almost 2 years ago

PaliGemma Release

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Mar 12 • 152