poterliu

AI & ML interests

None yet

Organizations

None yet

upvoted 4 collections about 1 year ago

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 12 days ago • 267

Deepseek Papers

Deepseek papers collection • 31 items • Updated 3 days ago • 338

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 562

DeepSeek-R1

10 items • Updated Nov 27, 2025 • 839

upvoted a paper about 1 year ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 447

upvoted a paper over 1 year ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14, 2025 • 302

upvoted a collection over 1 year ago

DeepSeek-V3

4 items • Updated Nov 27, 2025 • 284