1 22 60

MC

Dreamer312

Dreamer

AI & ML interests

NLP, CV, LLM, AGENT, RL

Recent Activity

upvoted a paper 2 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

upvoted a paper 2 days ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

upvoted a paper 16 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

View all activity

Organizations

None yet

upvoted 2 papers 2 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 14 days ago • 357

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published 8 days ago • 234

upvoted a paper 16 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 19 days ago • 143

upvoted a paper 24 days ago

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published 26 days ago • 77

upvoted 2 papers 3 months ago

Scaling Embeddings Outperforms Scaling Experts in Language Models

Paper • 2601.21204 • Published Jan 29 • 102

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 180

upvoted a paper 11 months ago

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published May 20, 2025 • 76

upvoted a collection 11 months ago

Llama 4

Collection

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated about 14 hours ago • 55

upvoted 2 papers 11 months ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19

Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation

Paper • 2409.10262 • Published Sep 16, 2024 • 1

upvoted an article 11 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

•

1.12k

upvoted a collection 11 months ago

Qwen3

Collection

84 items • Updated Dec 31, 2025 • 1.75k

upvoted an article 12 months ago

Article

Proximal Policy Optimization (PPO)

Aug 5, 2022

•

upvoted 3 articles about 1 year ago

Article

Merge Large Language Models with mergekit

Jan 9, 2024

•

154

Article

Trace & Evaluate your Agent with Arize Phoenix

Feb 28, 2025

•

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

Jan 31, 2025

•

upvoted a paper over 1 year ago

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

Paper • 2404.13013 • Published Apr 19, 2024 • 31

upvoted 3 articles over 1 year ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14, 2024

•

Article

TGI Multi-LoRA: Deploy Once, Serve 30 Models

Jul 18, 2024

•

Article

Preference Optimization for Vision Language Models

Jul 10, 2024

•

MC

AI & ML interests

Recent Activity

Organizations

Dreamer312's activity

Mixture of Experts Explained

Proximal Policy Optimization (PPO)

Merge Large Language Models with mergekit

Trace & Evaluate your Agent with Arize Phoenix

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

A failed experiment: Infini-Attention, and why we should keep trying?

TGI Multi-LoRA: Deploy Once, Serve 30 Models

Preference Optimization for Vision Language Models