In a Training Loop 🔄

12 16

Ads Dawson

0xmoose

AI & ML interests

None yet

Recent Activity

liked a model 12 days ago

google/gemma-4-31B

upvoted a collection 12 days ago

Gemma 4

upvoted an article 12 days ago

Welcome Gemma 4: Frontier multimodal intelligence on device

View all activity

Organizations

upvoted a collection 12 days ago

Gemma 4

Collection

8 items • Updated 13 days ago • 605

upvoted an article 12 days ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

14 days ago

•

849

upvoted 2 papers 15 days ago

TAPS: Task Aware Proposal Distributions for Speculative Sampling

Paper • 2603.27027 • Published 19 days ago • 142

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Paper • 2603.25158 • Published 20 days ago • 50

upvoted a changelog 27 days ago

Hugging Face Changelog

Hugging Face Papers for AI Agents

28 days ago

• 137

upvoted 2 papers about 1 month ago

Reasoning Models Struggle to Control their Chains of Thought

Paper • 2603.05706 • Published Mar 5 • 37

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 194

upvoted an article about 1 month ago

Article

Mixture of Experts Explained

Dec 11, 2023

•

1.11k

upvoted an article 2 months ago

Article

OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments

Feb 12

•

upvoted a collection 3 months ago

Open Coding Agents

Collection

13 items • Updated Mar 5 • 52

upvoted a changelog 3 months ago

Hugging Face Changelog

HuggingChat for Papers

Jan 7

• 104

upvoted an article 3 months ago

Article

Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model

Jan 1

•

Ads Dawson

AI & ML interests

Recent Activity

Organizations

0xmoose's activity

Welcome Gemma 4: Frontier multimodal intelligence on device

Hugging Face Papers for AI Agents

Mixture of Experts Explained

OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments

HuggingChat for Papers

Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model