Masoud Hashemi

masoudhashemi

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Article • Mar 10 • 124

upvoted a paper 11 days ago

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published 12 days ago • 92

upvoted a paper 17 days ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published 18 days ago • 96

upvoted an article 19 days ago

A New Framework for Evaluating Voice Agents (EVA)

Article • 20 days ago • 86

upvoted a paper 26 days ago

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published 30 days ago • 147

upvoted an article 4 months ago

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Article • Dec 9, 2025 • 84

upvoted a paper 6 months ago

Apriel-Nemotron-15B-Thinker

Paper • 2508.10948 • Published Aug 13, 2025 • 6

upvoted an article 6 months ago

Smol2Operator: Post-Training GUI Agents for Computer Use

Article • Sep 23, 2025 • 138

upvoted a collection 6 months ago

Apriel-1.5-15B-Thinker

3 items • Updated Oct 2, 2025 • 75

upvoted an article 7 months ago

Gaia2 and ARE: Empowering the community to study agents

Article • Sep 22, 2025 • 133

upvoted a paper 7 months ago

AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

Paper • 2509.08031 • Published Sep 9, 2025 • 21

upvoted a paper 9 months ago

Deep Researcher with Test-Time Diffusion

Paper • 2507.16075 • Published Jul 21, 2025 • 68

upvoted an article 9 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • Jul 8, 2025 • 769

upvoted a collection 10 months ago

MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated Feb 13 • 118

upvoted a collection 11 months ago

General-Reasoner

Advancing LLMs' general reasoning capabilities • 9 items • Updated Oct 12, 2025 • 6

upvoted an article 12 months ago

Selective fine-tuning of Language Models with Spectrum

Article • Sep 3, 2024 • 36

upvoted 2 articles about 1 year ago

Open R1: Update #3

Article • Mar 11, 2025 • 297

The N Implementation Details of RLHF with PPO

Article • Oct 24, 2023 • 72

upvoted an article almost 2 years ago

BigCodeBench: The Next Generation of HumanEval

Article • Jun 18, 2024 • 53

upvoted a collection about 2 years ago

[lecture artifacts] aligning open language models

artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17, 2024 • 58