OriOrii (Ori Dev)

upvoted an article 3 months ago

Article

Introducing Daggr: Chain apps programmatically, inspect visually

+3

Jan 29

•

106

upvoted 2 papers 4 months ago

MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

Paper • 2512.22047 • Published Dec 26, 2025 • 30

InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion

Paper • 2512.17504 • Published Dec 19, 2025 • 99

upvoted a collection 10 months ago

Cosmos-Reason1

Collection

⚠️ The latest version of Cosmos Reason is now live! 👉 https://huggingface.co/collections/nvidia/cosmos-reason2 • 5 items • Updated 3 days ago • 41

upvoted an article 10 months ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

+7

Jun 3, 2025

•

344

upvoted an article 11 months ago

Article

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

+5

May 11, 2025

•

96

upvoted a collection 12 months ago

OLMo 2

Collection

Artifacts for the OLMo 2 release. • 35 items • Updated Mar 3 • 154

upvoted 2 articles about 1 year ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

+2

Feb 4, 2025

•

192

Article

Welcome to Inference Providers on the Hub 🔥

+5

Jan 28, 2025

•

495

upvoted 3 papers over 2 years ago

upvoted a paper almost 3 years ago

OpenMask3D: Open-Vocabulary 3D Instance Segmentation

Paper • 2306.13631 • Published Jun 23, 2023 • 11

Ori Dev

AI & ML interests

Organizations

Introducing Daggr: Chain apps programmatically, inspect visually

MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion

Cosmos-Reason1

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

OLMo 2

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Welcome to Inference Providers on the Hub 🔥

Towards A Unified Agent with Foundation Models

Android in the Wild: A Large-Scale Dataset for Android Device Control

Planting a SEED of Vision in Large Language Model

OpenMask3D: Open-Vocabulary 3D Instance Segmentation

Ori Dev

AI & ML interests

Organizations

OriOrii's activity

Introducing Daggr: Chain apps programmatically, inspect visually

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Welcome to Inference Providers on the Hub 🔥