Mehrdad Farnoosh's picture

9 1

Mehrdad Farnoosh

mfarnoosh

·

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 3 months ago

OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation

Paper • 2601.15369 • Published Jan 21 • 21

VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

Paper • 2512.10942 • Published Dec 11, 2025 • 56

upvoted 2 papers 5 months ago

VLA-4D: Embedding 4D Awareness into Vision-Language-Action Models for SpatioTemporally Coherent Robotic Manipulation

Paper • 2511.17199 • Published Nov 21, 2025 • 8

RynnVLA-002: A Unified Vision-Language-Action and World Model

Paper • 2511.17502 • Published Nov 21, 2025 • 28

upvoted 2 papers about 1 year ago

DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance

Paper • 2504.01724 • Published Apr 2, 2025 • 68

ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model

Paper • 2503.21144 • Published Mar 27, 2025 • 27

upvoted an article almost 2 years ago

Article

Diffusers welcomes Stable Diffusion 3

+4

Jun 12, 2024

•

99

upvoted a paper about 2 years ago

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Paper • 2402.17485 • Published Feb 27, 2024 • 194