Mehrdad Farnoosh

mfarnoosh
AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 3 months ago

OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation

Paper • 2601.15369 • Published Jan 21 • 21

VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

Paper • 2512.10942 • Published Dec 11, 2025 • 56
upvoted 2 papers 5 months ago

VLA-4D: Embedding 4D Awareness into Vision-Language-Action Models for SpatioTemporally Coherent Robotic Manipulation

Paper • 2511.17199 • Published Nov 21, 2025 • 8

RynnVLA-002: A Unified Vision-Language-Action and World Model

Paper • 2511.17502 • Published Nov 21, 2025 • 28
upvoted 2 papers about 1 year ago

DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance

Paper • 2504.01724 • Published Apr 2, 2025 • 68

ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model

Paper • 2503.21144 • Published Mar 27, 2025 • 27
upvoted an article almost 2 years ago

Diffusers welcomes Stable Diffusion 3

Article • Published Jun 12, 2024 • 99
upvoted a paper about 2 years ago

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Paper • 2402.17485 • Published Feb 27, 2024 • 194