Collections

Discover the best community collections!

Collections including paper arxiv:2312.13964

Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions

Paper • 2501.01425 • Published Jan 2, 2025 • 5
Qwen/Qwen3.5-35B-A3B

Image-Text-to-Text • 36B • Updated Feb 27 • 3.89M • 1.39k
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models

Paper • 2312.13964 • Published Dec 21, 2023 • 19
Wan2.2 Animate

Space • Paused • Agents • Featured • 5.11k

aMUSEd: An Open MUSE Reproduction

Paper • 2401.01808 • Published Jan 3, 2024 • 31
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations

Paper • 2401.01885 • Published Jan 3, 2024 • 28
SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity

Paper • 2401.00604 • Published Dec 31, 2023 • 6
LARP: Language-Agent Role Play for Open-World Games

Paper • 2312.17653 • Published Dec 24, 2023 • 33

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models

Paper • 2312.13964 • Published Dec 21, 2023 • 19
vikhyatk/moondream1

Text Generation • 2B • Updated Feb 7, 2024 • 11.2k • 487

Wan AI Wan2.1 T2V 14B

Space • Running • Agents • 2 • Generate videos from text prompts
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models

Paper • 2312.13964 • Published Dec 21, 2023 • 19
Wan-AI/Wan2.1-I2V-14B-720P

Image-to-Video • Updated Feb 26, 2025 • 8.98k • 576
Stable Video Diffusion 1.1

Space • Running on Zero • MCP • Featured • 2.01k • Create a short video from a single image

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models

Paper • 2312.13964 • Published Dec 21, 2023 • 19
LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 264
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation

Paper • 2312.12491 • Published Dec 19, 2023 • 76
LLaVA-φ: Efficient Multi-Modal Assistant with Small Language Model

Paper • 2401.02330 • Published Jan 4, 2024 • 18

One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning

Paper • 2306.07967 • Published Jun 13, 2023 • 26
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Paper • 2306.07954 • Published Jun 13, 2023 • 113
TryOnDiffusion: A Tale of Two UNets

Paper • 2306.08276 • Published Jun 14, 2023 • 75
Seeing the World through Your Eyes

Paper • 2306.09348 • Published Jun 15, 2023 • 34
