Instructions to use Efficient-Large-Model/SANA-WM_bidirectional with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use Efficient-Large-Model/SANA-WM_bidirectional with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline from diffusers.utils import load_image, export_to_video # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("Efficient-Large-Model/SANA-WM_bidirectional", dtype=torch.bfloat16, device_map="cuda") pipe.to("cuda") prompt = "A man with short gray hair plays a red electric guitar." image = load_image( "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/guitar-man.png" ) output = pipe(image=image, prompt=prompt).frames[0] export_to_video(output, "output.mp4") - Notebooks
- Google Colab
- Kaggle
add a gif preview?
Browse filesstands out more imo but feel free to close/merge @HaoyiZhu
README.md
CHANGED
|
@@ -10,6 +10,8 @@ tags:
|
|
| 10 |
|
| 11 |
# SANA-WM (Bidirectional)
|
| 12 |
|
|
|
|
|
|
|
| 13 |
**SANA-WM** is an efficient open-source world model trained natively for
|
| 14 |
one-minute generation. The bidirectional checkpoint released here is a
|
| 15 |
2.6B-parameter image-to-video diffusion transformer that synthesises
|
|
|
|
| 10 |
|
| 11 |
# SANA-WM (Bidirectional)
|
| 12 |
|
| 13 |
+

|
| 14 |
+
|
| 15 |
**SANA-WM** is an efficient open-source world model trained natively for
|
| 16 |
one-minute generation. The bidirectional checkpoint released here is a
|
| 17 |
2.6B-parameter image-to-video diffusion transformer that synthesises
|