houyuanchen commited on
Commit
e37e738
·
1 Parent(s): 01c90ba

model card

Browse files
Files changed (1) hide show
  1. README.md +33 -0
README.md ADDED
@@ -0,0 +1,33 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
2
+
3
+ UniVidX is a unified multimodal video diffusion framework for versatile video generation and perception. It supports omni-directional conditional generation across multiple modalities by training a single model to handle different input-output mappings rather than one fixed task.
4
+
5
+ This repository hosts the released UniVidX checkpoints:
6
+
7
+ - `univid_intrinsic.safetensors`: checkpoint for UniVid-Intrinsic, covering RGB, albedo, irradiance, and normal video modalities.
8
+ - `univid_alpha.safetensors`: checkpoint for UniVid-Alpha, covering blended RGB video, alpha matte, foreground, and background modalities.
9
+
10
+ ## Links
11
+
12
+ - Paper: [arXiv:2605.00658](https://arxiv.org/pdf/2605.00658)
13
+ - Code: [github.com/houyuanchen111/UniVidX](https://github.com/houyuanchen111/UniVidX)
14
+ - Project / Model Page: [huggingface.co/houyuanchen/UniVidX](https://huggingface.co/houyuanchen/UniVidX)
15
+
16
+ ## Citation
17
+
18
+ If you find this work useful, please cite:
19
+
20
+ ```bibtex
21
+ @article{chen2026unividx,
22
+ title = {UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors},
23
+ author = {Chen, Houyuan and Li, Hong and Kong, Xianghao and Zhu, Tianrui and Xu, Shaocong and Xiao, Weiqing and Guo, Yuwei and Ye, Chongjie and Zhang, Lvmin and Zhao, Hao and Rao, Anyi},
24
+ journal = {ACM Transactions on Graphics},
25
+ volume = {45},
26
+ number = {4},
27
+ articleno = {51},
28
+ year = {2026},
29
+ month = jul,
30
+ doi = {10.1145/3811304},
31
+ url = {https://doi.org/10.1145/3811304}
32
+ }
33
+ ```