houyuanchen/UniVidX

video-generation


houyuanchen committed 7 days ago

Commit e37e738 · 1 Parent(s): 01c90ba

model card

Files changed (1)

README.md +33 -0

README.md ADDED

	@@ -0,0 +1,33 @@

+# UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
+UniVidX is a unified multimodal video diffusion framework for versatile video generation and perception. It supports omni-directional conditional generation across multiple modalities by training a single model to handle different input-output mappings rather than one fixed task.
+This repository hosts the released UniVidX checkpoints:
+- `univid_intrinsic.safetensors`: checkpoint for UniVid-Intrinsic, covering RGB, albedo, irradiance, and normal video modalities.
+- `univid_alpha.safetensors`: checkpoint for UniVid-Alpha, covering blended RGB video, alpha matte, foreground, and background modalities.
+## Links
+- Paper: [arXiv:2605.00658](https://arxiv.org/pdf/2605.00658)
+- Code: [github.com/houyuanchen111/UniVidX](https://github.com/houyuanchen111/UniVidX)
+- Project / Model Page: [huggingface.co/houyuanchen/UniVidX](https://huggingface.co/houyuanchen/UniVidX)
+## Citation
+If you find this work useful, please cite:
+```bibtex
+@article{chen2026unividx,
+  title     = {UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors},
+  author    = {Chen, Houyuan and Li, Hong and Kong, Xianghao and Zhu, Tianrui and Xu, Shaocong and Xiao, Weiqing and Guo, Yuwei and Ye, Chongjie and Zhang, Lvmin and Zhao, Hao and Rao, Anyi},
+  journal   = {ACM Transactions on Graphics},
+  volume    = {45},
+  number    = {4},
+  articleno = {51},
+  year      = {2026},
+  month     = jul,
+  doi       = {10.1145/3811304},
+  url       = {https://doi.org/10.1145/3811304}
+}
+```
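
The added README lists two checkpoint files but does not show how to fetch them. The following is a minimal sketch, not part of the commit, of one way to resolve a variant name to its checkpoint file and download it from this repository. The variant keys (`"intrinsic"`, `"alpha"`), the modality labels, and the helper names `checkpoint_filename` and `download_checkpoint` are hypothetical and chosen for illustration; only the repository id and the two file names come from the README. The download step assumes the `huggingface_hub` package is installed.

```python
from typing import Optional

# Repository id and file names are taken from the README above.
REPO_ID = "houyuanchen/UniVidX"

# Hypothetical variant keys and modality labels, used here for illustration only.
CHECKPOINTS = {
    "intrinsic": {
        "filename": "univid_intrinsic.safetensors",
        "modalities": ("rgb", "albedo", "irradiance", "normal"),
    },
    "alpha": {
        "filename": "univid_alpha.safetensors",
        "modalities": ("blended_rgb", "alpha", "foreground", "background"),
    },
}


def checkpoint_filename(variant: str) -> str:
    """Return the checkpoint file name for a variant, or raise ValueError."""
    try:
        return CHECKPOINTS[variant]["filename"]
    except KeyError:
        known = ", ".join(sorted(CHECKPOINTS))
        raise ValueError(f"unknown variant {variant!r}; expected one of: {known}")


def download_checkpoint(variant: str, cache_dir: Optional[str] = None) -> str:
    """Download the checkpoint for a variant and return its local path.

    The import is deferred so the lookup helper above works even when
    huggingface_hub is not installed.
    """
    from huggingface_hub import hf_hub_download

    return hf_hub_download(
        repo_id=REPO_ID,
        filename=checkpoint_filename(variant),
        cache_dir=cache_dir,
    )
```

Calling `download_checkpoint("alpha")` would return a local path to `univid_alpha.safetensors`, which could then be read with `safetensors.torch.load_file` if PyTorch and `safetensors` are available. How the resulting weights are loaded into the UniVidX model is defined by the code repository linked in the README and is not shown here.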