metadata
license: mit
ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions
Xiaoxue Wu1,2*, Xinyuan Chen2β , Yaohui Wang2β , Yu Qiao2β ,
π₯ Installation
- Clone the Repository
git clone https://github.com/UknowSth/ShotDirector.git
cd ShotDirector
- Set up Environment
conda create -n shotdirector python==3.11.9
conda activate shotdirector
pip install torch==2.5.1 torchvision==0.20.1 --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt
π€ Checkpoint
CineTrans-DiT
Download the weights of Wan2.1-T2V-1.3B and the weights required for Shotdirector. Place them in the .ckpt/ folder as shown in the following diagram.
ckpt/
βββ Wan2.1/Wan2.1-T2V-1.3B/
β βββ config.json
β βββ diffusion_pytorch_model.safetensors
β βββ google/
β βββ models_t5_umt5-xxl-enc-bf16.pth
β βββ Wan2.1_VAE.pth
βββ encoder.pt
βββ model.pt
βββ trans.pt
For more inference details, please refer to our GitHub repository.
π BiTeX
If you find ShotDirector useful for your research and applications, please cite using this BibTeX:
@misc{wu2025shotdirectordirectoriallycontrollablemultishot,
title={ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions},
author={Xiaoxue Wu and Xinyuan Chen and Yaohui Wang and Yu Qiao},
year={2025},
eprint={2512.10286},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2512.10286},
}