ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions
Paper • 2512.10286 • Published
Xiaoxue Wu1,2*, Xinyuan Chen2†, Yaohui Wang2†, Yu Qiao2†
git clone https://github.com/UknowSth/ShotDirector.git
cd ShotDirector
conda create -n shotdirector python==3.11.9
conda activate shotdirector
pip install torch==2.5.1 torchvision==0.20.1 --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt
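After installation, it can help to confirm that the pinned packages actually resolved to the versions in the pip command above. The following is a minimal sketch, not part of the ShotDirector repository: the helper names `base_version` and `version_mismatches` are hypothetical, and it assumes wheels from the cu118 index report a local suffix such as `+cu118`, which is stripped before comparing.

```python
from importlib.metadata import PackageNotFoundError, version

# Pins copied from the pip install command above.
PINNED = {"torch": "2.5.1", "torchvision": "0.20.1"}


def base_version(v):
    """Drop a local build suffix such as '+cu118' (assumed wheel naming)."""
    return v.split("+", 1)[0]


def version_mismatches(pinned=PINNED):
    """Return {package: installed version or None} for every unsatisfied pin."""
    problems = {}
    for name, wanted in pinned.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            installed = None  # package is not installed in this environment
        if installed is None or base_version(installed) != wanted:
            problems[name] = installed
    return problems


if __name__ == "__main__":
    problems = version_mismatches()
    if problems:
        for name, installed in problems.items():
            print(f"{name}: expected {PINNED[name]}, found {installed}")
    else:
        print("torch and torchvision match the pinned versions.")
```

The check reads package metadata only, so it does not import torch or touch the GPU.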
Download the Wan2.1-T2V-1.3B weights and the weights required for ShotDirector, then place them in the ckpt/ folder with the following layout.
ckpt/
├── Wan2.1/Wan2.1-T2V-1.3B/
│   ├── config.json
│   ├── diffusion_pytorch_model.safetensors
│   ├── google/
│   ├── models_t5_umt5-xxl-enc-bf16.pth
│   └── Wan2.1_VAE.pth
├── encoder.pt
├── model.pt
└── trans.pt
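Before running inference, the layout above can be checked with a short script. This is a sketch under stated assumptions rather than a tool shipped with the repository: the function name `missing_checkpoints` is hypothetical, the paths are copied from the diagram, and the script is assumed to run from the repository root so that `ckpt` resolves correctly.

```python
from pathlib import Path

# Relative paths copied from the directory diagram above.
EXPECTED = [
    "Wan2.1/Wan2.1-T2V-1.3B/config.json",
    "Wan2.1/Wan2.1-T2V-1.3B/diffusion_pytorch_model.safetensors",
    "Wan2.1/Wan2.1-T2V-1.3B/google",
    "Wan2.1/Wan2.1-T2V-1.3B/models_t5_umt5-xxl-enc-bf16.pth",
    "Wan2.1/Wan2.1-T2V-1.3B/Wan2.1_VAE.pth",
    "encoder.pt",
    "model.pt",
    "trans.pt",
]


def missing_checkpoints(root="ckpt"):
    """Return the expected entries that do not exist under `root`."""
    base = Path(root)
    return [rel for rel in EXPECTED if not (base / rel).exists()]


if __name__ == "__main__":
    missing = missing_checkpoints()
    if missing:
        print("Missing under ckpt/:")
        for rel in missing:
            print("  " + rel)
    else:
        print("All expected checkpoint files found.")
```

The script only tests for existence; it does not verify file sizes or hashes, so a truncated download would still pass.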
For more inference details, please refer to our GitHub repository.
If you find ShotDirector useful for your research and applications, please cite using this BibTeX:
@misc{wu2025shotdirectordirectoriallycontrollablemultishot,
title={ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions},
author={Xiaoxue Wu and Xinyuan Chen and Yaohui Wang and Yu Qiao},
year={2025},
eprint={2512.10286},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2512.10286},
}