File size: 2,258 Bytes
555f04e 9ec7dbf 555f04e 9ec7dbf 6527bde 4da1b54 e58ae64 6527bde 9ec7dbf | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 | ---
license: creativeml-openrail-m
language:
- en
tags:
- controlnet
- stable-diffusion
- urban-design
pipeline_tag: image-to-image
---
# Stepwise Generative Urban Design
ControlNet-based diffusion models for automatic urban design generation, conditioned on site constraints and text descriptions.
**Paper**: *Human-guided urban form generation using multimodal diffusion models*, Building and Environment, 2026
[Full paper](https://doi.org/10.1016/j.buildenv.2025.113892); [Arxiv](https://arxiv.org/abs/2505.24260);
**Code & documentation**: [GitHub](https://github.com/Hemy17/Stepwise_GenerativeUrbanDesign)
## Models
Six checkpoints covering two cities × three pipeline steps:
| Checkpoint | City | Step |
|------------|------|------|
| `checkpoints_step1_nyc` | New York City | Site constraints → Land use + road network |
| `checkpoints_step1_chi` | Chicago | Site constraints → Land use + road network |
| `checkpoints_step2_nyc` | New York City | Land use + roads → Building footprint layout |
| `checkpoints_step2_chi` | Chicago | Land use + roads → Building footprint layout |
| `checkpoints_step3_nyc` | New York City | Building footprints → Satellite image |
| `checkpoints_step3_chi` | Chicago | Building footprints → Satellite image |
Fine-tuned from [`runwayml/stable-diffusion-v1-5`](https://huggingface.co/runwayml/stable-diffusion-v1-5) + ControlNet. Checkpoints are FP16, ~2.9 GB each.
## Citation
```bibtex
@article{he2025human,
title = {Human-guided urban form generation using multimodal diffusion models},
author = {He, Mingyi and Liang, Yuebing and Wang, Shenhao and Zheng, Yunhan
and Wang, Qingyi and Zhuang, Dingyi and Tian, Li and Zhao, Jinhua},
journal = {Building and Environment},
pages = {113892},
year = {2025},
doi = {10.1016/j.buildenv.2025.113892}
}
@article{he2025generative,
title = {Generative {AI} for urban design: a stepwise approach integrating
human expertise with multimodal diffusion models},
author = {He, Mingyi and Liang, Yuebing and Wang, Shenhao and Zheng, Yunhan
and Wang, Qingyi and Zhuang, Dingyi and Tian, Li and Zhao, Jinhua},
journal = {arXiv preprint arXiv:2505.24260},
year = {2025}
}
```
|