File size: 2,258 Bytes
555f04e
 
9ec7dbf
 
 
 
 
 
 
555f04e
9ec7dbf
 
 
 
 
6527bde
4da1b54
e58ae64
6527bde
9ec7dbf
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
---
license: creativeml-openrail-m
language:
  - en
tags:
  - controlnet
  - stable-diffusion
  - urban-design
pipeline_tag: image-to-image
---

# Stepwise Generative Urban Design

ControlNet-based diffusion models for automatic urban design generation, conditioned on site constraints and text descriptions.

**Paper**: *Human-guided urban form generation using multimodal diffusion models*, Building and Environment, 2026 

[Full paper](https://doi.org/10.1016/j.buildenv.2025.113892); [Arxiv](https://arxiv.org/abs/2505.24260); 
**Code & documentation**: [GitHub](https://github.com/Hemy17/Stepwise_GenerativeUrbanDesign)

## Models

Six checkpoints covering two cities × three pipeline steps:

| Checkpoint | City | Step |
|------------|------|------|
| `checkpoints_step1_nyc` | New York City | Site constraints → Land use + road network |
| `checkpoints_step1_chi` | Chicago | Site constraints → Land use + road network |
| `checkpoints_step2_nyc` | New York City | Land use + roads → Building footprint layout |
| `checkpoints_step2_chi` | Chicago | Land use + roads → Building footprint layout |
| `checkpoints_step3_nyc` | New York City | Building footprints → Satellite image |
| `checkpoints_step3_chi` | Chicago | Building footprints → Satellite image |

Fine-tuned from [`runwayml/stable-diffusion-v1-5`](https://huggingface.co/runwayml/stable-diffusion-v1-5) + ControlNet. Checkpoints are FP16, ~2.9 GB each.

## Citation

```bibtex
@article{he2025human,
  title   = {Human-guided urban form generation using multimodal diffusion models},
  author  = {He, Mingyi and Liang, Yuebing and Wang, Shenhao and Zheng, Yunhan
             and Wang, Qingyi and Zhuang, Dingyi and Tian, Li and Zhao, Jinhua},
  journal = {Building and Environment},
  pages   = {113892},
  year    = {2025},
  doi     = {10.1016/j.buildenv.2025.113892}
}

@article{he2025generative,
  title   = {Generative {AI} for urban design: a stepwise approach integrating
             human expertise with multimodal diffusion models},
  author  = {He, Mingyi and Liang, Yuebing and Wang, Shenhao and Zheng, Yunhan
             and Wang, Qingyi and Zhuang, Dingyi and Tian, Li and Zhao, Jinhua},
  journal = {arXiv preprint arXiv:2505.24260},
  year    = {2025}
}
```