Instructions to use qian43/Sat3DGen with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use qian43/Sat3DGen with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("qian43/Sat3DGen", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
Improve model card and add metadata (#1)
Browse files- Improve model card and add metadata (7c4c301302409f9b3780ab3186584df5374022e1)
Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>
README.md
CHANGED
|
@@ -1,4 +1,59 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: mit
|
| 3 |
-
--
|
| 4 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
pipeline_tag: image-to-3d
|
| 4 |
+
---
|
| 5 |
+
|
| 6 |
+
# Sat3DGen: Comprehensive Street-Level 3D Scene Generation from Single Satellite Image
|
| 7 |
+
|
| 8 |
+
Sat3DGen is a framework for generating street-level 3D scenes from a single satellite image. It uses a geometry-first methodology to bridge the extreme viewpoint gap between satellite and street views, achieving high geometric fidelity and photorealism.
|
| 9 |
+
|
| 10 |
+
[**Paper**](https://arxiv.org/abs/2605.14984) | [**Project Page**](https://qianmingduowan.github.io/Sat3DGen_project_page/) | [**GitHub**](https://github.com/qianmingduowan/Sat3DGen) | [**Demo**](https://huggingface.co/spaces/qian43/Sat3DGen)
|
| 11 |
+
|
| 12 |
+
## Sample Usage
|
| 13 |
+
|
| 14 |
+
To use this model, you will need the code from the [official repository](https://github.com/qianmingduowan/Sat3DGen).
|
| 15 |
+
|
| 16 |
+
```python
|
| 17 |
+
from source.generator import Sat3DGen
|
| 18 |
+
|
| 19 |
+
# Load the model
|
| 20 |
+
Sat3DGen._skip_backbone_weights = True
|
| 21 |
+
model = Sat3DGen.from_pretrained("qian43/Sat3DGen")
|
| 22 |
+
model = model.to("cuda:0").eval()
|
| 23 |
+
|
| 24 |
+
# Proceed with inference as described in the repository
|
| 25 |
+
```
|
| 26 |
+
|
| 27 |
+
## Citation
|
| 28 |
+
|
| 29 |
+
If you find this work useful for your research, please cite:
|
| 30 |
+
|
| 31 |
+
```bibtex
|
| 32 |
+
@inproceedings{
|
| 33 |
+
qian2026satdgen,
|
| 34 |
+
title={Sat3{DG}en: Comprehensive Street-Level 3D Scene Generation from Single Satellite Image},
|
| 35 |
+
author={Ming Qian and Zimin Xia and Changkun Liu and Shuailei Ma and Wen Wang and Zeran Ke and Bin Tan and Hang Zhang and Gui-Song Xia},
|
| 36 |
+
booktitle={The Fourteenth International Conference on Learning Representations},
|
| 37 |
+
year={2026},
|
| 38 |
+
url={https://openreview.net/forum?id=E7JzkZCofa}
|
| 39 |
+
}
|
| 40 |
+
|
| 41 |
+
@ARTICLE{Qian_2026_Sat2Densitypp,
|
| 42 |
+
author={Qian, Ming and Tan, Bin and Wang, Qiuyu and Zheng, Xianwei and Xiong, Hanjiang and Xia, Gui-Song and Shen, Yujun and Xue, Nan},
|
| 43 |
+
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
|
| 44 |
+
title={Seeing Through Satellite Images at Street Views},
|
| 45 |
+
year={2026},
|
| 46 |
+
volume={48},
|
| 47 |
+
number={5},
|
| 48 |
+
pages={5692-5709},
|
| 49 |
+
doi={10.1109/TPAMI.2026.3652860}}
|
| 50 |
+
|
| 51 |
+
@InProceedings{Qian_2023_Sat2Density,
|
| 52 |
+
author = {Qian, Ming and Xiong, Jincheng and Xia, Gui-Song and Xue, Nan},
|
| 53 |
+
title = {Sat2Density: Faithful Density Learning from Satellite-Ground Image Pairs},
|
| 54 |
+
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
|
| 55 |
+
month = {October},
|
| 56 |
+
year = {2023},
|
| 57 |
+
pages = {3683-3692}
|
| 58 |
+
}
|
| 59 |
+
```
|