Image-to-3D
Diffusers
Safetensors
nielsr HF Staff commited on
Commit
7c4c301
·
verified ·
1 Parent(s): 40a9003

Improve model card and add metadata

Browse files

This PR improves the model card for Sat3DGen by adding relevant metadata and documentation.
- Added `pipeline_tag: image-to-3d` to the YAML metadata.
- Included links to the paper, GitHub repository, and project page.
- Added a sample usage code snippet from the official repository.
- Included the BibTeX citation for the paper.

Files changed (1) hide show
  1. README.md +59 -4
README.md CHANGED
@@ -1,4 +1,59 @@
1
- ---
2
- license: mit
3
- ---
4
- arxiv.org/abs/2605.14984
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ pipeline_tag: image-to-3d
4
+ ---
5
+
6
+ # Sat3DGen: Comprehensive Street-Level 3D Scene Generation from Single Satellite Image
7
+
8
+ Sat3DGen is a framework for generating street-level 3D scenes from a single satellite image. It uses a geometry-first methodology to bridge the extreme viewpoint gap between satellite and street views, achieving high geometric fidelity and photorealism.
9
+
10
+ [**Paper**](https://arxiv.org/abs/2605.14984) | [**Project Page**](https://qianmingduowan.github.io/Sat3DGen_project_page/) | [**GitHub**](https://github.com/qianmingduowan/Sat3DGen) | [**Demo**](https://huggingface.co/spaces/qian43/Sat3DGen)
11
+
12
+ ## Sample Usage
13
+
14
+ To use this model, you will need the code from the [official repository](https://github.com/qianmingduowan/Sat3DGen).
15
+
16
+ ```python
17
+ from source.generator import Sat3DGen
18
+
19
+ # Load the model
20
+ Sat3DGen._skip_backbone_weights = True
21
+ model = Sat3DGen.from_pretrained("qian43/Sat3DGen")
22
+ model = model.to("cuda:0").eval()
23
+
24
+ # Proceed with inference as described in the repository
25
+ ```
26
+
27
+ ## Citation
28
+
29
+ If you find this work useful for your research, please cite:
30
+
31
+ ```bibtex
32
+ @inproceedings{
33
+ qian2026satdgen,
34
+ title={Sat3{DG}en: Comprehensive Street-Level 3D Scene Generation from Single Satellite Image},
35
+ author={Ming Qian and Zimin Xia and Changkun Liu and Shuailei Ma and Wen Wang and Zeran Ke and Bin Tan and Hang Zhang and Gui-Song Xia},
36
+ booktitle={The Fourteenth International Conference on Learning Representations},
37
+ year={2026},
38
+ url={https://openreview.net/forum?id=E7JzkZCofa}
39
+ }
40
+
41
+ @ARTICLE{Qian_2026_Sat2Densitypp,
42
+ author={Qian, Ming and Tan, Bin and Wang, Qiuyu and Zheng, Xianwei and Xiong, Hanjiang and Xia, Gui-Song and Shen, Yujun and Xue, Nan},
43
+ journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
44
+ title={Seeing Through Satellite Images at Street Views},
45
+ year={2026},
46
+ volume={48},
47
+ number={5},
48
+ pages={5692-5709},
49
+ doi={10.1109/TPAMI.2026.3652860}}
50
+
51
+ @InProceedings{Qian_2023_Sat2Density,
52
+ author = {Qian, Ming and Xiong, Jincheng and Xia, Gui-Song and Xue, Nan},
53
+ title = {Sat2Density: Faithful Density Learning from Satellite-Ground Image Pairs},
54
+ booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
55
+ month = {October},
56
+ year = {2023},
57
+ pages = {3683-3692}
58
+ }
59
+ ```