--- title: VGGT-Omega Demo emoji: 🌀 colorFrom: blue colorTo: green sdk: gradio sdk_version: 5.50.0 app_file: app.py python_version: "3.10" hardware: zero-gpu pinned: false license: other license_name: fair-noncommercial-research-license-v1 license_link: LICENSE short_description: 3D reconstruction from images/video with VGGT-Omega models: - facebook/VGGT-Omega --- # VGGT-Ω Demo Interactive demo for **VGGT-Ω**, a feed-forward camera and depth reconstruction model from the Visual Geometry Group (Oxford) and Meta AI. Upload images or a short video and the model returns a 3D point cloud with estimated camera poses, visualized as a GLB scene. - **Project page:** https://vggt-omega.github.io/ - **Source code:** https://github.com/facebookresearch/vggt-omega - **Model card:** https://huggingface.co/facebook/VGGT-Omega ## Citation ```bibtex @inproceedings{wang2026vggtomega, title={VGGT-{$\Omega$}}, author={Wang, Jianyuan and Chen, Minghao and Zhang, Shangzhan and Karaev, Nikita and Sch{\"o}nberger, Johannes and Labatut, Patrick and Bojanowski, Piotr and Novotny, David and Vedaldi, Andrea and Rupprecht, Christian}, booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition}, year={2026} } ``` ## License This demo is released under the FAIR Noncommercial Research License v1. See [LICENSE](LICENSE).