vggt-omega / README.md

Minghao Chen

Initial Space: VGGT-Omega Gradio demo on ZeroGPU

b2e9eec 11 days ago

metadata

title: VGGT-Omega Demo
emoji: 🌀
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 5.50.0
app_file: app.py
python_version: '3.10'
hardware: zero-gpu
pinned: false
license: other
license_name: fair-noncommercial-research-license-v1
license_link: LICENSE
short_description: 3D reconstruction from images/video with VGGT-Omega
models:
  - facebook/VGGT-Omega

VGGT-Ω Demo

Interactive demo for VGGT-Ω, a feed-forward camera and depth reconstruction model from the Visual Geometry Group (Oxford) and Meta AI.

Upload images or a short video and the model returns a 3D point cloud with estimated camera poses, visualized as a GLB scene.
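As a rough illustration of how a predicted depth map and camera can be turned into a point cloud, the sketch below back-projects depth through a pinhole camera model and then applies a camera-to-world pose. It is not the VGGT-Ω implementation: the function names `unproject_depth` and `camera_to_world`, the intrinsics `fx`, `fy`, `cx`, `cy`, and the pose convention (X_world = R · X_cam + t) are all assumptions made for this example.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def unproject_depth(
    depth: Sequence[Sequence[float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Point]:
    """Back-project a depth map into camera-frame 3D points (pinhole model).

    Hypothetical helper: depth[v][u] is the z-depth at pixel column u, row v.
    Pixels with non-positive depth are treated as invalid and skipped.
    """
    points: List[Point] = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0:
                continue
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z))
    return points


def camera_to_world(
    points: Sequence[Point],
    rotation: Sequence[Sequence[float]],
    translation: Sequence[float],
) -> List[Point]:
    """Map camera-frame points to world frame: X_world = R @ X_cam + t.

    The camera-to-world convention is an assumption of this sketch.
    """
    world: List[Point] = []
    for p in points:
        world.append(
            tuple(
                sum(rotation[i][j] * p[j] for j in range(3)) + translation[i]
                for i in range(3)
            )
        )
    return world


# Toy 2x2 depth map; the zero entry marks an invalid pixel.
depth = [[2.0, 2.0], [0.0, 4.0]]
cam_points = unproject_depth(depth, fx=1.0, fy=1.0, cx=0.5, cy=0.5)

# Identity rotation with the camera shifted one unit along z.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
world_points = camera_to_world(cam_points, identity, (0.0, 0.0, 1.0))
```

Merging the world-frame points from every input frame would give a single point cloud of the kind the demo displays. The GLB export step is left out here.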

Project page: https://vggt-omega.github.io/
Source code: https://github.com/facebookresearch/vggt-omega
Model card: https://huggingface.co/facebook/VGGT-Omega

Citation

@inproceedings{wang2026vggtomega,
  title={VGGT-{$\Omega$}},
  author={Wang, Jianyuan and Chen, Minghao and Zhang, Shangzhan and Karaev, Nikita and Sch{\"o}nberger, Johannes and Labatut, Patrick and Bojanowski, Piotr and Novotny, David and Vedaldi, Andrea and Rupprecht, Christian},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year={2026}
}

License

This demo is released under the FAIR Noncommercial Research License v1. See LICENSE.