vggt-omega / README.md
Minghao Chen
Initial Space: VGGT-Omega Gradio demo on ZeroGPU
b2e9eec

metadata
title: VGGT-Omega Demo
emoji: 🌀
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 5.50.0
app_file: app.py
python_version: '3.10'
hardware: zero-gpu
pinned: false
license: other
license_name: fair-noncommercial-research-license-v1
license_link: LICENSE
short_description: 3D reconstruction from images/video with VGGT-Omega
models:
  - facebook/VGGT-Omega

VGGT-Ω Demo

Interactive demo for VGGT-Ω, a feed-forward camera and depth reconstruction model from the Visual Geometry Group (Oxford) and Meta AI.

Upload images or a short video, and the model returns a 3D point cloud with estimated camera poses, visualized as a GLB scene.
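As a rough illustration of how per-view depth and camera estimates can be turned into a point cloud, the sketch below unprojects a depth map through a pinhole camera and moves the points into world space. It is not the demo's actual code: the function name `unproject_depth`, the pinhole intrinsics `K`, depth measured along the camera z-axis, and the 4x4 camera-to-world pose convention are all assumptions made for this example, and the model's real output format may differ.

```python
import numpy as np

def unproject_depth(depth, K, cam_to_world):
    """Lift a depth map to world-space 3D points (hypothetical helper).

    Assumes a pinhole camera with 3x3 intrinsics K, depth measured along
    the camera z-axis, and a 4x4 camera-to-world pose matrix.
    """
    h, w = depth.shape
    # Pixel grid: u indexes columns (x), v indexes rows (y).
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    pixels = np.stack([u, v, np.ones_like(u)], axis=-1)
    pixels = pixels.reshape(-1, 3).astype(np.float64)
    # Back-project through the inverse intrinsics, then scale each ray by its depth.
    rays = pixels @ np.linalg.inv(K).T
    points_cam = rays * depth.reshape(-1, 1)
    # Apply the rotation and translation of the camera-to-world transform.
    R, t = cam_to_world[:3, :3], cam_to_world[:3, 3]
    return points_cam @ R.T + t

# Toy input: a 2x2 depth map at constant depth 2, camera shifted 1 unit along z.
depth = np.full((2, 2), 2.0)
K = np.array([[1.0, 0.0, 0.5],
              [0.0, 1.0, 0.5],
              [0.0, 0.0, 1.0]])
pose = np.eye(4)
pose[2, 3] = 1.0

points = unproject_depth(depth, K, pose)
print(points.shape)  # (4, 3): one 3D point per pixel
```

Repeating this for every input view and concatenating the results would give a single merged point cloud, which a mesh library could then export to GLB for viewing.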

Citation

@inproceedings{wang2026vggtomega,
  title={VGGT-{$\Omega$}},
  author={Wang, Jianyuan and Chen, Minghao and Zhang, Shangzhan and Karaev, Nikita and Sch{\"o}nberger, Johannes and Labatut, Patrick and Bojanowski, Piotr and Novotny, David and Vedaldi, Andrea and Rupprecht, Christian},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year={2026}
}

License

This demo is released under the FAIR Noncommercial Research License v1. See LICENSE.