DPT 3.0 release - a nielsr Collection

updated Jan 25, 2024

DPT 3.0 (MiDaS) models, leveraging ViT and ViT-hybrid backbones

Vision Transformers for Dense Prediction

Paper • 2103.13413 • Published Mar 24, 2021 • 1

Intel/dpt-large

Depth Estimation • 0.3B • Updated Feb 24, 2024 • 61.2k • 204

Note: This model leverages a Vision Transformer (ViT) backbone for monocular depth estimation.

Intel/dpt-hybrid-midas

Depth Estimation • Updated Feb 9, 2024 • 465k • 106

Note: This model leverages a hybrid Vision Transformer (ViT-hybrid) backbone for monocular depth estimation.
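
For readers who want to try either checkpoint, the following is a minimal sketch of running monocular depth estimation through the Hugging Face `transformers` depth-estimation pipeline. The two checkpoint names come from the listing above; the `CHECKPOINTS` dictionary, its keys, and the `estimate_depth` helper are hypothetical names introduced only for illustration, and the sketch assumes `transformers`, a PyTorch backend, and Pillow are installed.

```python
# Checkpoint names are taken from the collection; the keys are illustrative labels.
CHECKPOINTS = {
    "vit": "Intel/dpt-large",
    "vit-hybrid": "Intel/dpt-hybrid-midas",
}


def estimate_depth(image, backbone="vit"):
    """Return a depth map (PIL image) for `image`.

    `image` may be a file path, a URL, or a PIL image.
    `backbone` selects one of the two checkpoints listed above.
    """
    # Resolve the checkpoint first so an unknown key fails before any download.
    model_id = CHECKPOINTS[backbone]

    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    estimator = pipeline("depth-estimation", model=model_id)
    result = estimator(image)

    # The pipeline returns a dict; "depth" holds the depth map as a PIL image
    # and "predicted_depth" holds the raw tensor.
    return result["depth"]
```

A call such as `estimate_depth("room.jpg", backbone="vit-hybrid").save("room_depth.png")` would write the predicted depth map to disk, where `room.jpg` stands in for any local image. The first call downloads the model weights, so it needs network access.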