JonnyYu828
/

DepthVLM-4B

Depth Estimation

image-text-to-text

vision-language-model

Model card Files Files and versions

JonnyYu828 commited on 2 days ago

Commit

0b46424

·

verified ·

1 Parent(s): b8ee814

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -19,7 +19,7 @@ paper:
 Update 2026-05-18 (v1.0): Initial release
-# DepthVLM
 DepthVLM serves as a unified foundation model for both low-level dense geometry prediction and high-level multimodal understanding, while achieving substantially faster inference compared with existing VLM-based approaches such as DepthLM and Youtu-VL.

 Update 2026-05-18 (v1.0): Initial release
+# DepthVLM-4B
 DepthVLM serves as a unified foundation model for both low-level dense geometry prediction and high-level multimodal understanding, while achieving substantially faster inference compared with existing VLM-based approaches such as DepthLM and Youtu-VL.