Depth Estimation
sapiens
sapiens2
human-centric
normal
rawalkhirodkar commited on
Commit
9feed9d
ยท
verified ยท
1 Parent(s): 9b66f54

Update model card

Browse files
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -18,14 +18,14 @@ Per-pixel surface-normal estimation (3-channel unit vectors in camera frame).
18
 
19
  This repository contains the **0.4B Surface Normal Estimation** checkpoint, finetuned from the [Sapiens2-0.4B pretrained backbone](https://huggingface.co/facebook/sapiens2-pretrain-0.4b).
20
 
21
- - ๐Ÿ“„ **Paper:** [OpenReview (ICLR 2026)](https://openreview.net/pdf?id=IVAlYCqdvW)
22
  - ๐ŸŒ **Project Page:** [rawalkhirodkar.github.io/sapiens2](https://rawalkhirodkar.github.io/sapiens2)
23
  - ๐Ÿ’ป **Code:** [github.com/facebookresearch/sapiens2](https://github.com/facebookresearch/sapiens2)
24
 
25
  ## Model Details
26
 
27
  - **Developed by:** Meta
28
- - **Model type:** Vision Transformer + Surface Normal Estimation head
29
  - **License:** [Sapiens2 License](https://github.com/facebookresearch/sapiens2/blob/main/LICENSE.md)
30
  - **Task:** normal
31
  - **Base model:** [facebook/sapiens2-pretrain-0.4b](https://huggingface.co/facebook/sapiens2-pretrain-0.4b)
@@ -84,10 +84,10 @@ Released under the [Sapiens2 License](https://github.com/facebookresearch/sapien
84
  ## Citation
85
 
86
  ```bibtex
87
- @inproceedings{khirodkar2026sapiens2,
88
  title={Sapiens2},
89
- author={Khirodkar, Rawal and Wen, He and Martinez, Julieta and Dong, Yuan and Zhaoen, Su and Saito, Shunsuke},
90
- booktitle={International Conference on Learning Representations (ICLR)},
91
  year={2026}
92
  }
93
  ```
 
18
 
19
  This repository contains the **0.4B Surface Normal Estimation** checkpoint, finetuned from the [Sapiens2-0.4B pretrained backbone](https://huggingface.co/facebook/sapiens2-pretrain-0.4b).
20
 
21
+ - ๐Ÿ“„ **Paper:** [arXiv:2604.21681](https://arxiv.org/pdf/2604.21681)
22
  - ๐ŸŒ **Project Page:** [rawalkhirodkar.github.io/sapiens2](https://rawalkhirodkar.github.io/sapiens2)
23
  - ๐Ÿ’ป **Code:** [github.com/facebookresearch/sapiens2](https://github.com/facebookresearch/sapiens2)
24
 
25
  ## Model Details
26
 
27
  - **Developed by:** Meta
28
+ - **Model type:** Vision Transformer
29
  - **License:** [Sapiens2 License](https://github.com/facebookresearch/sapiens2/blob/main/LICENSE.md)
30
  - **Task:** normal
31
  - **Base model:** [facebook/sapiens2-pretrain-0.4b](https://huggingface.co/facebook/sapiens2-pretrain-0.4b)
 
84
  ## Citation
85
 
86
  ```bibtex
87
+ @article{khirodkarsapiens2,
88
  title={Sapiens2},
89
+ author={Khirodkar, Rawal and Wen, He and Martinez, Julieta and Dong, Yuan and Su, Zhaoen and Saito, Shunsuke},
90
+ journal={arXiv preprint arXiv:2604.21681},
91
  year={2026}
92
  }
93
  ```