Keypoint Detection
sapiens
sapiens2
human-centric
pose
rawalkhirodkar commited on
Commit
a0dcfcd
Β·
verified Β·
1 Parent(s): 7294a9a

Update model card

Browse files
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -20,14 +20,14 @@ This repository contains the **1B Pose Estimation** checkpoint, finetuned from t
20
 
21
  Pose is top-down β€” it requires bounding boxes from a person detector. We use [RTMDet](https://github.com/open-mmlab/mmdetection/tree/main/configs/rtmdet).
22
 
23
- - πŸ“„ **Paper:** [OpenReview (ICLR 2026)](https://openreview.net/pdf?id=IVAlYCqdvW)
24
  - 🌐 **Project Page:** [rawalkhirodkar.github.io/sapiens2](https://rawalkhirodkar.github.io/sapiens2)
25
  - πŸ’» **Code:** [github.com/facebookresearch/sapiens2](https://github.com/facebookresearch/sapiens2)
26
 
27
  ## Model Details
28
 
29
  - **Developed by:** Meta
30
- - **Model type:** Vision Transformer + Pose Estimation head
31
  - **License:** [Sapiens2 License](https://github.com/facebookresearch/sapiens2/blob/main/LICENSE.md)
32
  - **Task:** pose
33
  - **Base model:** [facebook/sapiens2-pretrain-1b](https://huggingface.co/facebook/sapiens2-pretrain-1b)
@@ -86,10 +86,10 @@ Released under the [Sapiens2 License](https://github.com/facebookresearch/sapien
86
  ## Citation
87
 
88
  ```bibtex
89
- @inproceedings{khirodkar2026sapiens2,
90
  title={Sapiens2},
91
- author={Khirodkar, Rawal and Wen, He and Martinez, Julieta and Dong, Yuan and Zhaoen, Su and Saito, Shunsuke},
92
- booktitle={International Conference on Learning Representations (ICLR)},
93
  year={2026}
94
  }
95
  ```
 
20
 
21
  Pose is top-down β€” it requires bounding boxes from a person detector. We use [RTMDet](https://github.com/open-mmlab/mmdetection/tree/main/configs/rtmdet).
22
 
23
+ - πŸ“„ **Paper:** [arXiv:2604.21681](https://arxiv.org/pdf/2604.21681)
24
  - 🌐 **Project Page:** [rawalkhirodkar.github.io/sapiens2](https://rawalkhirodkar.github.io/sapiens2)
25
  - πŸ’» **Code:** [github.com/facebookresearch/sapiens2](https://github.com/facebookresearch/sapiens2)
26
 
27
  ## Model Details
28
 
29
  - **Developed by:** Meta
30
+ - **Model type:** Vision Transformer
31
  - **License:** [Sapiens2 License](https://github.com/facebookresearch/sapiens2/blob/main/LICENSE.md)
32
  - **Task:** pose
33
  - **Base model:** [facebook/sapiens2-pretrain-1b](https://huggingface.co/facebook/sapiens2-pretrain-1b)
 
86
  ## Citation
87
 
88
  ```bibtex
89
+ @article{khirodkarsapiens2,
90
  title={Sapiens2},
91
+ author={Khirodkar, Rawal and Wen, He and Martinez, Julieta and Dong, Yuan and Su, Zhaoen and Saito, Shunsuke},
92
+ journal={arXiv preprint arXiv:2604.21681},
93
  year={2026}
94
  }
95
  ```