facebook
/

sapiens2-pose-1b

Keypoint Detection

Model card Files Files and versions

rawalkhirodkar commited on 13 days ago

Commit

a0dcfcd

·

verified ·

1 Parent(s): 7294a9a

Update model card

Files changed (1) hide show

README.md +5 -5

README.md CHANGED Viewed

@@ -20,14 +20,14 @@ This repository contains the **1B Pose Estimation** checkpoint, finetuned from t
 Pose is top-down — it requires bounding boxes from a person detector. We use [RTMDet](https://github.com/open-mmlab/mmdetection/tree/main/configs/rtmdet).
-- 📄 **Paper:** [OpenReview (ICLR 2026)](https://openreview.net/pdf?id=IVAlYCqdvW)
 - 🌐 **Project Page:** [rawalkhirodkar.github.io/sapiens2](https://rawalkhirodkar.github.io/sapiens2)
 - 💻 **Code:** [github.com/facebookresearch/sapiens2](https://github.com/facebookresearch/sapiens2)
 ## Model Details
 - **Developed by:** Meta
-- **Model type:** Vision Transformer + Pose Estimation head
 - **License:** [Sapiens2 License](https://github.com/facebookresearch/sapiens2/blob/main/LICENSE.md)
 - **Task:** pose
 - **Base model:** [facebook/sapiens2-pretrain-1b](https://huggingface.co/facebook/sapiens2-pretrain-1b)
@@ -86,10 +86,10 @@ Released under the [Sapiens2 License](https://github.com/facebookresearch/sapien
 ## Citation
 ```bibtex
-@inproceedings{khirodkar2026sapiens2,
   title={Sapiens2},
-  author={Khirodkar, Rawal and Wen, He and Martinez, Julieta and Dong, Yuan and Zhaoen, Su and Saito, Shunsuke},
-  booktitle={International Conference on Learning Representations (ICLR)},
   year={2026}
 }
 ```

 Pose is top-down — it requires bounding boxes from a person detector. We use [RTMDet](https://github.com/open-mmlab/mmdetection/tree/main/configs/rtmdet).
+- 📄 **Paper:** [arXiv:2604.21681](https://arxiv.org/pdf/2604.21681)
 - 🌐 **Project Page:** [rawalkhirodkar.github.io/sapiens2](https://rawalkhirodkar.github.io/sapiens2)
 - 💻 **Code:** [github.com/facebookresearch/sapiens2](https://github.com/facebookresearch/sapiens2)
 ## Model Details
 - **Developed by:** Meta
+- **Model type:** Vision Transformer
 - **License:** [Sapiens2 License](https://github.com/facebookresearch/sapiens2/blob/main/LICENSE.md)
 - **Task:** pose
 - **Base model:** [facebook/sapiens2-pretrain-1b](https://huggingface.co/facebook/sapiens2-pretrain-1b)
 ## Citation
 ```bibtex
+@article{khirodkarsapiens2,
   title={Sapiens2},
+  author={Khirodkar, Rawal and Wen, He and Martinez, Julieta and Dong, Yuan and Su, Zhaoen and Saito, Shunsuke},
+  journal={arXiv preprint arXiv:2604.21681},
   year={2026}
 }
 ```