sapiens
sapiens2
human-centric
vision-transformer
File size: 4,043 Bytes
6951533
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2acbac6
6951533
 
 
 
 
 
 
 
 
 
 
 
2acbac6
6951533
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2acbac6
6951533
2acbac6
 
6951533
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
---
license: other
license_name: sapiens2-license
license_link: https://github.com/facebookresearch/sapiens2/blob/main/LICENSE.md
library_name: sapiens
tags:
  - sapiens
  - sapiens2
  - human-centric
  - vision-transformer
---

# Sapiens2

Sapiens2 is a family of high-resolution vision transformers pretrained on **1 billion human images** โ€” designed for human-centric tasks such as pose estimation, body-part segmentation, surface normals, and pointmaps.

This is the **index** repository: each variant lives in its own model repo (linked below).

- ๐Ÿ“„ **Paper:** [arXiv:2604.21681](https://arxiv.org/pdf/2604.21681)
- ๐ŸŒ **Project Page:** [rawalkhirodkar.github.io/sapiens2](https://rawalkhirodkar.github.io/sapiens2)
- ๐Ÿ’ป **Code:** [github.com/facebookresearch/sapiens2](https://github.com/facebookresearch/sapiens2)
- ๐Ÿ“š **Collection:** [Sapiens2 on HuggingFace](https://huggingface.co/collections/facebook/sapiens2)

## Pretrained Backbones

| Model | Params | Repository |
|-------|--------|------------|
| Sapiens2-0.1B | 0.114 B | [facebook/sapiens2-pretrain-0.1b](https://huggingface.co/facebook/sapiens2-pretrain-0.1b) |
| Sapiens2-0.4B | 0.398 B | [facebook/sapiens2-pretrain-0.4b](https://huggingface.co/facebook/sapiens2-pretrain-0.4b) |
| Sapiens2-0.8B | 0.818 B | [facebook/sapiens2-pretrain-0.8b](https://huggingface.co/facebook/sapiens2-pretrain-0.8b) |
| Sapiens2-1B | 1.462 B | [facebook/sapiens2-pretrain-1b](https://huggingface.co/facebook/sapiens2-pretrain-1b) |
| Sapiens2-1B (4K) | 1.607 B | [facebook/sapiens2-pretrain-1b-4k](https://huggingface.co/facebook/sapiens2-pretrain-1b-4k) |
| Sapiens2-5B | 5.071 B | [facebook/sapiens2-pretrain-5b](https://huggingface.co/facebook/sapiens2-pretrain-5b) |

## Task Checkpoints

### Pose Estimation

| Model | Repository |
|-------|------------|
| Sapiens2-0.4B | [facebook/sapiens2-pose-0.4b](https://huggingface.co/facebook/sapiens2-pose-0.4b) |
| Sapiens2-0.8B | [facebook/sapiens2-pose-0.8b](https://huggingface.co/facebook/sapiens2-pose-0.8b) |
| Sapiens2-1B | [facebook/sapiens2-pose-1b](https://huggingface.co/facebook/sapiens2-pose-1b) |
| Sapiens2-5B | [facebook/sapiens2-pose-5b](https://huggingface.co/facebook/sapiens2-pose-5b) |

### Body-Part Segmentation

| Model | Repository |
|-------|------------|
| Sapiens2-0.4B | [facebook/sapiens2-seg-0.4b](https://huggingface.co/facebook/sapiens2-seg-0.4b) |
| Sapiens2-0.8B | [facebook/sapiens2-seg-0.8b](https://huggingface.co/facebook/sapiens2-seg-0.8b) |
| Sapiens2-1B | [facebook/sapiens2-seg-1b](https://huggingface.co/facebook/sapiens2-seg-1b) |
| Sapiens2-5B | [facebook/sapiens2-seg-5b](https://huggingface.co/facebook/sapiens2-seg-5b) |

### Surface Normal Estimation

| Model | Repository |
|-------|------------|
| Sapiens2-0.4B | [facebook/sapiens2-normal-0.4b](https://huggingface.co/facebook/sapiens2-normal-0.4b) |
| Sapiens2-0.8B | [facebook/sapiens2-normal-0.8b](https://huggingface.co/facebook/sapiens2-normal-0.8b) |
| Sapiens2-1B | [facebook/sapiens2-normal-1b](https://huggingface.co/facebook/sapiens2-normal-1b) |
| Sapiens2-5B | [facebook/sapiens2-normal-5b](https://huggingface.co/facebook/sapiens2-normal-5b) |

### Pointmap Estimation

| Model | Repository |
|-------|------------|
| Sapiens2-0.4B | [facebook/sapiens2-pointmap-0.4b](https://huggingface.co/facebook/sapiens2-pointmap-0.4b) |
| Sapiens2-0.8B | [facebook/sapiens2-pointmap-0.8b](https://huggingface.co/facebook/sapiens2-pointmap-0.8b) |
| Sapiens2-1B | [facebook/sapiens2-pointmap-1b](https://huggingface.co/facebook/sapiens2-pointmap-1b) |
| Sapiens2-5B | [facebook/sapiens2-pointmap-5b](https://huggingface.co/facebook/sapiens2-pointmap-5b) |

## License

Released under the [Sapiens2 License](https://github.com/facebookresearch/sapiens2/blob/main/LICENSE.md).

## Citation

```bibtex
@article{khirodkarsapiens2,
  title={Sapiens2},
  author={Khirodkar, Rawal and Wen, He and Martinez, Julieta and Dong, Yuan and Su, Zhaoen and Saito, Shunsuke},
  journal={arXiv preprint arXiv:2604.21681},
  year={2026}
}
```