# MeFEm: Medical Face Embedding Models

Vision Transformers pre-trained on face data for potential medical applications. Available in Small (MeFEm-S) and Base (MeFEm-B) sizes.

## Quick Start

```python
import torch
import timm

# Load model (MeFEm-S example)
model = timm.create_model(
    'vit_small_patch16_224',
    pretrained=False,
    num_classes=0,       # No classification head
    global_pool='token'  # Use CLS token (default)
)
model.load_state_dict(torch.load('mefem-s.pt'))
model.eval()

# Forward pass
x = torch.randn(1, 3, 224, 224)  # Your face image
embeddings = model(x)  # [1, 384] CLS token embeddings
```
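
For real images, timm can build a matching preprocessing pipeline. A minimal sketch, assuming timm's default configuration for `vit_small_patch16_224` (the normalization statistics actually used to train MeFEm may differ, and `face.jpg` is a placeholder):

```python
from PIL import Image
from timm.data import resolve_data_config, create_transform

# Derive resize/normalization settings from the model's default config.
# Assumption: MeFEm used timm's standard ViT preprocessing.
config = resolve_data_config({}, model=model)
transform = create_transform(**config)

img = Image.open('face.jpg').convert('RGB')  # placeholder input file
x = transform(img).unsqueeze(0)              # [1, 3, 224, 224]
with torch.no_grad():
    embeddings = model(x)                    # [1, 384]
```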

## Model Details

- **Architecture**: ViT-Small/16 (384-dim) or ViT-Base/16 (768-dim) with CLS token; a Base-variant loading sketch follows this list
- **Training**: Modified I-JEPA on ~6.5M face images
- **Input**: Face crops with 2× expanded bounding boxes, 224×224 resolution
- **Output**: CLS token embeddings (`global_pool='token'`) or all tokens (`global_pool=''`)
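
The Base variant loads the same way; a minimal sketch, assuming the checkpoint file is named `mefem-b.pt` in line with the Small example above:

```python
model_b = timm.create_model(
    'vit_base_patch16_224',
    pretrained=False,
    num_classes=0,
    global_pool='token'
)
model_b.load_state_dict(torch.load('mefem-b.pt'))  # hypothetical file name
model_b.eval()
embeddings = model_b(x)  # [1, 768] for MeFEm-B
```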

## Usage Tips

```python
# For all tokens (CLS + patches): recreate the model with an empty pool,
# then reload the checkpoint as in Quick Start:
model = timm.create_model('vit_small_patch16_224', num_classes=0, global_pool='')
tokens = model(x)  # [1, 197, 384]

# For patch embeddings only, drop the CLS token:
tokens = model.forward_features(x)
patch_embeddings = tokens[:, 1:]  # [1, 196, 384]
```
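
As a usage example, CLS embeddings can be compared directly; `x_a` and `x_b` below are hypothetical preprocessed [1, 3, 224, 224] face tensors:

```python
import torch.nn.functional as F

# Uses the CLS-token model from Quick Start (global_pool='token').
with torch.no_grad():
    emb_a, emb_b = model(x_a), model(x_b)  # each [1, 384]
similarity = F.cosine_similarity(emb_a, emb_b).item()  # scalar in [-1, 1]
```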

## Training Data

Face images from FaceCaption-15M, AVSpeech, and SHFQ datasets (~6.5M total). Images were cropped with expanded (2×) face bounding boxes, as sketched below.
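
A minimal sketch of that loose-cropping step, assuming square expansion around the box center with clipping at image borders (the exact padding and aspect handling used for MeFEm may differ):

```python
from PIL import Image

def expand_and_crop(img: Image.Image, box, scale: float = 2.0, size: int = 224) -> Image.Image:
    """Crop a face with its bounding box expanded `scale`x around the center."""
    x0, y0, x1, y1 = box
    cx, cy = (x0 + x1) / 2, (y0 + y1) / 2
    half = max(x1 - x0, y1 - y0) * scale / 2
    crop = img.crop((
        max(0, int(cx - half)), max(0, int(cy - half)),
        min(img.width, int(cx + half)), min(img.height, int(cy + half)),
    ))
    return crop.resize((size, size))
```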

## Notes

- Optimized for face images with loose cropping
- Intended for representation learning and transfer to medical tasks; a linear-probe sketch follows this list
- Results may vary for non-face or tightly cropped images
- More details on training and evaluation metrics are in the [paper](https://arxiv.org/pdf/2602.14672)
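
A minimal transfer sketch, training a linear probe on frozen embeddings; `faces`, `labels`, and `num_classes` are placeholders for your own downstream (e.g. medical) dataset:

```python
import torch
import torch.nn as nn

# Assumptions: `model` is the CLS-token MeFEm-S model from Quick Start,
# `faces` a [N, 3, 224, 224] batch, `labels` a [N] tensor of class ids.
with torch.no_grad():
    feats = model(faces)  # frozen [N, 384] embeddings

probe = nn.Linear(feats.shape[1], num_classes)
optimizer = torch.optim.AdamW(probe.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()
for _ in range(100):  # full-batch training for illustration
    optimizer.zero_grad()
    loss = loss_fn(probe(feats), labels)
    loss.backward()
    optimizer.step()
```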

## License

CC BY 4.0. Please cite the paper if you use these models:

```
@misc{borets2026mefemmedicalfaceembedding,
  title={MeFEm: Medical Face Embedding model},
  author={Yury Borets and Stepan Botman},
  year={2026},
  eprint={2602.14672},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2602.14672},
}
```