Triad-Swin (3D-MRI self-supervised Swin-B backbone) -- Triad Swin-B encoder (MAE-pretrained)

Description

Triad vision foundation model for 3D MRI, ported to JAX / Equinox from the upstream PyTorch release. Triad is an nnUNet PlainConvEncoder pretrained self-supervised on Triad-131K (131,170 3D MRI volumes spanning brain, breast, and prostate; T1/T2/FLAIR/DWI/fMRI/DCE) and serves as a transfer-learning backbone for downstream MRI segmentation, classification, and registration. The published checkpoints are encoder-only (the self-supervised decoder / mask token are stripped); this port exposes the pretrained encoder, whose multi-scale features are the transfer representation. Two backbone families are ported: the nnUNet PlainConvUNet encoder (TriadPlainConvUNet) and the 3D Swin Transformer encoder (TriadSwinViT, the Swin-B variant, via the shared nimox SwinViT primitive). Each is released under two self-supervised objectives -- masked autoencoding (MAE) and SimMIM -- as separate bundles (four in total).

Intended use

Transfer-learning backbone for 3D MRI: contrast / task-agnostic multi-scale feature extraction via a 3D Swin Transformer. The forward returns the 5-level SwinViT feature pyramid (channels 48, 96, 192, 384, 768 at strides 2, 4, 8, 16, 32); a downstream consumer attaches its own decoder / head. Single-channel input with each spatial dim a multiple of 16. Pretrained by masked autoencoding (MAE) on Triad-131K. Encoder-only: the SSL reconstruction decoder is not shipped.

Usage

from ilex.models.triad import TriadSwinViT
model = TriadSwinViT.from_pretrained('ilex-hub/triad.swinb-mae.1')

Authors

Wang S., et al.

Citation

Wang S., Safari M., Li Q., Chang C.-W., Qiu R. L. J., Roper J., Yu D. S., Yang X. (2025). Triad: Vision Foundation Model for 3D Magnetic Resonance Imaging. arXiv:2502.14064. The Swin backbone is MONAI's SwinUNETR swinViT (use_v2): Hatamizadeh A., Nath V., Tang Y., Yang D., Roth H., Xu D. (2022). Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images. BrainLes 2021. arXiv:2201.01266. Tang Y., et al. (2022). Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image Analysis. CVPR. arXiv:2111.14791.

References

License

HF Hub license tag: mit

Effective terms: MIT (Shansong Wang et al.) on both the Triad code (https://github.com/wangshansong1/Triad) and the released pretrained checkpoints. No commercial restrictions; no gating required. The arXiv preprint (2502.14064) is separately distributed under CC BY 4.0, but the code and weights the ilex bundle re-expresses are MIT. The ilex JAX / Equinox port code is separately licensed under Apache-2.0 / GPL-3.0; that does not alter the upstream MIT terms governing the weights.

Upstream license reference: https://github.com/wangshansong1/Triad/blob/main/LICENSE

Copyright

Network architecture and pretrained weights: copyright (c) the Triad authors, released under the MIT License. JAX / Equinox port: copyright (c) the ilex authors, released under the Apache-2.0 / GPL-3.0 dual license used by ilex itself.

Upstream source

Original weights / reference implementation: https://github.com/wangshansong1/Triad

Provenance

This artefact was produced by ilex's save/load pipeline. The architecture is implemented in ilex.models.triad.TriadSwinViT and the weights have been converted from their upstream format. See the upstream source above for the canonical reference.

Downloads last month
22
Safetensors
Model size
18.6M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Papers for ilex-hub/triad.swinb-mae.1