Triad-Swin (3D-MRI self-supervised Swin-B backbone) -- Triad Swin-B encoder (SimMIM-pretrained)
Description
Triad vision foundation model for 3D MRI, ported to JAX / Equinox from the upstream PyTorch release. Triad encoders are pretrained with self-supervision on Triad-131K (131,170 3D MRI volumes spanning brain, breast, and prostate; T1/T2/FLAIR/DWI/fMRI/DCE) and serve as transfer-learning backbones for downstream MRI segmentation, classification, and registration. The published checkpoints are encoder-only (the self-supervised decoder and mask token are stripped); this port exposes the pretrained encoder, whose multi-scale features are the transfer representation. Two backbone families are ported: the nnUNet PlainConvUNet encoder (TriadPlainConvUNet) and the 3D Swin Transformer encoder (TriadSwinViT, the Swin-B variant, via the shared nimox SwinViT primitive). Each is released under two self-supervised objectives -- masked autoencoding (MAE) and SimMIM -- as separate bundles (four in total). This bundle is the Swin-B encoder pretrained with SimMIM.
Intended use
Same intended use as the MAE variant (a transfer-learning backbone for downstream 3D MRI tasks), but pretrained with the SimMIM masked-image-modelling objective. The Swin-B encoder architecture and the Triad-131K corpus are the same in both variants; both are provided so that downstream users can pick whichever SSL objective transfers better for their task. Encoder-only.
Usage
from ilex.models.triad import TriadSwinViT
model = TriadSwinViT.from_pretrained('ilex-hub/triad.swinb-simmim.1')
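The Description calls the encoder's multi-scale features the transfer representation. As a rough illustration of what such a feature pyramid looks like, the sketch below computes the channel count and spatial size at each scale of a Swin-style hierarchical encoder. It does not call ilex or read the checkpoint. The helper name `swin_feature_shapes` is hypothetical, and the base width of 48, patch size of 2, and four 2x patch-merging stages are assumptions modelled on MONAI's SwinUNETR encoder; check the loaded model for the actual values.

```python
from typing import List, Tuple


def swin_feature_shapes(
    spatial: Tuple[int, int, int],
    base_width: int = 48,  # ASSUMPTION: Swin-B-sized width, not read from the checkpoint
    patch_size: int = 2,   # ASSUMPTION: stride of the patch embedding
    num_stages: int = 4,   # ASSUMPTION: number of 2x patch-merging stages
) -> List[Tuple[int, Tuple[int, ...]]]:
    """Return (channels, spatial_shape) for each scale of a Swin-style encoder."""
    # The deepest scale is downsampled by patch_size * 2**num_stages in total,
    # so every spatial dimension must be divisible by that factor.
    total_stride = patch_size * 2 ** num_stages
    if any(size % total_stride for size in spatial):
        raise ValueError(
            f"spatial dims {spatial} must be multiples of {total_stride}"
        )

    shapes = []
    channels, stride = base_width, patch_size
    # One scale after patch embedding, then one more after each merging stage.
    for _ in range(num_stages + 1):
        shapes.append((channels, tuple(size // stride for size in spatial)))
        channels *= 2  # patch merging doubles the channel count
        stride *= 2    # and halves each spatial dimension
    return shapes


if __name__ == "__main__":
    for channels, shape in swin_feature_shapes((96, 96, 96)):
        print(channels, shape)
```

Under these assumptions, input volumes would need each spatial dimension to be a multiple of 32, so cropping or padding to such a size before calling the encoder is a reasonable precaution.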
Authors
Wang S., et al.
Citation
Wang S., Safari M., Li Q., Chang C.-W., Qiu R. L. J., Roper J., Yu D. S., Yang X. (2025). Triad: Vision Foundation Model for 3D Magnetic Resonance Imaging. arXiv:2502.14064.
The Swin backbone is MONAI's SwinUNETR swinViT (use_v2):
- Hatamizadeh A., Nath V., Tang Y., Yang D., Roth H., Xu D. (2022). Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images. BrainLes 2021. arXiv:2201.01266.
- Tang Y., et al. (2022). Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image Analysis. CVPR. arXiv:2111.14791.
References
- Wang S., et al. (2025). Triad: Vision Foundation Model for 3D Magnetic Resonance Imaging. arXiv:2502.14064. https://arxiv.org/abs/2502.14064
- Codebase: https://github.com/wangshansong1/Triad
License
HF Hub license tag: mit
Effective terms: MIT (Shansong Wang et al.) on both the Triad code (https://github.com/wangshansong1/Triad) and the released pretrained checkpoints. No commercial restrictions; no gating required. The arXiv preprint (2502.14064) is separately distributed under CC BY 4.0, but the code and weights the ilex bundle re-expresses are MIT. The ilex JAX / Equinox port code is separately licensed under Apache-2.0 / GPL-3.0; that does not alter the upstream MIT terms governing the weights.
Upstream license reference: https://github.com/wangshansong1/Triad/blob/main/LICENSE
Copyright
Network architecture and pretrained weights: copyright (c) the Triad authors, released under the MIT License. JAX / Equinox port: copyright (c) the ilex authors, released under the Apache-2.0 / GPL-3.0 dual license used by ilex itself.
Upstream source
Original weights / reference implementation: https://github.com/wangshansong1/Triad
Provenance
This artefact was produced by ilex's save/load pipeline. The architecture is implemented in ilex.models.triad.TriadSwinViT and the weights have been converted from their upstream format. See the upstream source above for the canonical reference.