Mirali33
/

MOMO

@@ -13,7 +13,7 @@ tags:
 **MOMO** is the first multi-sensor foundation model for Mars remote sensing, accepted at **CVPR 2026**.
-It integrates representations learned independently from three Martian orbital sensors — HiRISE, CTX, and THEMIS — spanning resolutions from 0.25 m/pixel to 100 m/pixel, using task arithmetic model merging with a novel **Equal Validation Loss (EVL)** checkpoint selection strategy.
 [![arXiv](https://img.shields.io/badge/arXiv-2604.02719-b31b1b.svg?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2604.02719) [![GitHub](https://img.shields.io/badge/GitHub-kerner--lab%2FMOMO-black?logo=github&logoColor=white)](https://github.com/kerner-lab/MOMO)
@@ -29,7 +29,7 @@ Each model size includes 5 checkpoints:
 | `hirise.pth` | Pre-trained on HiRISE (High Resolution Imaging Science Experiment) |
 | `themis.pth` | Pre-trained on THEMIS (THermal EMission Imaging System) |
 | `hirise_ctx_themis.pth` | Pre-trained jointly on all three sensors |
-| `momo.pth` | **MOMO** — merged model via task arithmetic + EVL (main contribution) |
 Each checkpoint is available for three ViT architectures (all with patch size 16):
@@ -61,9 +61,9 @@ For full training and fine-tuning code, see the [MOMO GitHub repository](https:/
 ## Training Data
 MOMO is pre-trained on ~12 million samples (~4M per sensor) from Mars orbital imagery:
-- **HiRISE** — 0.25 m/pixel high-resolution visible spectrum images
-- **CTX** — 5 m/pixel context camera images
-- **THEMIS** — 100 m/pixel thermal infrared images
 ---

 **MOMO** is the first multi-sensor foundation model for Mars remote sensing, accepted at **CVPR 2026**.
+It integrates representations learned independently from three Martian orbital sensors (HiRISE, CTX, and THEMIS) spanning resolutions from 0.25 m/pixel to 100 m/pixel, using task arithmetic model merging with a novel **Equal Validation Loss (EVL)** checkpoint selection strategy.
 [![arXiv](https://img.shields.io/badge/arXiv-2604.02719-b31b1b.svg?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2604.02719) [![GitHub](https://img.shields.io/badge/GitHub-kerner--lab%2FMOMO-black?logo=github&logoColor=white)](https://github.com/kerner-lab/MOMO)
 | `hirise.pth` | Pre-trained on HiRISE (High Resolution Imaging Science Experiment) |
 | `themis.pth` | Pre-trained on THEMIS (THermal EMission Imaging System) |
 | `hirise_ctx_themis.pth` | Pre-trained jointly on all three sensors |
+| `momo.pth` | **MOMO** merged model via task arithmetic + EVL (main contribution) |
 Each checkpoint is available for three ViT architectures (all with patch size 16):
 ## Training Data
 MOMO is pre-trained on ~12 million samples (~4M per sensor) from Mars orbital imagery:
+- **HiRISE**: 0.25 m/pixel high-resolution visible spectrum images
+- **CTX**: 5 m/pixel context camera images
+- **THEMIS**: 100 m/pixel thermal infrared images
 ---