README: keep YAML frontmatter on HF; align body with GitHub
Browse files
README.md
CHANGED
|
@@ -13,11 +13,14 @@ tags:
|
|
| 13 |
|
| 14 |
# Mosaic — Block-Sparse Attention for Weather Forecasting
|
| 15 |
|
| 16 |
-
**
|
|
|
|
|
|
|
| 17 |
|
| 18 |
> **(Sparse) Attention to the Details: Preserving Spectral Fidelity in ML-based Weather Forecasting Models** \
|
| 19 |
-
> Maksim Zhdanov, Ana Lucic, Max Welling, Jan-Willem van de Meent
|
| 20 |
-
|
|
|
|
| 21 |
|
| 22 |

|
| 23 |
|
|
@@ -234,3 +237,11 @@ Released under [CC-BY-NC-4.0](https://creativecommons.org/licenses/by-nc/4.0/).
|
|
| 234 |
## Acknowledgements
|
| 235 |
|
| 236 |
MZ acknowledges support from Microsoft Research AI4Science. JWvdM acknowledges support from the European Union Horizon Framework Programme (Grant agreement ID: 101120237). This work used the Dutch national e-infrastructure with the support of the SURF Cooperative using grant no. EINF-16923. Computations were partially performed using the UvA/FNWI HPC Facility.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 13 |
|
| 14 |
# Mosaic — Block-Sparse Attention for Weather Forecasting
|
| 15 |
|
| 16 |
+
| 📄 [**Paper**](https://arxiv.org/abs/2604.16429) | 🤗 [**Hugging Face**](https://huggingface.co/maxxxzdn/mosaic) | 💻 [**GitHub**](https://github.com/maxxxzdn/mosaic) |
|
| 17 |
+
| :---: | :---: | :---: |
|
| 18 |
+
| ICML 2026 · arXiv:2604.16429 | Pretrained weights & model card | Source code & issue tracker |
|
| 19 |
|
| 20 |
> **(Sparse) Attention to the Details: Preserving Spectral Fidelity in ML-based Weather Forecasting Models** \
|
| 21 |
+
> Maksim Zhdanov, Ana Lucic, Max Welling, Jan-Willem van de Meent · *ICML 2026*
|
| 22 |
+
|
| 23 |
+
**Mosaic** is a probabilistic weather forecasting model that operates on native-resolution grids via mesh-aligned block-sparse attention. At 1.5° resolution with 214M parameters, Mosaic matches or outperforms models trained on 6× finer resolution on key variables, and individual ensemble members exhibit near-perfect spectral alignment across all resolved frequencies. A 24-member, 10-day forecast takes under 12 s on a single H100 GPU.
|
| 24 |
|
| 25 |

|
| 26 |
|
|
|
|
| 237 |
## Acknowledgements
|
| 238 |
|
| 239 |
MZ acknowledges support from Microsoft Research AI4Science. JWvdM acknowledges support from the European Union Horizon Framework Programme (Grant agreement ID: 101120237). This work used the Dutch national e-infrastructure with the support of the SURF Cooperative using grant no. EINF-16923. Computations were partially performed using the UvA/FNWI HPC Facility.
|
| 240 |
+
|
| 241 |
+
## Model card metadata
|
| 242 |
+
|
| 243 |
+
| Field | Value |
|
| 244 |
+
|---------------|-------|
|
| 245 |
+
| License | [`cc-by-nc-4.0`](https://creativecommons.org/licenses/by-nc/4.0/) |
|
| 246 |
+
| Library | `pytorch` |
|
| 247 |
+
| Tags | `weather` · `weather-forecasting` · `climate` · `atmospheric-science` · `sparse-attention` · `transformer` · `probabilistic-forecasting` |
|