add model card
Browse filesinitial draft model card
README.md
CHANGED
|
@@ -1,5 +1,98 @@
|
|
| 1 |
-
---
|
| 2 |
-
license:
|
| 3 |
-
tags:
|
| 4 |
-
- weather
|
| 5 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: cc-by-4.0
|
| 3 |
+
tags:
|
| 4 |
+
- weather
|
| 5 |
+
- numerical-weather-prediction
|
| 6 |
+
- gnn
|
| 7 |
+
- autoregressive
|
| 8 |
+
- medium-range-forecast
|
| 9 |
+
pipeline_tag: graph-ml
|
| 10 |
+
---
|
| 11 |
+
|
| 12 |
+
# Model Card for FastNet-v1.1
|
| 13 |
+
|
| 14 |
+
**FastNet** is an data-driven medium range numerical weather prediction model developed jointly by the UK Met Office and the Alan Turing Institute. This release of FastNet v.1.1 marks the first publicly shared experimental release of the FastNet project.
|
| 15 |
+
|
| 16 |
+
FastNet produces highly skilled forecasts that overcome commonly known limitations of AI models, resulting in more physically realistic forecasts as demonstrated in the corresponding publication xxx.
|
| 17 |
+
|
| 18 |
+
## Model Details
|
| 19 |
+
|
| 20 |
+
|
| 21 |
+

|
| 22 |
+
### Model Description
|
| 23 |
+
|
| 24 |
+
FastNet has an encode-process-decode structure with a series of graph neural networks and auto-regressive rollout. The encoder is a directional bipartite graph linking the current atmospheric state defined on input grid cells to a lower resolution latent space defined on mesh nodes. The processor then advances the mesh state in time by six hour increments. The processor operates on a multi-scale icosahedral mesh, starting from the 12-node icosahedron and subdividing six times, enabling the model to capture both localised and long-range interactions. Finally, the decoder maps the latent mesh representation back to the output domain, and the prediction is fed back as input for subsequent steps during rollout. FastNet uses a residual formulation, where the decoder output represents the increment to be added to the input state via skip-level connections, rather than predicting the full field from scratch. Notably, FastNet was trained with loss-function adaptations designed to improve physical realism compared to similar models.
|
| 25 |
+
|
| 26 |
+
- **Developed by:** Met Office & The Alan Turing Institute
|
| 27 |
+
- **Model type:** Encoder-processor-decoder model
|
| 28 |
+
- **License:** These model weights are published under a Creative Commons Attribution 4.0 International (CC BY 4.0). To view a copy of this licence, visit [https://creativecommons.org/licenses/by/4.0/](https://creativecommons.org/licenses/by/4.0/). The corresponding inference scripts are released under the under the GNU Affero General Public License (AGPL) v3.0.
|
| 29 |
+
|
| 30 |
+
### Model Sources
|
| 31 |
+
|
| 32 |
+
- **Inference-only repository:** https://github.com/MetOffice/fastnet-inference
|
| 33 |
+
- **Paper:** _add-doi-here_
|
| 34 |
+
|
| 35 |
+
## Uses
|
| 36 |
+
|
| 37 |
+
### Direct Use
|
| 38 |
+
This model is intended for research and exploratory inference on historical or real-time atmospheric reanalysis inputs to produce global weather pattern predictions over a time horizon of up to ~2 days.
|
| 39 |
+
It is released for inference only: weights are provided for forward prediction, but training and fine-tuning are not supported in this release.
|
| 40 |
+
|
| 41 |
+
Typical direct uses include: benchmarking against baselines, sensitivity experiments (e.g., perturbing input fields), case-study analysis of notable events
|
| 42 |
+
|
| 43 |
+
### Out of scope
|
| 44 |
+
This release is not intended for operational forecasting, safety-critical decision making or issuing public warnings
|
| 45 |
+
|
| 46 |
+
## Known Limitations
|
| 47 |
+
|
| 48 |
+
## Training Details
|
| 49 |
+
|
| 50 |
+
### Training Data
|
| 51 |
+
FastNet is trained on the Copernicus ERA5 reanalysis dataset produced by ECMWF. This dataset is also available via [anemoi](https://anemoi.readthedocs.io/projects/training/en/latest/user-guide/download-era5-o96.html)
|
| 52 |
+
We use data re-gridded to an O96 (~ grid spacing of 104km) reduced Gaussian grid from 13 pressure levels taken at 6 hour time snapshots.
|
| 53 |
+
|
| 54 |
+
The full list of input and output fields is shown below:
|
| 55 |
+
|
| 56 |
+
|Variable|Level|Input/Output|
|
| 57 |
+
|---|---|---|
|
| 58 |
+
|Geopotential|Pressure levels (hPa): 50, 100, 150, 200, 250, 300, 400, 500, 600, 700, 850, 925, 1000|Both|
|
| 59 |
+
|Horizontal wind (zonal & meridional components)|Pressure levels (hPa): 50, 100, 150, 200, 250, 300, 400, 500, 600, 700, 850, 925, 1000|Both|
|
| 60 |
+
|Specific humidity|Pressure levels (hPa): 50, 100, 150, 200, 250, 300, 400, 500, 600, 700, 850, 925, 1000|Both|
|
| 61 |
+
|Temperature|Pressure levels (hPa): 50, 100, 150, 200, 250, 300, 400, 500, 600, 700, 850, 925, 1000|Both|
|
| 62 |
+
|Surface pressure|Surface|Both|
|
| 63 |
+
|Mean sea-level pressure|Surface|Both|
|
| 64 |
+
|Skin temperature|Surface|Both|
|
| 65 |
+
|2m temperature|Surface|Both|
|
| 66 |
+
|2m dewpoint temperature|Surface|Both|
|
| 67 |
+
|10m horizontal wind (zonal & meridional components)|Surface|Both|
|
| 68 |
+
|Land-sea mask|Surface|Input-only|
|
| 69 |
+
|Orography|Surface|Input-only|
|
| 70 |
+
|Standard deviation of sub-grid orography|Surface|Input-only|
|
| 71 |
+
|Slope of sub-scale orography|Surface|Input-only|
|
| 72 |
+
|Top-of-atmosphere solar radiation|Surface|Input-only|
|
| 73 |
+
|Solar hour angle (cos & sin)|Surface|Input-only|
|
| 74 |
+
|Time of year (cos & sin)|Surface|Input-only|
|
| 75 |
+
|Latitude (cos & sin)|Surface|Input-only|
|
| 76 |
+
|Longitude (cos & sin)|Surface|Input-only|
|
| 77 |
+
|
| 78 |
+
### Training Procedure
|
| 79 |
+
|
| 80 |
+
Below we summarize the three-stage training procedure, including the number of update steps and the loss/objective used at each stage.
|
| 81 |
+
|
| 82 |
+
| Stage | # rollout | Loss / Objective |
|
| 83 |
+
| ------------- | ----------- | ---------------- |
|
| 84 |
+
| Pre-training | n = 1 (6h) | weighted MSE |
|
| 85 |
+
| Fine-tuning 1 | n = 7 (42h) | MSE |
|
| 86 |
+
| Fine-tuning 2 | n = 7 (42h) | spectral MSE |
|
| 87 |
+
Weighting was applied per-variable and was proportional to pressure level.
|
| 88 |
+
|
| 89 |
+
#### Preprocessing
|
| 90 |
+
Training data from each pressure level (including surface-level variables) are separately standardised to have zero mean and unit standard deviation, over latitude, longitude and time. Orography is rescaled to the unit interval, and land-sea mask, solar hour angle, time of year, latitude, and longitude require no change as they are already suitably normalised.
|
| 91 |
+
|
| 92 |
+
### Speed, Sizes, Times
|
| 93 |
+
|
| 94 |
+
## Evaluation
|
| 95 |
+
|
| 96 |
+
FastNet is evaluated against ERA5 data using the WeatherBench 2 software package analysis for 2022. We compute the full rollout for all forecast valid times in 2022, re-gridding the output to a 1.5 degree latitude-longitude grid using a conservative re-gridding scheme.
|
| 97 |
+
|
| 98 |
+
We also evaluate against the Met Office Global Model operational
|