Enhancing Data-Driven Weather Forecasting via Gated Relative Position Encoding and Spatial-Aware Feed-Forward Network

Wang, Leyi; Zhang, Duo; Yang, Jerry Zhijian; Pan, Baoxiang; Xi, Dazhi; Huang, Xiaoyu

doi:10.5194/egusphere-2026-1990

Preprints

https://doi.org/10.5194/egusphere-2026-1990

Preprints

14 Apr 2026

| 14 Apr 2026

Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

Enhancing Data-Driven Weather Forecasting via Gated Relative Position Encoding and Spatial-Aware Feed-Forward Network

Leyi Wang, Duo Zhang, Jerry Zhijian Yang, Baoxiang Pan, Dazhi Xi, and Xiaoyu Huang

Abstract. Data-driven weather models have emerged to address the immense computational costs of traditional numerical weather prediction by generating highly accurate, global forecasts in seconds. While Transformer-based architectures have achieved higher accuracy than numerical weather predictions, their existing position encodings typically embed limited spatial and temporal context, failing to fully account for the time variability, directionality, and location-dependency inherent in atmospheric motions. To resolve this, we introduce a novel model, Neighborhood Attention Transformer for atmospheric prediction (AtmoNAT). We propose two unique architectural components: a Gated Relative Position Encoding (GRPE) and a Spatial-Aware Feed-Forward Network (SAFN). The GRPE maintains independent positional biases based on absolute coordinates to secure location-dependency with a negligible increase in model size, while effectively capturing the directionality and temporal variations of the atmosphere. Simultaneously, the SAFN incorporates parallel input and gating branches, alongside a global positional bias, to explicitly simulate non-local interactions between atmospheric variables and integrate terrain effects. Evaluated on the WeatherBench 2 data at a 1.5° spatial resolution, AtmoNAT’s deterministic forecasts demonstrate lower prediction errors on key variables up to a 72-hour lead time when compared to other coarse-resolution ensemble forecasts. Furthermore, AtmoNAT achieves state-of-the-art forecasting performance over global land areas, highlighting the profound potential of GRPE and SAFN in advancing next-generation weather forecasting.

Received: 08 Apr 2026 – Discussion started: 14 Apr 2026

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 2382 KB)

Supplement (3082 KB)

Download & links

Leyi Wang, Duo Zhang, Jerry Zhijian Yang, Baoxiang Pan, Dazhi Xi, and Xiaoyu Huang

Status: open (until 09 Jun 2026)

Post a comment Subscribe to comment alert

Leyi Wang, Duo Zhang, Jerry Zhijian Yang, Baoxiang Pan, Dazhi Xi, and Xiaoyu Huang

Supplement

https://doi.org/10.5194/egusphere-2026-1990-supplement

Model code and software

Source code of AtmoNAT Xiaoyu Huang and Leyi Wang https://doi.org/10.5281/zenodo.19369025

Leyi Wang, Duo Zhang, Jerry Zhijian Yang, Baoxiang Pan, Dazhi Xi, and Xiaoyu Huang

Metrics will be available soon.

Latest update: 15 Apr 2026

Short summary

We built a new artificial intelligence model to forecast the weather, designed to better understand air movement and how landscapes shape atmospheric motions. We trained this model on historical data to predict future conditions. Our tool proved highly accurate at predicting weather up to three days in advance. It also outperforms top models over land area. Our method requires significantly less resources. It paves the way for more efficient and more accurate daily weather forecasts worldwide.