Preprints
https://doi.org/10.5194/egusphere-2026-2325
https://doi.org/10.5194/egusphere-2026-2325
22 Jun 2026
 | 22 Jun 2026
Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

unseen-awg v1.0: spatio-temporal weather generation using analogs and unseen data

Jonathan Wider and Jakob Zscheischler

Abstract. Weather generators allow anticipating unseen weather and help prepare for possible weather-related hazards by providing long continuous time series representative of a given climate. Accurately representing dependencies between variables and locations within weather generators is challenging yet important – ignoring them can result in biased risk estimates. Daily analog weather generators trivially capture spatial and multivariate dependencies within each single time step. These generators resample a historical dataset while ensuring that successive sampled days have consistent large-scale atmospheric fields, thereby also ensuring temporally consistent local weather to some extent. Nevertheless, analog weather generators so far underestimate temporal correlations and are limited by the length of the available dataset they sample from, usually observations or reanalysis data. We propose unseen-awg, an analog weather generator based on data from weather forecasts initialized with historical conditions (reforecasts) and apply it to Europe in a case study. Combined with a novel tuning strategy and block sampling, this large, high-resolution dataset representative of present-day climate allows unseen-awg to simulate weather for the full annual cycle, improve on the temporal continuity of the generated time series, and generate unseen extremes at a daily timescale. We demonstrate that unseen-awg captures both the distributional properties of the individual variables and the dependence between summer temperature and precipitation at the grid-cell scale. We further highlight its ability to simulate droughts and heatwaves of unprecedented spatial extent. Combined with climate impact models, unseen-awg holds great potential for assessing weather-related risks across sectors such as water, agriculture, and forestry, domains that require simulating multiple variables and spatial dependencies across a large number of locations.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Jonathan Wider and Jakob Zscheischler

Status: open (until 17 Aug 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Jonathan Wider and Jakob Zscheischler

Data sets

An instance of the analog weather generator unseen-awg with precomputed similarities and a large set of generated weather data Jonathan Wider and Jakob Zscheischler https://doi.org/10.26050/WDCC/unsawg_wg

Preprocessed atmospheric circulation and impact-relevant variables from ERA5 and "Extended ensemble forecast hindcast" (ECMWF) for unseen-awg simulations Jonathan Wider and Jakob Zscheischler https://doi.org/21.14106/43da459a79e9f91e817e0b8690d494e8e91a00a5

Model code and software

unseen-awg: Spatio-temporal weather generation using analogs and unseen data Jonathan Wider and Jakob Zscheischler https://doi.org/10.5281/zenodo.19698708

Evaluation code for 'unseen-awg: Spatio-temporal weather generation using analogs and unseen data' Jonathan Wider and Jakob Zscheischler https://doi.org/10.5281/zenodo.19698738

unseen-awg user guide Jonathan Wider and Jakob Zscheischler https://jonathanwider.github.io/unseen-awg/

Jonathan Wider and Jakob Zscheischler
Metrics will be available soon.
Latest update: 22 Jun 2026
Download
Short summary
unseen-awg, a novel approach for generating multivariate spatiotemporal weather data, operates by resampling a large dataset of historical weather forecasts. The method uses block sampling of days and a parameter-tuning strategy to improve the realism of the generated European weather. It holds great potential for assessing weather-related risks across sectors such as water, agriculture, or forestry that require simulating multiple variables and spatial dependencies across numerous locations.
Share