unseen-awg v1.0: spatio-temporal weather generation using analogs and unseen data
Abstract. Weather generators allow anticipating unseen weather and help prepare for possible weather-related hazards by providing long continuous time series representative of a given climate. Accurately representing dependencies between variables and locations within weather generators is challenging yet important – ignoring them can result in biased risk estimates. Daily analog weather generators trivially capture spatial and multivariate dependencies within each single time step. These generators resample a historical dataset while ensuring that successive sampled days have consistent large-scale atmospheric fields, thereby also ensuring temporally consistent local weather to some extent. Nevertheless, analog weather generators so far underestimate temporal correlations and are limited by the length of the available dataset they sample from, usually observations or reanalysis data. We propose unseen-awg, an analog weather generator based on data from weather forecasts initialized with historical conditions (reforecasts) and apply it to Europe in a case study. Combined with a novel tuning strategy and block sampling, this large, high-resolution dataset representative of present-day climate allows unseen-awg to simulate weather for the full annual cycle, improve on the temporal continuity of the generated time series, and generate unseen extremes at a daily timescale. We demonstrate that unseen-awg captures both the distributional properties of the individual variables and the dependence between summer temperature and precipitation at the grid-cell scale. We further highlight its ability to simulate droughts and heatwaves of unprecedented spatial extent. Combined with climate impact models, unseen-awg holds great potential for assessing weather-related risks across sectors such as water, agriculture, and forestry, domains that require simulating multiple variables and spatial dependencies across a large number of locations.