Preprints
https://doi.org/10.5194/egusphere-2024-1886
https://doi.org/10.5194/egusphere-2024-1886
17 Jul 2024
 | 17 Jul 2024
Status: this preprint is open for discussion.

A Method for Quantifying Uncertainty in Spatially Interpolated Meteorological Data with Application to Daily Maximum Air Temperature

Conor T. Doherty, Weile Wang, Hirofumi Hashimoto, and Ian G. Brosnan

Abstract. Conventional point estimate error statistics are not well-suited to describing spatial and temporal variation in the accuracy of spatially interpolated meteorological variables. This paper describes, applies, and evaluates a method for quantifying prediction uncertainty in spatially interpolated estimates of meteorological variables. The approach presented here, which we will refer to as DNK for “detrend, normal score, krige,” uses established methods from geostatistics and we apply them to interpolate data from ground-based weather stations. The method is validated using daily maximum near-surface air temperature (Tmax). Uncertainty is inherent in gridded meteorological data, but this fact is often overlooked when data products provide single-value point estimates without a quantitative description of prediction uncertainty. Uncertainty varies as a function of spatial factors, like distance to the nearest measurement location, and temporal factors, like seasonality in sample heterogeneity. DNK produces not only point estimates but predictive distributions for each location. Predictive distributions quantitatively describe uncertainty suitably for propagation into physical models that take meteorological variables as inputs. We validate the uncertainty quantification by comparing theoretical versus actual coverage of prediction intervals computed at locations where measurement data were held out from the estimation procedure. We find that, for most days, the predictive distributions accurately quantify uncertainty and that theoretical versus actual coverage levels of prediction intervals closely match one another. Even for days with the worst agreement, the predictive distributions meaningfully convey the relative certainty of predictions for different locations in space. After validating the methodology, we demonstrate how the magnitude of prediction uncertainty varies significantly in both space and time. Finally, we examine spatial correlation in predictive distributions by using conditional Gaussian simulation in place of kriging. We conclude that spatial correlation in Tmax errors is relatively small, and that less computationally expensive kriging-based methods will suffice for many applications.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Conor T. Doherty, Weile Wang, Hirofumi Hashimoto, and Ian G. Brosnan

Status: open (until 11 Sep 2024)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Conor T. Doherty, Weile Wang, Hirofumi Hashimoto, and Ian G. Brosnan

Model code and software

Code for initial submission of "A Method for Quantifying Uncertainty in Spatially Interpolated Meteorological Data with Application to Daily Maximum Air Temperature" Conor T. Doherty https://zenodo.org/doi/10.5281/zenodo.12171025

Conor T. Doherty, Weile Wang, Hirofumi Hashimoto, and Ian G. Brosnan
Metrics will be available soon.
Latest update: 17 Jul 2024
Download
Short summary
We present, analyze, and validate a methodology for quantifying uncertainty in gridded meteorological data products produced by spatial interpolation. In a validation case study using daily maximum near-surface air temperature (Tmax), the method works well and produces predictive distributions with closely matching theoretical versus actual coverage levels. Application of the method reveals that the magnitude of uncertainty in interpolated Tmax varies significantly in both space and time.