Preprints
https://doi.org/10.5194/egusphere-2025-2870
https://doi.org/10.5194/egusphere-2025-2870
01 Jul 2025
 | 01 Jul 2025
Status: this preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

Covariance-informed spatiotemporal clustering improves the detection of hazardous weather events

Hunter C. Quintal, Antonia Sebastian, Marc L. Serre, Wiebke S. Jäger, and Marleen C. de Ruiter

Abstract. Spatiotemporal clustering can be used to detect weather events in multi-dimensional datasets. This method requires that the resolution of a dataset equivalently resolves fluctuations across space and time, thereby normalizing the dataset for unbiased clustering across three dimensions. Yet, few studies test whether a dataset meets this requirement as there is no standard approach to do so. To address this methodological gap, we present a framework to quantify the relationship between space and time using space time separable covariance modelling. We demonstrate that, by defining a temporal resolution of interest (e.g. hours, days), the equivalent spatial resolution can be empirically derived using a space time metric. We present an application using the unsupervised machine learning method Density-Based Spatial Clustering of Applications with Noise (DBSCAN) to detect heat waves and severe storms across the Southeastern US from 1940 to 2023 from ECMWF Reanalysis version 5 (ERA5) data. We analyse the seasonal behaviour of space time metrics for precipitation and heat index before selecting representative values. We find that both ERA5-derived daily heat index and hourly precipitation are insufficiently resolved for unbiased clustering at their native resolutions (i.e., 0.25 spatial degrees [degree] per day for heat index and 0.25 degree per hour for precipitation). We show that a resolution of 0.39 degree per day (0.05 degree per hour) prevents preferential clustering in either the spatial or temporal dimension for heat index (precipitation). We hypothesize that event identification will improve by resampling the data by the space time metric. Heat wave clusters that were produced using the unbiased resolution were compared against the NOAA Storm Events Database from 2019 to 2023. Recall of heat waves increased from 0.92 to 0.94 using the covariance-informed resolution, demonstrating the importance of normalization prior to weather event reconstruction. Ultimately, the inclusion of temporal geostatistics leads to improved reconstruction of historical weather events and enables evaluation of their scale and variability.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Share
Hunter C. Quintal, Antonia Sebastian, Marc L. Serre, Wiebke S. Jäger, and Marleen C. de Ruiter

Status: open (until 18 Aug 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Hunter C. Quintal, Antonia Sebastian, Marc L. Serre, Wiebke S. Jäger, and Marleen C. de Ruiter
Hunter C. Quintal, Antonia Sebastian, Marc L. Serre, Wiebke S. Jäger, and Marleen C. de Ruiter

Viewed

Total article views: 182 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
154 22 6 182 7 3 6
  • HTML: 154
  • PDF: 22
  • XML: 6
  • Total: 182
  • Supplement: 7
  • BibTeX: 3
  • EndNote: 6
Views and downloads (calculated since 01 Jul 2025)
Cumulative views and downloads (calculated since 01 Jul 2025)

Viewed (geographical distribution)

Total article views: 166 (including HTML, PDF, and XML) Thereof 166 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 16 Jul 2025
Download
Short summary
High quality weather event datasets are crucial to community preparedness and resilience. Researchers create such datasets using clustering methods, which we advance by addressing current limitation in the relationship between space and time. We propose a method to determine the appropriate factor by which to resample the spatial resolution of the data prior to clustering. Ultimately, our approach increases the ability to detect historic heatwaves over current methods.
Share