Preprints
https://doi.org/10.5194/egusphere-2024-2222
13 Aug 2024
Status: this preprint is open for discussion.

Using Random Forests to Predict Extreme Sea-Levels at the Baltic Coast at Weekly Timescales

Kai Bellinghausen, Birgit Hünicke, and Eduardo Zorita

Abstract. We have designed a machine-learning method to predict the occurrence of daily extreme sea levels at the Baltic Sea coast with lead times of a few days. The method is based on a Random Forest classifier and uses spatially resolved fields of daily sea-level pressure, surface wind, and precipitation, together with the prefilling state of the Baltic Sea, as predictors for daily sea level above the 95 % quantile at each of seven tide-gauge stations representative of the Baltic coast.
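The target definition described above can be illustrated with a minimal sketch. It is not the authors' code: it assumes a single station, synthetic data, and hypothetical helper names (`quantile`, `label_extremes`, `align_with_lead`), and it covers only the labelling of days above the 95 % quantile and the pairing of predictors with a label a few days ahead. Fitting the Random Forest classifier itself is left out.

```python
import random


def quantile(values, q):
    """Empirical quantile with linear interpolation between order statistics."""
    ordered = sorted(values)
    pos = q * (len(ordered) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


def label_extremes(sea_level, q=0.95):
    """Return binary labels (1 = above the q-quantile) and the threshold used."""
    threshold = quantile(sea_level, q)
    labels = [1 if value > threshold else 0 for value in sea_level]
    return labels, threshold


def align_with_lead(predictors, labels, lead_days):
    """Pair the predictors of day t with the label of day t + lead_days."""
    if lead_days < 0:
        raise ValueError("lead_days must be non-negative")
    n_pairs = len(labels) - lead_days
    return [(predictors[t], labels[t + lead_days]) for t in range(n_pairs)]


# Synthetic stand-ins (assumption): 1000 days of sea level and two
# predictor values per day, e.g. a pressure and a wind component.
rng = random.Random(0)
sea_level = [rng.gauss(0.0, 20.0) for _ in range(1000)]
predictors = [[rng.gauss(1010.0, 10.0), rng.gauss(0.0, 5.0)] for _ in range(1000)]

labels, threshold = label_extremes(sea_level, q=0.95)
pairs = align_with_lead(predictors, labels, lead_days=3)
```

Each element of `pairs` is one training sample for a given lead time. Because only about 5 % of the days are labelled as extremes, the classes are strongly imbalanced, which is why sensitivity and false warnings are more informative than plain accuracy.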

The method is purely data-driven and is trained with sea-level data from the Global Extreme Sea Level Analysis (GESLA) data set and with meteorological data from the ERA5 reanalysis of the European Centre for Medium-Range Weather Forecasts (ECMWF). The method satisfactorily predicts sea-level extremes at lead times of up to 3 days and identifies the relevant predictor regions. The sensitivity, measured as the proportion of correctly predicted extremes, is of the order of 70 %, depending on the station.

The proportion of false warnings, related to the specificity of the predictions, is typically as low as 10 to 20 %. For lead times longer than 3 days, the predictive skill degrades; at 7 days, it is comparable to random skill. These values are generally higher than those derived from storm-surge reanalysis of dynamical models.
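The two skill measures mentioned in the abstract can be made concrete with a short sketch. It assumes that the "proportion of false warnings" means the false-positive rate, i.e. one minus the specificity; the abstract does not give the exact formula, so this reading is an assumption. The function names and the ten-day example series are hypothetical.

```python
def confusion_counts(observed, predicted):
    """Count true/false positives and negatives for binary series (1 = extreme)."""
    tp = sum(1 for o, p in zip(observed, predicted) if o == 1 and p == 1)
    fn = sum(1 for o, p in zip(observed, predicted) if o == 1 and p == 0)
    fp = sum(1 for o, p in zip(observed, predicted) if o == 0 and p == 1)
    tn = sum(1 for o, p in zip(observed, predicted) if o == 0 and p == 0)
    return tp, fn, fp, tn


def sensitivity(tp, fn):
    """Proportion of observed extremes that were predicted."""
    return tp / (tp + fn) if (tp + fn) else float("nan")


def false_warning_rate(fp, tn):
    """Proportion of non-extreme days with a warning (assumed: 1 - specificity)."""
    return fp / (fp + tn) if (fp + tn) else float("nan")


# Hypothetical ten-day series: three observed extremes, three warnings issued.
observed = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]

tp, fn, fp, tn = confusion_counts(observed, predicted)
sens = sensitivity(tp, fn)          # 2 of 3 extremes caught
fwr = false_warning_rate(fp, tn)    # 1 of 7 non-extreme days flagged
spec = 1.0 - fwr
```

In this toy series two of the three extremes are caught and one of the seven non-extreme days receives a warning, so the sensitivity is 2/3 and the false-warning rate is 1/7.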

The importance of each predictor depends on the location of the tide gauge. Usually, the most relevant predictors are sea level pressure, surface wind and prefilling. Extreme sea levels in the Northern Baltic are better predicted by surface pressure and the meridional surface wind component. By contrast, for stations located in the south, the most relevant predictors are surface pressure and the zonal wind component. Precipitation was not a relevant predictor for any of the stations analysed.

The Random Forest classifier is not required to have considerable complexity and the computing time to issue predictions is typically a few minutes on a personal laptop. The method can, therefore, be used as a pre-warning system triggering the application of more sophisticated algorithms to estimate the height of the ensuing extreme sea level or as a warning to run larger ensembles with physically based numerical models.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Status: open (until 24 Sep 2024)


Latest update: 31 Aug 2024
Short summary
We designed a tool to predict storm surges at the Baltic Sea coast with satisfactory predictability (70 % correct predictions) at lead times of a few days. The proportion of false warnings is typically as low as 10 to 20 %. We could identify the relevant predictor regions and their patterns, such as low-pressure systems and strong winds. Owing to its short computing time, the method can be used as a pre-warning system triggering the application of more sophisticated algorithms.