Preprints
https://doi.org/10.5194/egusphere-2025-1138
https://doi.org/10.5194/egusphere-2025-1138
06 May 2025
 | 06 May 2025
Status: this preprint is open for discussion and under review for Natural Hazards and Earth System Sciences (NHESS).

Automated tail-informed threshold selection for extreme coastal sea levels

Thomas P. Collings, Callum J. R. Murphy-Barltrop, Conor Murphy, Ivan D. Haigh, Paul D. Bates, and Niall D. Quinn

Abstract. Peaks over threshold (POT) techniques are commonly used in practice to model tail behaviour of univariate variables. The resulting models can be used to aid in risk assessments, providing estimates of relevant quantities such as return levels and periods. An important consideration during such modelling procedures involves the choice of threshold; this selection represents a bias-variance trade-off and is fundamental for ensuring reliable model fits. Despite the crucial nature of this problem, most applications of the POT framework select the threshold in an arbitrary manner and do not consider the sensitivity of the model to this choice. Recent works have called for a more robust approach for selecting thresholds, and a small number of automated methods have been proposed. However, these methods come with limitations, and currently, there does not appear to be a 'one size fits all' technique for threshold selection. In this work, we introduce a novel threshold selection approach that addresses some of the limitations of existing techniques. In particular, our approach ensures that the fitted model captures the tail behaviour at the most extreme observations, at the cost of some additional uncertainty. We apply our method to a global data set of coastal observations, where we illustrate the robustness of our approach and compare it to an existing threshold selection technique and an arbitrary threshold choice. Our novel approach is shown to select thresholds that are greater than the existing technique. We assess the resulting model fits using a right-sided Anderson-Darling test, and find that our method outperforms the existing and arbitrary methods on average. We present and discuss, in the context of uncertainty, the results from two tide gauge records; Apalachicola, US, and Fishguard, UK. In conclusion, the novel method proposed in this study improves the estimation of the tail behaviour of observed coastal water levels, and we encourage researchers from other disciplines to experiment using this method with their own data sets.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Share
Thomas P. Collings, Callum J. R. Murphy-Barltrop, Conor Murphy, Ivan D. Haigh, Paul D. Bates, and Niall D. Quinn

Status: open (until 17 Jun 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2025-1138', Anonymous Referee #1, 06 May 2025 reply
Thomas P. Collings, Callum J. R. Murphy-Barltrop, Conor Murphy, Ivan D. Haigh, Paul D. Bates, and Niall D. Quinn
Thomas P. Collings, Callum J. R. Murphy-Barltrop, Conor Murphy, Ivan D. Haigh, Paul D. Bates, and Niall D. Quinn

Viewed

Total article views: 136 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
101 30 5 136 3 3
  • HTML: 101
  • PDF: 30
  • XML: 5
  • Total: 136
  • BibTeX: 3
  • EndNote: 3
Views and downloads (calculated since 06 May 2025)
Cumulative views and downloads (calculated since 06 May 2025)

Viewed (geographical distribution)

Total article views: 134 (including HTML, PDF, and XML) Thereof 134 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 15 May 2025
Download
Short summary
Determining the threshold above which events are considered extreme is an important consideration for many modelling procedures. We propose an extension of an existing data-driven method for automatic threshold selection. We test our approach on tide gauge records, and show that it outperforms existing techniques. This helps improve estimates of extreme sea levels, and we hope other researchers will use this method for other natural hazards.
Share