the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Automated tail-informed threshold selection for extreme coastal sea levels
Abstract. Peaks over threshold (POT) techniques are commonly used in practice to model tail behaviour of univariate variables. The resulting models can be used to aid in risk assessments, providing estimates of relevant quantities such as return levels and periods. An important consideration during such modelling procedures involves the choice of threshold; this selection represents a bias-variance trade-off and is fundamental for ensuring reliable model fits. Despite the crucial nature of this problem, most applications of the POT framework select the threshold in an arbitrary manner and do not consider the sensitivity of the model to this choice. Recent works have called for a more robust approach for selecting thresholds, and a small number of automated methods have been proposed. However, these methods come with limitations, and currently, there does not appear to be a 'one size fits all' technique for threshold selection. In this work, we introduce a novel threshold selection approach that addresses some of the limitations of existing techniques. In particular, our approach ensures that the fitted model captures the tail behaviour at the most extreme observations, at the cost of some additional uncertainty. We apply our method to a global data set of coastal observations, where we illustrate the robustness of our approach and compare it to an existing threshold selection technique and an arbitrary threshold choice. Our novel approach is shown to select thresholds that are greater than the existing technique. We assess the resulting model fits using a right-sided Anderson-Darling test, and find that our method outperforms the existing and arbitrary methods on average. We present and discuss, in the context of uncertainty, the results from two tide gauge records; Apalachicola, US, and Fishguard, UK. In conclusion, the novel method proposed in this study improves the estimation of the tail behaviour of observed coastal water levels, and we encourage researchers from other disciplines to experiment using this method with their own data sets.
- Preprint
(2027 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on egusphere-2025-1138', Anonymous Referee #1, 06 May 2025
This study proposes a novel automated threshold selection method for modeling extreme coastal sea levels within the Peaks Over Threshold (POT) framework, aiming to better capture tail behavior while addressing the limitations of arbitrary and existing automated threshold choices. The method is applied to global tide gauge data and evaluated using the Anderson-Darling test, demonstrating improved performance over conventional techniques. However, the quality and readability of Figure 4 should be enhanced to ensure clearer communication of results. The discussion section is relatively weak, lacking depth, logical structure, and clarity. It is recommended that the discussion be expanded and subdivided to include specific commentary on the data, methodology, and results of this study, with comparative insights drawn from previous research to highlight the strengths and limitations of the proposed approach. The authors are also encouraged to include a forward-looking perspective outlining directions for future work. In summary, I recommend a major revision.
Citation: https://doi.org/10.5194/egusphere-2025-1138-RC1 -
AC1: 'Reply on RC1', Thomas Collings, 22 Jul 2025
Dear Reviewer,
Thank you for the comments. We are very grateful to both reviewers for their constructive feedback. We have attached a supplement PDF which describes how we have addressed each reviewers comments. We believe these changes have greatly strengthened the paper.
Regards,
The authors
-
AC1: 'Reply on RC1', Thomas Collings, 22 Jul 2025
-
RC2: 'Comment on egusphere-2025-1138', Anonymous Referee #2, 02 Jun 2025
The manuscript entitled \textit{Automated tail-informed threshold selection for extreme coastal sea levels} by Collings et al. presents a methodological advancement for improving the threshold selection process in the Peaks-Over-Threshold (POT) framework. The authors propose an automated, data-driven approach tailored for extreme coastal sea level analysis, addressing a key bottleneck in both scientific and operational applications of extreme value theory.
The proposed method is potentially useful for practitioners, particularly in settings where arbitrary or fixed thresholds are problematic and require substantial expertise and technical judgment from the user. The manuscript is timely and relevant for NHESS, as it contributes to a sensitive topic and to the ongoing discussions around robust quantification of coastal hazards. It is generally well-written and understandable.
However, the discussion of the results should be expanded and more clearly structured to enhance the scientific impact of the work. The validation of the method—although supported by two illustrative case studies—remains somewhat limited. A more comprehensive evaluation of the model’s performance across a wider range of sites or under varying data availability conditions would help assess its robustness and provide stronger support for broader applicability.
For these reasons, I recommend a major revision before the manuscript can be considered for publication.
MAJOR COMMENTS:- It would be useful to explore TAILS approach sensitivity to the length of the calibration sample, i.e., the number of available years of observation. Since the method is appealing for practical applications by professionals and practitioners, understanding its robustness under limited data availability would greatly increase its usability in poorly gauged sites;
- Given the more weight attributed to the tail of the events population, that is the novelty and authors contibution to overcome existing approaches limitations, the more extreme events presence in data records effects are even more important in the proposed technique. I am wondering how sensitive the model threshold selection is to very rare events compared to the other methods. Artificially removing/adding adding large enough events or, consistently with previous point, years containing large enough events, and testing obtained thresold/quantiles can give some useful insights;
- Considering the global scale of the analyses conducted, it would be valuable to assess whether any spatial patterns emerge in the performance of TAILS relative to conventional methods. Identifying systematic spatial behaviours, if any, could offer useful insights for practitioners and strengthen the case for its broader adoption, especially where spatially consistent behavior is observed;
- The study provides a good validation of the proposed method based on the exceedences over the thresholds goodness-of-fit. Given the practical relevance of the proposed method, I would suggest to include a more comprehensive benchmark of the model performance on the annual maxima, in addition to the two case studies currently presented. Applying the method to a larger, eventually selected, set of stations could offer a more comprehensive assessment of its return levels predictive potential, that is of primary interest from an engineering perspective.
MINOR COMMENTS:
- I would also mention the work from Tancredi et al. (Extremes, 2006) in the POT modelling section as a Bayesian study that explore how to integrate uncertainty in the threshold selection;
- The authors mention the GESLA 3.1 update as a minor revision of the GESLA 3 dataset, which is provided by one of the authors. As far as I am aware, this updated dataset is not yet publicly available. The authors are encouraged to clarify whether they plan to release it, to ensure reproducibility and broader adoption of the proposed methodology.
Citation: https://doi.org/10.5194/egusphere-2025-1138-RC2 -
AC2: 'Reply on RC2', Thomas Collings, 22 Jul 2025
Dear Reviewer,
Thank you for the comments. We are very grateful to both reviewers for their constructive feedback. We have attached a supplement PDF which describes how we have addressed each reviewers comments. We believe these changes have greatly strengthened the paper.
Regards,
The authors
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
382 | 99 | 18 | 499 | 21 | 31 |
- HTML: 382
- PDF: 99
- XML: 18
- Total: 499
- BibTeX: 21
- EndNote: 31
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1