Automated tail-informed threshold selection for extreme coastal sea levels

Collings, Thomas P.; Murphy-Barltrop, Callum J. R.; Murphy, Conor; Haigh, Ivan D.; Bates, Paul D.; Quinn, Niall D.

doi:10.5194/egusphere-2025-1138

Preprints

https://doi.org/10.5194/egusphere-2025-1138

Preprints

06 May 2025

| 06 May 2025

Automated tail-informed threshold selection for extreme coastal sea levels

Thomas P. Collings, Callum J. R. Murphy-Barltrop, Conor Murphy, Ivan D. Haigh, Paul D. Bates, and Niall D. Quinn

Abstract. Peaks over threshold (POT) techniques are commonly used in practice to model tail behaviour of univariate variables. The resulting models can be used to aid in risk assessments, providing estimates of relevant quantities such as return levels and periods. An important consideration during such modelling procedures involves the choice of threshold; this selection represents a bias-variance trade-off and is fundamental for ensuring reliable model fits. Despite the crucial nature of this problem, most applications of the POT framework select the threshold in an arbitrary manner and do not consider the sensitivity of the model to this choice. Recent works have called for a more robust approach for selecting thresholds, and a small number of automated methods have been proposed. However, these methods come with limitations, and currently, there does not appear to be a 'one size fits all' technique for threshold selection. In this work, we introduce a novel threshold selection approach that addresses some of the limitations of existing techniques. In particular, our approach ensures that the fitted model captures the tail behaviour at the most extreme observations, at the cost of some additional uncertainty. We apply our method to a global data set of coastal observations, where we illustrate the robustness of our approach and compare it to an existing threshold selection technique and an arbitrary threshold choice. Our novel approach is shown to select thresholds that are greater than the existing technique. We assess the resulting model fits using a right-sided Anderson-Darling test, and find that our method outperforms the existing and arbitrary methods on average. We present and discuss, in the context of uncertainty, the results from two tide gauge records; Apalachicola, US, and Fishguard, UK. In conclusion, the novel method proposed in this study improves the estimation of the tail behaviour of observed coastal water levels, and we encourage researchers from other disciplines to experiment using this method with their own data sets.

Received: 10 Mar 2025 – Discussion started: 06 May 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Thomas P. Collings, Callum J. R. Murphy-Barltrop, Conor Murphy, Ivan D. Haigh, Paul D. Bates, and Niall D. Quinn

Status: closed

RC1:
'Comment on egusphere-2025-1138', Anonymous Referee #1, 06 May 2025

This study proposes a novel automated threshold selection method for modeling extreme coastal sea levels within the Peaks Over Threshold (POT) framework, aiming to better capture tail behavior while addressing the limitations of arbitrary and existing automated threshold choices. The method is applied to global tide gauge data and evaluated using the Anderson-Darling test, demonstrating improved performance over conventional techniques. However, the quality and readability of Figure 4 should be enhanced to ensure clearer communication of results. The discussion section is relatively weak, lacking depth, logical structure, and clarity. It is recommended that the discussion be expanded and subdivided to include specific commentary on the data, methodology, and results of this study, with comparative insights drawn from previous research to highlight the strengths and limitations of the proposed approach. The authors are also encouraged to include a forward-looking perspective outlining directions for future work. In summary, I recommend a major revision.

Citation: https://doi.org/10.5194/egusphere-2025-1138-RC1
- AC1: 'Reply on RC1', Thomas Collings, 22 Jul 2025
  
  Dear Reviewer,
  Thank you for the comments. We are very grateful to both reviewers for their constructive feedback. We have attached a supplement PDF which describes how we have addressed each reviewers comments. We believe these changes have greatly strengthened the paper.
  Regards,
  The authors
  
  Citation: https://doi.org/10.5194/egusphere-2025-1138-AC1
RC2:
'Comment on egusphere-2025-1138', Anonymous Referee #2, 02 Jun 2025
The manuscript entitled \textit{Automated tail-informed threshold selection for extreme coastal sea levels} by Collings et al. presents a methodological advancement for improving the threshold selection process in the Peaks-Over-Threshold (POT) framework. The authors propose an automated, data-driven approach tailored for extreme coastal sea level analysis, addressing a key bottleneck in both scientific and operational applications of extreme value theory.
The proposed method is potentially useful for practitioners, particularly in settings where arbitrary or fixed thresholds are problematic and require substantial expertise and technical judgment from the user. The manuscript is timely and relevant for NHESS, as it contributes to a sensitive topic and to the ongoing discussions around robust quantification of coastal hazards. It is generally well-written and understandable.
However, the discussion of the results should be expanded and more clearly structured to enhance the scientific impact of the work. The validation of the method—although supported by two illustrative case studies—remains somewhat limited. A more comprehensive evaluation of the model’s performance across a wider range of sites or under varying data availability conditions would help assess its robustness and provide stronger support for broader applicability.
For these reasons, I recommend a major revision before the manuscript can be considered for publication.

MAJOR COMMENTS:
It would be useful to explore TAILS approach sensitivity to the length of the calibration sample, i.e., the number of available years of observation. Since the method is appealing for practical applications by professionals and practitioners, understanding its robustness under limited data availability would greatly increase its usability in poorly gauged sites;

Given the more weight attributed to the tail of the events population, that is the novelty and authors contibution to overcome existing approaches limitations, the more extreme events presence in data records effects are even more important in the proposed technique. I am wondering how sensitive the model threshold selection is to very rare events compared to the other methods. Artificially removing/adding adding large enough events or, consistently with previous point, years containing large enough events, and testing obtained thresold/quantiles can give some useful insights;

Considering the global scale of the analyses conducted, it would be valuable to assess whether any spatial patterns emerge in the performance of TAILS relative to conventional methods. Identifying systematic spatial behaviours, if any, could offer useful insights for practitioners and strengthen the case for its broader adoption, especially where spatially consistent behavior is observed;

The study provides a good validation of the proposed method based on the exceedences over the thresholds goodness-of-fit. Given the practical relevance of the proposed method, I would suggest to include a more comprehensive benchmark of the model performance on the annual maxima, in addition to the two case studies currently presented. Applying the method to a larger, eventually selected, set of stations could offer a more comprehensive assessment of its return levels predictive potential, that is of primary interest from an engineering perspective.

MINOR COMMENTS:
I would also mention the work from Tancredi et al. (Extremes, 2006) in the POT modelling section as a Bayesian study that explore how to integrate uncertainty in the threshold selection;

The authors mention the GESLA 3.1 update as a minor revision of the GESLA 3 dataset, which is provided by one of the authors. As far as I am aware, this updated dataset is not yet publicly available. The authors are encouraged to clarify whether they plan to release it, to ensure reproducibility and broader adoption of the proposed methodology.
Citation: https://doi.org/10.5194/egusphere-2025-1138-RC2
- AC2: 'Reply on RC2', Thomas Collings, 22 Jul 2025
  
  Dear Reviewer,
  Thank you for the comments. We are very grateful to both reviewers for their constructive feedback. We have attached a supplement PDF which describes how we have addressed each reviewers comments. We believe these changes have greatly strengthened the paper.
  Regards,
  The authors
  
  Citation: https://doi.org/10.5194/egusphere-2025-1138-AC2

Status: closed

RC1:
'Comment on egusphere-2025-1138', Anonymous Referee #1, 06 May 2025

This study proposes a novel automated threshold selection method for modeling extreme coastal sea levels within the Peaks Over Threshold (POT) framework, aiming to better capture tail behavior while addressing the limitations of arbitrary and existing automated threshold choices. The method is applied to global tide gauge data and evaluated using the Anderson-Darling test, demonstrating improved performance over conventional techniques. However, the quality and readability of Figure 4 should be enhanced to ensure clearer communication of results. The discussion section is relatively weak, lacking depth, logical structure, and clarity. It is recommended that the discussion be expanded and subdivided to include specific commentary on the data, methodology, and results of this study, with comparative insights drawn from previous research to highlight the strengths and limitations of the proposed approach. The authors are also encouraged to include a forward-looking perspective outlining directions for future work. In summary, I recommend a major revision.

Citation: https://doi.org/10.5194/egusphere-2025-1138-RC1
- AC1: 'Reply on RC1', Thomas Collings, 22 Jul 2025
  
  Dear Reviewer,
  Thank you for the comments. We are very grateful to both reviewers for their constructive feedback. We have attached a supplement PDF which describes how we have addressed each reviewers comments. We believe these changes have greatly strengthened the paper.
  Regards,
  The authors
  
  Citation: https://doi.org/10.5194/egusphere-2025-1138-AC1
RC2:
'Comment on egusphere-2025-1138', Anonymous Referee #2, 02 Jun 2025
The manuscript entitled \textit{Automated tail-informed threshold selection for extreme coastal sea levels} by Collings et al. presents a methodological advancement for improving the threshold selection process in the Peaks-Over-Threshold (POT) framework. The authors propose an automated, data-driven approach tailored for extreme coastal sea level analysis, addressing a key bottleneck in both scientific and operational applications of extreme value theory.
The proposed method is potentially useful for practitioners, particularly in settings where arbitrary or fixed thresholds are problematic and require substantial expertise and technical judgment from the user. The manuscript is timely and relevant for NHESS, as it contributes to a sensitive topic and to the ongoing discussions around robust quantification of coastal hazards. It is generally well-written and understandable.
However, the discussion of the results should be expanded and more clearly structured to enhance the scientific impact of the work. The validation of the method—although supported by two illustrative case studies—remains somewhat limited. A more comprehensive evaluation of the model’s performance across a wider range of sites or under varying data availability conditions would help assess its robustness and provide stronger support for broader applicability.
For these reasons, I recommend a major revision before the manuscript can be considered for publication.

MAJOR COMMENTS:
It would be useful to explore TAILS approach sensitivity to the length of the calibration sample, i.e., the number of available years of observation. Since the method is appealing for practical applications by professionals and practitioners, understanding its robustness under limited data availability would greatly increase its usability in poorly gauged sites;

Given the more weight attributed to the tail of the events population, that is the novelty and authors contibution to overcome existing approaches limitations, the more extreme events presence in data records effects are even more important in the proposed technique. I am wondering how sensitive the model threshold selection is to very rare events compared to the other methods. Artificially removing/adding adding large enough events or, consistently with previous point, years containing large enough events, and testing obtained thresold/quantiles can give some useful insights;

Considering the global scale of the analyses conducted, it would be valuable to assess whether any spatial patterns emerge in the performance of TAILS relative to conventional methods. Identifying systematic spatial behaviours, if any, could offer useful insights for practitioners and strengthen the case for its broader adoption, especially where spatially consistent behavior is observed;

The study provides a good validation of the proposed method based on the exceedences over the thresholds goodness-of-fit. Given the practical relevance of the proposed method, I would suggest to include a more comprehensive benchmark of the model performance on the annual maxima, in addition to the two case studies currently presented. Applying the method to a larger, eventually selected, set of stations could offer a more comprehensive assessment of its return levels predictive potential, that is of primary interest from an engineering perspective.

MINOR COMMENTS:
I would also mention the work from Tancredi et al. (Extremes, 2006) in the POT modelling section as a Bayesian study that explore how to integrate uncertainty in the threshold selection;

The authors mention the GESLA 3.1 update as a minor revision of the GESLA 3 dataset, which is provided by one of the authors. As far as I am aware, this updated dataset is not yet publicly available. The authors are encouraged to clarify whether they plan to release it, to ensure reproducibility and broader adoption of the proposed methodology.
Citation: https://doi.org/10.5194/egusphere-2025-1138-RC2
- AC2: 'Reply on RC2', Thomas Collings, 22 Jul 2025
  
  Dear Reviewer,
  Thank you for the comments. We are very grateful to both reviewers for their constructive feedback. We have attached a supplement PDF which describes how we have addressed each reviewers comments. We believe these changes have greatly strengthened the paper.
  Regards,
  The authors
  
  Citation: https://doi.org/10.5194/egusphere-2025-1138-AC2

Thomas P. Collings, Callum J. R. Murphy-Barltrop, Conor Murphy, Ivan D. Haigh, Paul D. Bates, and Niall D. Quinn

Viewed

Total article views: 961 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
827	113	21	961	25	36

HTML: 827
PDF: 113
XML: 21
Total: 961
BibTeX: 25
EndNote: 36

Views and downloads (calculated since 06 May 2025)

Month	HTML	PDF	XML	Total
May 2025	128	37	6	171
Jun 2025	58	13	4	75
Jul 2025	42	30	7	79
Aug 2025	87	18	1	106
Sep 2025	451	7	0	458
Oct 2025	51	5	2	58
Nov 2025	10	3	1	14

Cumulative views and downloads (calculated since 06 May 2025)

Month	HTML	PDF	XML	Total
May 2025	128	37	6	171
Jun 2025	58	13	4	75
Jul 2025	42	30	7	79
Aug 2025	87	18	1	106
Sep 2025	451	7	0	458
Oct 2025	51	5	2	58
Nov 2025	10	3	1	14

Viewed (geographical distribution)

Total article views: 936 (including HTML, PDF, and XML) Thereof 936 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 08 Nov 2025

Short summary

Determining the threshold above which events are considered extreme is an important consideration for many modelling procedures. We propose an extension of an existing data-driven method for automatic threshold selection. We test our approach on tide gauge records, and show that it outperforms existing techniques. This helps improve estimates of extreme sea levels, and we hope other researchers will use this method for other natural hazards.


Total:	0
HTML:	0
PDF:	0
XML:	0