Evaluation of Extreme Sea-Levels and Flood Return Period using Tidal Day Maxima at Coastal Locations in the United Kingdom

Taylor, Stephen

doi:10.5194/egusphere-2025-1804

Preprints

https://doi.org/10.5194/egusphere-2025-1804

Preprints

30 Apr 2025

| 30 Apr 2025

Evaluation of Extreme Sea-Levels and Flood Return Period using Tidal Day Maxima at Coastal Locations in the United Kingdom

Stephen Taylor

Abstract. Tidal storm surges can result in significant damage and inundation if sea defences are insufficiently robust. Coastal planners need to know the risk of flooding so that sea defences and coastal developments can be specified and sited appropriately. Since Gumbel's original work on extreme value statistics, several modifications and new methods have been proposed for evaluating the risk of tidal inundation, with the Skew Surge Joint Probability Method (SSJPM) recently gaining popularity. However, SSJPM is complex, often requiring manual intervention, and is difficult to automate. Guided by the search for a method specifically applicable to tides that is amenable to automation, this paper proposes several modifications to Gumbel's original approach. The novel technique is termed TMAX since its initial time unit is one tidal day, rather than the usual annual maxima (AMAX). Compared to AMAX, the TMAX method offers more efficient use of extreme data events, provides reduced variance in design height, and more efficiently handles missing data. The results of TMAX are compared with those of a recent study using the SSJPM method at 35 United Kingdom identical coastal locations, showing broad agreement. This new approach provides a robust mechanism for extreme tide analysis and better informs strategies for coastal management and resilience.

Received: 16 Apr 2025 – Discussion started: 30 Apr 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Stephen Taylor

Status: final response (author comments only)

RC1:
'Comment on egusphere-2025-1804', Anonymous Referee #1, 21 Jun 2025
General comments
This study presents a new technique for extreme value analysis of coastal tide gauge records called the TMAX method. The TMAX method primarily improves upon the AMAX method of Gumbel & Lieblein (1954) by using a subset of the highest total water level values of each tidal day, rather than annual maxima. The author applies the TMAX method to the same 35 UK tide gauge records used by Batstone et al. (2013) for development of the skew surge joint probability method, and results are compared to extreme water level estimates derived from the SSJPM and the AMAX method of Gumbel & Lieblein (1954).
The author describes several valid advantages of this method, including: 1) the method is simpler than the SSJPM because it does not require harmonic analysis or fitting a probability distribution; and 2) using the highest water level of each tidal data, rather than annual maxima, enables more elegant treatment of incomplete time series.
However, I disagree with one of the author’s motivations for developing the TMAX method. They write: “SSJPM is complex, often requiring manual intervention.” It is true that the SSJPM is complex, but papers such as Batstone et al. (2013) and Baranes et al. (2020) clearly demonstrate that application of joint probability methods in regions where tides are large relative to surge (and thus large surge events may not be included in the highest recorded total water levels) provides more precise and stable return level estimates with a narrowed uncertainty range. It would be more appropriate to apply the TMAX method in an area with smaller tides – or, perhaps, to apply the TMAX method to calculating skew surge statistics, and then convolving the TMAX probabilities with the tide probability distribution. In terms of the manual intervention, I interpreted that part of Batstone et al. (2013) as finding that manual intervention with the SSJPM was necessary in two distinct geographic regions (the Severn and James estuaries). To me, this points to a geographically linked phenomenon as a potential challenge (such as nonlinear tide-surge interaction or river influence), rather than the statistical method itself.
I interpret the primary conclusions of this paper as the TMAX method 1) giving “a significantly better internal fit and reduced variance” compared to the AMAX method, and 2) being “at least as accurate as the AMAX and SSJPM methods.” In this paper’s current form, I do not think that these conclusions are sufficiently supported.
In general, there is little analysis provided of the results. In particular, it is important to discuss how geography, record length, and tidal range affect the results.

I think “accuracy” is determined by consistency with the SSJPM, yet it seems that for 9 of the 35 locations, TMAX and SSJPM results are significantly different, even at the 20-year return period level. It is difficult to interpret this result without further discussion of factors such as record length and geography.

It’s a bit unclear how “internal fit” is compared, is it a comparison of the analyses that extend through 2009 to the ones that extend through 2018? If so…
On average, the TMAX-derived return water levels increase when the analysis is extended from 2009 to 2018 (Table 4). This is likely due to detrending the entire dataset using a constant assumed rate of RSLR that likely increased between 2009 and 2018. To actually assess something like stability (what I am interpreting “internal fit” to mean), it would make more sense to use something like a moving 365-day window to remove sea level rise and variability, then compare the results over the two time periods. Baranes et al. (2020) showed that application of a joint probability method significantly improved stability compared to fitting a probability distribution to total water level using a Monte Carlo validation method and by comparing statistical fits over two time periods. A similar sort of analysis could be used here to support the author’s conclusion.

If I am interpreting “internal fit” correctly, the only comparison of the AMAX and TMAX internal fits are in Table 3, which only show mean values.

Additional comments:
Fitting a constant sea level rise rate for the “AMAX to 2009” and “TMAX to 2009” analysis may facilitate comparison with the results in Batstone et al. (2013), but it will yield less robust statistics compared to using something like a moving 365-day mean, as rates of RSLR are variable, and interannual sea level variability may significantly affect the results. This should be discussed in the manuscript.

The use of a tidal day, rather than 1 year, is described as a novel approach. However, once the tidal days are reduced to a subset of the largest tidal days, I interpret this method as becoming equivalent to what Batstone et al. (2013) does in applying a peaks-over-threshold approach to skew surge (although this paper uses total water level). In other words, selecting a number of peak tidal day water levels that is equivalent to 5 times the record length is roughly equivalent to using the top 0.7%. This is thus equivalent to the innovation of the peaks-over-threshold approach, and the novelty here is overstated.

In the specific comments below, I highlight several parts of the introduction with questionable descriptions of published studies, the methods description is confusing, or the results are inconsistent.

Specific comments
Lines 24-26: While tide gauge-based extreme value analysis is valuable for many reasons, the spatially varying nature flood return levels that you mention (lines 24-25), along with the fact that tide gauges are generally purposefully installed in wave-sheltered locations, make it somewhat rare that they are the only tool used to determine design elevations for coastal defense structures. Perhaps you could modify the text to describe alternative applications, such as for determining boundary conditions and/or validating numerical models used for coastal planning.
Lines 29-31: This definition of HAT could use some clarifying. Do you mean that HAT assumes average conditions for the meteorological component of tidal height (or perhaps of total water level)? This might be clearer than calling the meteorological component of water level “noise.”
Lines 42-43: I’m curious why this is being highlighted as a particular weakness of the JPM when none of the extreme value analysis methods you discuss in the introduction provide flood duration information.
Lines 45-46: This is not quite right. The timing of the actual astronomical tide is shifted compared to the predicted astronomical tide (and the predicted tide is what’s used to calculate the non-tidal residual).
Lines 49-51: It’s not that the “difference” is uncorrelated; it’s that skew surge is uncorrelated with measured high water (see Williams, 2016).
Lines 51-53: The SSJPM fitting a GPD to skew surges is not a reflection of there being fewer skew surge values compared to non-tidal residual values for a time series of the same length. Fitting the GPD (or any extreme value distribution), as opposed to an empirical distribution, has the advantage of providing probabilities for values that exceed the maximum observed value. In fact, the Revised Joint Probability Method (Tawn & Vassie 1989; Tawn, 1992) made this improvement by fitting a GEV to the non-tidal residuals (rather than an empirical distribution).
Lines 69-71: I would recommend defining these terms earlier in the introduction and using one consistent term for measured minus predicted water level. I recommend “residual,” rather than storm surge (because the residual is often not storm surge) or random noise (because there are deterministic components of the residual).
Lines 54-57: See general comments above about the Batstone et al. (2013) manual intervention
Lines 128-134: This paragraph is confusing in a couple of places:
Lines 129-130: Do you mean the reverse? i.e. that the extreme values are more difficult to determine?

Lines 81-83 (and Equation 1) show Gumbel ranking in ascending order, but you say “substituting for Gumbel’s descending rank with an ascending rank”

Lines 209-214: Why not fit a spline to the hourly data? Or you could show that it’s not important to do this by comparing high waters over time periods with 15-minute data to time periods with hourly data.
Table 3:
Is the “mean difference” the mean of the difference between AMAX or TMAX and Batstone et al. (2013) across all 35 stations? Which is subtracted from which? And is the standard deviation the standard deviation of the difference?

I would suggest showing these results on a map for each individual gauge – especially because Batstone et al. (2013) discusses individual sites where the GPD fit to the skew surge distribution was not physically plausible. Essentially, show the information in Table 5 on a map.

Why don’t the means and standard deviations in Table 5 match the mean differences in the “TMAX to 2009” row of Table 3?

Figure 4: This is difficult to interpret the way the bins are labeled and without geographic information. It should be shown on a map with exact values reported. The same should be done for AMAX (and AMAX should somehow be compared to TMAX) to support the conclusion that one method provides more stable estimates than the other (see also general comments above).
Results presented in Table 5: There are 9 sites that have differences greater than or equal to 10 cm at the 20-year return period level, compared to the SSJPM (Avonmouth, Dover, Hinkley Point, Immingham, Newlyn, North Shields, Port Ellen, Tobermory, Workington). This seems like a relatively large difference, but it is difficult to interpret without information on geography, record length, or tidal range
Conclusions: I interpret the primary conclusions as 1) the TMAX method giving “a significantly better internal fit and reduced variance” compared to the AMAX method, and 2) the TMAX method being “at least as accurate as the MAX and SSJPM methods.” These conclusions should be described in an expanded and quantitative discussion section that points to the specific results and/or analyses that support the conclusions. Topics such as how geography, tidal range, and record length impact the results should be discussed.

Technical corrections
Lines 48-49: Perhaps revise to “… difference between the maxima of measured and predicted water level for each tidal cycle…”
Batstone et al. (2013) is sometimes referred to as “Batstone” and sometimes referred to as “Batstone 2013.”
The equations used in the TMAX method should be more clearly and concisely stated.
Figure 5: I recommend not using red and green for colorblindness
Citation: https://doi.org/10.5194/egusphere-2025-1804-RC1
- AC1:
  'Reply on RC1', Stephen Taylor, 30 Jun 2025
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-1804/egusphere-2025-1804-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2025-1804-AC1
  - AC2: 'Reply on AC1', Stephen Taylor, 02 Sep 2025
    
    I attach here a few further points which have come to light upon closer inspection of the comments of Reviewer 1.. These are additional to my response to your comment in the document trail.
    Unfortunately, some of the points you raise are factually incorrect. For example
    You ask: "Why don’t the means and standard deviations in Table 5 match the mean differences in the “TMAX to 2009” row of Table 3?"
    The figures on the last row of Table 3 do match the figures on the final two rows of Table 5. However Table 5 Caption "using data to 2009" was a typo and should be omitted.
    You state. "I would suggest showing these results on a map for each individual gauge - especially because Batstone et al. (2013) discusses individual sites where the GPD fit to the skew surge distribution was not physically plausible. Essentially, show the information in Table 5 on a map."
    They are already on a map. See Figure 5.
    Some of the other points you raise are suggestions for extending the research and fall outside the scope of the paper, rather than being a critique of the work.
    However as I indicated in my previous response, I am happy to address your other comments in a revision of the paper.
    
    Citation: https://doi.org/10.5194/egusphere-2025-1804-AC2
RC2:
'Comment on egusphere-2025-1804', Tasneem Ahmed, 02 Sep 2025
This paper is another contribution based on the application of extreme value theory (EVT) to se level extremes. EVT is a wide field and in recent times since the early 21st century has been widely used in environmental studies , especially for extreme events and any modifications have been suggested thereof. Similarly this paper touches on a specific aspect of the EVT and suggests some new developments to the Gumbel type I AMAX approach and looks at the highest tide each tidal day and verifies the approach with comparison against SSJPM.
The methodology and the results are sound. However a consideration of the following points may benefit the manuscript:
The Most important point would be regarding a more detailed treatment of the comparison of the TMAX method with the r-largest method , which arises as a limiting distribution to the GEV distribution , see Coles (2021). The r-largest method is well established, theoretically sound, and implemented in R packages with maximum likelihood estimation. Many studies implementing the r -largest method usually stick to top 3/top 5 annual maxima sea level. Within the TMAX method as well the author infers that n=5 leads to minimum variance. So More detail is required on how the TMAX is an added value over the r-largest method. Following this the results derived from comparison with the SSJPM is definitely meaningful.

Following the above point , the introduction could be improved to include more about the r-largest method and how the proposed TMAX method is an added value.
Citation: https://doi.org/10.5194/egusphere-2025-1804-RC2
- AC3: 'Reply on RC2', Stephen Taylor, 03 Sep 2025
  
  Dear Tasneem Ahmed,
  Thank you very much for taking the time to read my manuscript and to comment on it.
  You specifically mention the difference between my proposed TMAX and the r-largest approach. You make a similar point to that made by reviewer #1, except that in that case the claimed similarity was with the peaks-over-threshold (POT) approach.
  First let me respond by showing why I called the method TMAX. In the AMAX method the data is grouped annually. Hence because I use the inherent time unit of one tidal day rather than the annual unit it seems natural to replace the "A" with a "T". TMAX also follows the method of using a Gumbel log(log) plot and LSQ regression, just as AMAX does.
  Turning to your point, one can say it is indeed similar to r-largest, except TMAX has an implicit time grouping unit of one per tidal day rather than the usual one year for r-largest; TMAX would also have a value of r=1, since it uses only the largest value for each tidal day.
  However, one could also say it is similar to the peaks-over-threshold (POT) method. In TMAX the number of peaks required Nr is calculated from the number per year times the number of years. The TMAX method first fills the peak array and then sorts the peaks, numbering the largest peak as index 1; thus by counting along the array from 1 to Nr, each array element a[i] is getting smaller until a[Nr] contains the threshold figure. So one could argue it is equivalent to selecting the peaks on a POT approach using a[Nr] as the threshold. However, the usual way of proceeding further with the POT method is to then model the distribution as a Generalised Pareto Distribution (GPD). Unlike peaks-over-threshold, TMAX does not use this approach but uses a Gumbel plot and LSQ regression.
  Note that the number of points N, in equation (1,5) is the number of tidal days of valid data, not Nr. Since Nr < N, it ensures the linear (low probability) portion of Gumbel plot is used.
  Perhaps I overstated the innovation value a little for TMAX; I plan to correct this in a revision and also to provide an improved explanation regarding the method and to discuss comparisons with r-largest and peaks-over-threshold, as I have above.
  I very much hope this has addresses your concerns with my manuscript, which I intend to revise before publication.
  Yours sincerely
  Stephen Taylor
  
  Citation: https://doi.org/10.5194/egusphere-2025-1804-AC3
RC3:
'Comment on egusphere-2025-1804', Roberto Mínguez, 23 Oct 2025

The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-1804/egusphere-2025-1804-RC3-supplement.pdf

Citation: https://doi.org/10.5194/egusphere-2025-1804-RC3
- AC4: 'Reply on RC3', Stephen Taylor, 04 Nov 2025
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-1804/egusphere-2025-1804-AC4-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2025-1804-AC4

Stephen Taylor

Viewed

Total article views: 1,654 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
1,525	89	40	1,654	67	87

HTML: 1,525
PDF: 89
XML: 40
Total: 1,654
BibTeX: 67
EndNote: 87

Views and downloads (calculated since 30 Apr 2025)

Month	HTML	PDF	XML	Total
Apr 2025	13	1	0	14
May 2025	78	10	7	95
Jun 2025	54	8	3	65
Jul 2025	30	11	3	44
Aug 2025	184	4	1	189
Sep 2025	1,009	12	10	1,031
Oct 2025	79	17	5	101
Nov 2025	35	8	5	48
Dec 2025	29	16	6	51
Jan 2026	14	2	0	16

Cumulative views and downloads (calculated since 30 Apr 2025)

Month	HTML	PDF	XML	Total
Apr 2025	13	1	0	14
May 2025	78	10	7	95
Jun 2025	54	8	3	65
Jul 2025	30	11	3	44
Aug 2025	184	4	1	189
Sep 2025	1,009	12	10	1,031
Oct 2025	79	17	5	101
Nov 2025	35	8	5	48
Dec 2025	29	16	6	51
Jan 2026	14	2	0	16

Viewed (geographical distribution)

Total article views: 1,589 (including HTML, PDF, and XML) Thereof 1,589 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 09 Jan 2026

Short summary

Coastal planners need to know the flood risk from tidal surges so sea defences can be sited appropriately. The author has developed a novel technique which analyses tide gauge data to estimate the risk and the height of sea-defences required. A comparison with results of a UK Environment Agency 2011 study shows good agreement. However, this new approach is simpler to automate than the method used in that study, and could be widely used to inform strategies for coastal management and resilience.


Total:	0
HTML:	0
PDF:	0
XML:	0