the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Skillful Decadal Prediction of German Bight Storm Activity
Abstract. We evaluate the prediction skill of the Max-Planck-Institute Earth System Model (MPI-ESM) decadal hindcast system for German Bight storm activity (GBSA) on a multiannual to decadal scale. We define GBSA every year via the most extreme three-hourly geostrophic wind speeds, which are derived from mean sea-level pressure (MSLP) data. Our 64-member ensemble of annually initialized hindcast simulations spans the time period 1960-2018. For this period, we compare deterministically and probabilistically predicted winter MSLP anomalies and annual GBSA with a lead time of up to ten years against observations. The model shows limited deterministic skill for single prediction years, but significant positive deterministic skill for long averaging periods. For probabilistic predictions of high and low storm activity, the model is skillful at both short and long averaging periods, and outperforms persistence-based predictions. For short lead years, the skill of the probabilistic prediction for high and low storm activity notably exceeds the deterministic skill. We therefore conclude that, for the German Bight, skillful decadal predictions of regional storm activity can be viable with a large ensemble and a carefully designed approach.
-
Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
-
Preprint
(10606 KB)
-
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(10606 KB) - Metadata XML
- BibTeX
- EndNote
- Final revised paper
Journal article(s) based on this preprint
Interactive discussion
Status: closed
-
RC1: 'Review of egusphere-2022-288', Anonymous Referee #1, 23 May 2022
Review of the paper Skillful Decadal Predictions of German Bight Storm Activity
The paper investigates the decadal prediction skill for MSLP anomalies and German Bight Storm Activity in a 64-member ensemble of yearly initialized decadal hindcast simulations. These are based on MPI-ESM-LR and cover the period 1960-2018. The authors use both deterministic and probabilistic skill metrics to analyse the prediction skill for different lead times.
The paper covers an interesting and relevant topic that could be of interest to NHESS readers. It is clearly structured and the presented topics are covered with sufficient material. However, some aspects in terms of methods, analyses and results are unclear and/or not well described/discussed. Here the manuscript needs some general improvements in order to enhance its clarity and readability. Please refer to the main points and specific comments below. If these points are taken into account, I think the article would be a useful addition to the literature.
Main points:
- DATA
- Just to clarify, you are not using the MiKlip data, but have constructed your own decadal prediction system? I was not sure until I got to line 102…
- I do not quite understand how you constructed the 64-member ensemble (L104-111). Please describe this in more detail.
- Please clarify which decadal runs you chose. If you are looking at the period 1961-2018, did you select all runs that include those years regardless of the lead time, or is the last run you selected the one that was initialized in 2008?
- METHODS
- Lead times, part 1: The selection of lead times seems somewhat arbitrary. Why did you choose 4-10 and 7 and not 1-7 and 4 or 2-8 and 5 …? Have you checked whether your results/conclusions would be different with a different choice of lead time?
- Lead times, part 2: In L126ff, you state that you focus on lead years 4-10 and 7. However, this only applies to the MSLP anomalies, since you show all possible lead year ranges for GBSA. Please be more specific in this regard.
- Pressure reduction: Is this a standard procedure for calculating MSLP from modelled surface pressure? Could you add a reference for equation 1? Does it affect the comparability of your results if you use direct MSLP for one half of the ensemble and calculate MSLP for the other half?
- Region of interest: Please clarify that you are analysing MSLP anomalies for the entire North Atlantic basin (including the German Bight), whereas the GBSA analyses focus only on the German Bight.
- Selection of grid points (L140-144): This information refers to the generation of GBSA time series, correct? If so, either integrate it in the respective paragraph (L146ff) or clarify why you need to select three grid points. At the moment, the whole paragraph comes a bit out of nowhere, without a clear link to the preceding/subsequent paragraphs…
- Generation of GBSA time series: Did I understand correctly that the time series cover the whole period 1960-2018, while you only use the period 1961-2010 for the standardization?
- Prediction skill: Please add a short explanation of why it is important to consider both deterministic and probabilistic skill scores when assessing the skill of a decadal prediction system.
- ACC: Although this should be common knowledge, please add the possible range of ACC and an explanation of what the different values mean.
- ACC versus BS: Be careful when using f and o in equations 2 and 4. You chose the same letters, but they have different meanings (value for ACC, probability for BS). Consider replacing f and o in equation 4 with capital letters.
- Choice of BSS: Out of curiosity – why did you choose the BSS rather than the ranked probability skill score (RPSS)? Since you are interested in three categories (low/normal/high), the RPSS seems the more natural choice to me as it also contains some information about the distance between model and observations.
- RESULTS
- Some thoughts on L234-242: Could it be that the initialisation has a “negative” impact in the first years (some kind of initialisation shock) – which would explain why the predictive skill is highest for lead time ranges starting in year 3 and 4? This would also fit (to some extent) to previous studies on wind-related variables like Kruschke et al. (2014) or Moemken et al. (2016). However, these studies use uninitialized historical simulations as reference and not persistence…
For temperature, several studies show high predictive skill for later/longer lead times (e.g. Feldmann et al., 2019). This increase seems to originate mainly from the long-term climate trend. However, I have never heard of the importance of climate trend for decadal predictions of wind-based parameters… - L304-338: These paragraphs seem to be more of a general discussion of your results and are not really related to the rest of section 3.2.2. Therefore, it might make sense to introduce a new section (3.3 Discussion) or new chapter (4. Discussion) for this part of the manuscript.
- Persistence as reference: Many studies dealing with decadal prediction systems use uninitialized historical simulations of the same model or simple climatology as reference. Is there any particular reason why you have not tried this as well? Please do not get me wrong – I think it is a strength of your study that you consider persistence and random guessing. It just makes it harder to compare your results with other studies on decadal predictions.
- Some thoughts on L234-242: Could it be that the initialisation has a “negative” impact in the first years (some kind of initialisation shock) – which would explain why the predictive skill is highest for lead time ranges starting in year 3 and 4? This would also fit (to some extent) to previous studies on wind-related variables like Kruschke et al. (2014) or Moemken et al. (2016). However, these studies use uninitialized historical simulations as reference and not persistence…
- FIGURES
- For readers unfamiliar with Germany (and the German Bight in particular), it might be helpful to include a figure showing the region of interest. In this, you could also mark the grid points given in Table 1.
- Figure 2: Please add some explanation in the text (L226-230) about the structure of the plot (that it shows all possible lead time combinations etc.).
- Consider simplifying the captions of Figures 5 and 6 (the same applies to B3 and B4) by saying something like “Same as Figure 4, but for …”.
Specific comments:
- Moemken et al. (2020) should be Moemken et al. (2021)
- L14-16: Please add a reference for this statement.
- L17: “certain types of extremes” – Can you give one or two examples?
- L23: “potential” instead of “potential value”?
- L27: Please add some of the standard references for the topic of decadal predictions.
- L28: Please add explanation of the abbreviation “MiKlip”.
- L35ff: Moemken et al. (2021) have analysed a dynamically downscaled component of the MiKlip system – please add this information somewhere.
- L39-40: Consider citing Reyers et al. (2019) at the end of this statement.
- L49/50: “an assumption which was shown by Krueger et al. (2019) to be valid” – please rephrase.
- L62-65: Could you add a reference for this statement?
- L68: Probably a stupid question – but would it be better to explain what a hindcast is?
- L73: “skill of the model system” instead of “skill of model system”
- L76: Please add a reference for the Brier Skill Score.
- L85-86: The use of two parentheses directly following each other looks strange – can you merge them? The same applies to lines 94-96.
- L155: “model’s prediction skill” instead of “model’s predictions skill”
- L168: “The BSS is defined as” instead of “The BSS defined as”
- L179: “calculated” instead of “caulated”
- L193: “prediction skill” instead of “predictions skill”
- Figure 1: “Deterministic prediction skill” instead of “Prediction skill”?
- L210/211: Consider placing “(Fig. 1b)” after “… ACC is lower for lead year 7”
- L358: “predictions for short lead year ranges” instead of “predictions short lead year ranges”
- L394: “in such a way that” instead of “in such a way so that”
References
- Feldmann H, Pinto JG … Kottmeier C (2019) Skill and added value of the MiKlip regional decadal prediction system for temperature over Europe. Tellus A 71, https://doi.org/10.1080/16000870.2019.1618678
- Kruschke T, Rust HW, Kadow C, Leckebusch GC and Ulbrich U (2014) Evaluating decadal predictions of northern hemispheric cyclone frequencies. Tellus A 66 22830, https://doi.org/10.3402/tellusa.v66.22830
- Moemken J, Reyers M, Buldmann B and Pinto JG (2016) Decadal predictability of regional scale wind speed and wind energy potentials over Central Europe. Tellus A 68 29199, https://doi.org/10.3402/tellusa.v68.29199
- Moemken J, Feldmann H ... Marotzke M (2021) The regional MiKlip decadal prediction system for Europe: Hindcast skill for extremes and user-oriented variables. Int. J. Climatol. 41 (Suppl. 1) E1944-E1958, https://doi.org/10.1002/joc.6824
- Reyers M, Feldmann H … Schädler G and Kottmeier C (2019) Development and prospects of the regional MiKlip decadal prediction system over Europe: predictive skill, added value of regionalization, and ensemble size dependency. Earth Syst. Dynam. 10 171-187, https://doi.org/10.5194/esd-10-171-2019, 2019
Citation: https://doi.org/10.5194/egusphere-2022-288-RC1 -
AC1: 'Reply on RC1', Daniel Krieger, 19 Jul 2022
We sincerely thank Reviewer #1 for their constructive and insightful comments on our manuscript Skillful Decadal Prediction of German Bight Storm Activity. The comments greatly helped us to improve the manuscript and clarify key points.
Please find enclosed our response to the reviewer's comments.
- DATA
-
RC2: 'Comment on egusphere-2022-288', Anonymous Referee #2, 26 May 2022
-
AC2: 'Reply on RC2', Daniel Krieger, 19 Jul 2022
We sincerely thank Reviewer #2 for their constructive and insightful comments on our manuscript Skillful Decadal Prediction of German Bight Storm Activity. The comments greatly helped us to improve the manuscript and clarify key points.
Please find enclosed our response to the reviewer's comments.
-
AC2: 'Reply on RC2', Daniel Krieger, 19 Jul 2022
-
RC3: 'Comment on egusphere-2022-288', Anonymous Referee #3, 17 Jun 2022
Krieger et al have investigated the predictability of German Bight Storm Activity for various interannual lead times and averaging periods. They find limited skill, but some hope that high and low storm activity is predictable. It's a well written and clear paper and it is great that both deterministic and probabilistic skill is covered. I recommend that this paper be accepted, subject to some minor revisions as detailed below.
One issue is that the paper focusses on some unusual forecast lead times (4-10 years and 7 years) without properly motivating why they use these. It would not seem the most interesting lead times for a user of a storm activity forecast. There is frequent reference to short and long averaging periods and I was not always sure whether that referred specifically to these two periods or had been generalised somehow. But if it is the latter, it is not defined. The language needs to be cleaned up around this.
Figures which lead to firm conclusions are squirreled away in an appendix. I suggest all important figures need to be in the main paper. See minor comments below.
Another issue is that there is no deterministic skill for mslp anywhere near the German Bight (Figure 1), so how do you explain that you have skill in predicting storm activity there? This needs to be covered in the discussion.
Negative skill is presented as useful skill. It is true, you could multiply the forecast by -1 and get a good forecast on average. The problem is that the skill is possibly negative due to a poorly modelled teleconnection and if there is an individual year when that teleconnection is not strong, multiplying by -1 could be the wrong thing to do. Better to assume negative skill is not useful even if it is significant.
Finally, the text often refers to "tails" of the distribution and "extremes" when in fact the data refers to anomalies exceeding 1 sigma, which is neither in the tail or an extreme. These words need to be removed from the text.
Minor comments:
l.9 "short lead years" is this years 4-10? If so, this should be explicit in the abstract, which needs to stand alone.l.126 The choice of lead years used in the study is not well motivated. Why 4-10? As a user I would want to know what next year brings and how the next five years will look on average. Do you have a user that is interested in years 4-10?
l.149 Would you not get a better estimate of the mean and standard deviation if you calculated it once over all members, rather than each one separately? (Assuming the members are interchangeable).
l.162 As you are looking at a quantity that has decadal variability there is persistence from one year to the next. This means you do not have independent samples as assumed by the Fisher method. Perhaps you should try block bootstrapping or another method that does account for this? If not, what is the lead 1 autocorrelation? Perhaps a time series of derived storm activity should be included.
1.195 "activity of a number n of years" would be clearer if written "activity of n years"
l.205 "skill in a circular area west of the british isles" and l.211 "skill over scandinavia" - both these regions have negative skill, which means the model is doing something wrong and potentially not responding to teleconnections correctly.
l.224 "anyhow" this word is unnecessary for the sentence and a bit informal for a scientific text.
l.241 "convincing explanation" how about initialisation shock? Lead years 2,3 and 4 are poor and 1 probably less good than it should be, which points to initialisation shock.
l.246 "We expect" a shift in the tail is usually caused by a shift by the whole distribution (and hence the mean), so you don't need a large ensemble size to detect shifts in the tail (Eade et al, 2012). Maybe you mean changes in the shape of the tail? You look at 1 sigma, which I would not describe as being in the tail.
l.251: "short and long lead year ranges" what does this mean? When does a lead year range go from being short to long? If you mean year 7 and years 4-10, be explicit and write that.
l.260 "skill emerges over the German Bight" It would help the reader to mark the German Bight on Figure 3.
l.271 "Overall" you are generalising here to all short and long averaging periods, which you cannot do as you have only looked at one of each. You also have not defined what is short and what is long - is the boundary at an averaging range of 2 years? 3 years?
l.302 "we conclude" figures that lead to conclusions should not be in the extended materials. I suggest you put the coin flip reference figures in the paper and persistence figures in the appendices.
l.319 "anyhow" see comment for line 224. Also "overwhelmingly" there is no room for emotional reactions in a scientific text
l.323 "ill-suited" you know the probability of falling into the upper category, can you not use this as a reference forecast?
l.329-338 Caveats belong in the conclusions section, not the results section.
l.345 "Brier Score" should be "Brier Skill Score". In addition, you also tested it against a 50% forecast.
l.361 This is the conclusions section, you may speculate upon a cause
Figure 1: Please add something to the maps to show where the German Bight is, not everyone knows. You could also mark out your three points used for the geostrophic wind calculation (or a on a later figure if the resolution is too coarse here).
Eade, R., Hamilton, E., Smith, D. M., Graham, R. J., and Scaife, A. A.(2012), Forecasting the number of extreme daily events out to a decade ahead, J. Geophys. Res., 117, D21110, doi:10.1029/2012JD018015.
Citation: https://doi.org/10.5194/egusphere-2022-288-RC3 -
AC3: 'Reply on RC3', Daniel Krieger, 19 Jul 2022
We sincerely thank Reviewer #3 for their constructive and insightful comments on our manuscript Skillful Decadal Prediction of German Bight Storm Activity. The comments greatly helped us to improve the manuscript and clarify key points.
Please find enclosed our response to the reviewer's comments.
-
AC3: 'Reply on RC3', Daniel Krieger, 19 Jul 2022
Interactive discussion
Status: closed
-
RC1: 'Review of egusphere-2022-288', Anonymous Referee #1, 23 May 2022
Review of the paper Skillful Decadal Predictions of German Bight Storm Activity
The paper investigates the decadal prediction skill for MSLP anomalies and German Bight Storm Activity in a 64-member ensemble of yearly initialized decadal hindcast simulations. These are based on MPI-ESM-LR and cover the period 1960-2018. The authors use both deterministic and probabilistic skill metrics to analyse the prediction skill for different lead times.
The paper covers an interesting and relevant topic that could be of interest to NHESS readers. It is clearly structured and the presented topics are covered with sufficient material. However, some aspects in terms of methods, analyses and results are unclear and/or not well described/discussed. Here the manuscript needs some general improvements in order to enhance its clarity and readability. Please refer to the main points and specific comments below. If these points are taken into account, I think the article would be a useful addition to the literature.
Main points:
- DATA
- Just to clarify, you are not using the MiKlip data, but have constructed your own decadal prediction system? I was not sure until I got to line 102…
- I do not quite understand how you constructed the 64-member ensemble (L104-111). Please describe this in more detail.
- Please clarify which decadal runs you chose. If you are looking at the period 1961-2018, did you select all runs that include those years regardless of the lead time, or is the last run you selected the one that was initialized in 2008?
- METHODS
- Lead times, part 1: The selection of lead times seems somewhat arbitrary. Why did you choose 4-10 and 7 and not 1-7 and 4 or 2-8 and 5 …? Have you checked whether your results/conclusions would be different with a different choice of lead time?
- Lead times, part 2: In L126ff, you state that you focus on lead years 4-10 and 7. However, this only applies to the MSLP anomalies, since you show all possible lead year ranges for GBSA. Please be more specific in this regard.
- Pressure reduction: Is this a standard procedure for calculating MSLP from modelled surface pressure? Could you add a reference for equation 1? Does it affect the comparability of your results if you use direct MSLP for one half of the ensemble and calculate MSLP for the other half?
- Region of interest: Please clarify that you are analysing MSLP anomalies for the entire North Atlantic basin (including the German Bight), whereas the GBSA analyses focus only on the German Bight.
- Selection of grid points (L140-144): This information refers to the generation of GBSA time series, correct? If so, either integrate it in the respective paragraph (L146ff) or clarify why you need to select three grid points. At the moment, the whole paragraph comes a bit out of nowhere, without a clear link to the preceding/subsequent paragraphs…
- Generation of GBSA time series: Did I understand correctly that the time series cover the whole period 1960-2018, while you only use the period 1961-2010 for the standardization?
- Prediction skill: Please add a short explanation of why it is important to consider both deterministic and probabilistic skill scores when assessing the skill of a decadal prediction system.
- ACC: Although this should be common knowledge, please add the possible range of ACC and an explanation of what the different values mean.
- ACC versus BS: Be careful when using f and o in equations 2 and 4. You chose the same letters, but they have different meanings (value for ACC, probability for BS). Consider replacing f and o in equation 4 with capital letters.
- Choice of BSS: Out of curiosity – why did you choose the BSS rather than the ranked probability skill score (RPSS)? Since you are interested in three categories (low/normal/high), the RPSS seems the more natural choice to me as it also contains some information about the distance between model and observations.
- RESULTS
- Some thoughts on L234-242: Could it be that the initialisation has a “negative” impact in the first years (some kind of initialisation shock) – which would explain why the predictive skill is highest for lead time ranges starting in year 3 and 4? This would also fit (to some extent) to previous studies on wind-related variables like Kruschke et al. (2014) or Moemken et al. (2016). However, these studies use uninitialized historical simulations as reference and not persistence…
For temperature, several studies show high predictive skill for later/longer lead times (e.g. Feldmann et al., 2019). This increase seems to originate mainly from the long-term climate trend. However, I have never heard of the importance of climate trend for decadal predictions of wind-based parameters… - L304-338: These paragraphs seem to be more of a general discussion of your results and are not really related to the rest of section 3.2.2. Therefore, it might make sense to introduce a new section (3.3 Discussion) or new chapter (4. Discussion) for this part of the manuscript.
- Persistence as reference: Many studies dealing with decadal prediction systems use uninitialized historical simulations of the same model or simple climatology as reference. Is there any particular reason why you have not tried this as well? Please do not get me wrong – I think it is a strength of your study that you consider persistence and random guessing. It just makes it harder to compare your results with other studies on decadal predictions.
- Some thoughts on L234-242: Could it be that the initialisation has a “negative” impact in the first years (some kind of initialisation shock) – which would explain why the predictive skill is highest for lead time ranges starting in year 3 and 4? This would also fit (to some extent) to previous studies on wind-related variables like Kruschke et al. (2014) or Moemken et al. (2016). However, these studies use uninitialized historical simulations as reference and not persistence…
- FIGURES
- For readers unfamiliar with Germany (and the German Bight in particular), it might be helpful to include a figure showing the region of interest. In this, you could also mark the grid points given in Table 1.
- Figure 2: Please add some explanation in the text (L226-230) about the structure of the plot (that it shows all possible lead time combinations etc.).
- Consider simplifying the captions of Figures 5 and 6 (the same applies to B3 and B4) by saying something like “Same as Figure 4, but for …”.
Specific comments:
- Moemken et al. (2020) should be Moemken et al. (2021)
- L14-16: Please add a reference for this statement.
- L17: “certain types of extremes” – Can you give one or two examples?
- L23: “potential” instead of “potential value”?
- L27: Please add some of the standard references for the topic of decadal predictions.
- L28: Please add explanation of the abbreviation “MiKlip”.
- L35ff: Moemken et al. (2021) have analysed a dynamically downscaled component of the MiKlip system – please add this information somewhere.
- L39-40: Consider citing Reyers et al. (2019) at the end of this statement.
- L49/50: “an assumption which was shown by Krueger et al. (2019) to be valid” – please rephrase.
- L62-65: Could you add a reference for this statement?
- L68: Probably a stupid question – but would it be better to explain what a hindcast is?
- L73: “skill of the model system” instead of “skill of model system”
- L76: Please add a reference for the Brier Skill Score.
- L85-86: The use of two parentheses directly following each other looks strange – can you merge them? The same applies to lines 94-96.
- L155: “model’s prediction skill” instead of “model’s predictions skill”
- L168: “The BSS is defined as” instead of “The BSS defined as”
- L179: “calculated” instead of “caulated”
- L193: “prediction skill” instead of “predictions skill”
- Figure 1: “Deterministic prediction skill” instead of “Prediction skill”?
- L210/211: Consider placing “(Fig. 1b)” after “… ACC is lower for lead year 7”
- L358: “predictions for short lead year ranges” instead of “predictions short lead year ranges”
- L394: “in such a way that” instead of “in such a way so that”
References
- Feldmann H, Pinto JG … Kottmeier C (2019) Skill and added value of the MiKlip regional decadal prediction system for temperature over Europe. Tellus A 71, https://doi.org/10.1080/16000870.2019.1618678
- Kruschke T, Rust HW, Kadow C, Leckebusch GC and Ulbrich U (2014) Evaluating decadal predictions of northern hemispheric cyclone frequencies. Tellus A 66 22830, https://doi.org/10.3402/tellusa.v66.22830
- Moemken J, Reyers M, Buldmann B and Pinto JG (2016) Decadal predictability of regional scale wind speed and wind energy potentials over Central Europe. Tellus A 68 29199, https://doi.org/10.3402/tellusa.v68.29199
- Moemken J, Feldmann H ... Marotzke M (2021) The regional MiKlip decadal prediction system for Europe: Hindcast skill for extremes and user-oriented variables. Int. J. Climatol. 41 (Suppl. 1) E1944-E1958, https://doi.org/10.1002/joc.6824
- Reyers M, Feldmann H … Schädler G and Kottmeier C (2019) Development and prospects of the regional MiKlip decadal prediction system over Europe: predictive skill, added value of regionalization, and ensemble size dependency. Earth Syst. Dynam. 10 171-187, https://doi.org/10.5194/esd-10-171-2019, 2019
Citation: https://doi.org/10.5194/egusphere-2022-288-RC1 -
AC1: 'Reply on RC1', Daniel Krieger, 19 Jul 2022
We sincerely thank Reviewer #1 for their constructive and insightful comments on our manuscript Skillful Decadal Prediction of German Bight Storm Activity. The comments greatly helped us to improve the manuscript and clarify key points.
Please find enclosed our response to the reviewer's comments.
- DATA
-
RC2: 'Comment on egusphere-2022-288', Anonymous Referee #2, 26 May 2022
-
AC2: 'Reply on RC2', Daniel Krieger, 19 Jul 2022
We sincerely thank Reviewer #2 for their constructive and insightful comments on our manuscript Skillful Decadal Prediction of German Bight Storm Activity. The comments greatly helped us to improve the manuscript and clarify key points.
Please find enclosed our response to the reviewer's comments.
-
AC2: 'Reply on RC2', Daniel Krieger, 19 Jul 2022
-
RC3: 'Comment on egusphere-2022-288', Anonymous Referee #3, 17 Jun 2022
Krieger et al have investigated the predictability of German Bight Storm Activity for various interannual lead times and averaging periods. They find limited skill, but some hope that high and low storm activity is predictable. It's a well written and clear paper and it is great that both deterministic and probabilistic skill is covered. I recommend that this paper be accepted, subject to some minor revisions as detailed below.
One issue is that the paper focusses on some unusual forecast lead times (4-10 years and 7 years) without properly motivating why they use these. It would not seem the most interesting lead times for a user of a storm activity forecast. There is frequent reference to short and long averaging periods and I was not always sure whether that referred specifically to these two periods or had been generalised somehow. But if it is the latter, it is not defined. The language needs to be cleaned up around this.
Figures which lead to firm conclusions are squirreled away in an appendix. I suggest all important figures need to be in the main paper. See minor comments below.
Another issue is that there is no deterministic skill for mslp anywhere near the German Bight (Figure 1), so how do you explain that you have skill in predicting storm activity there? This needs to be covered in the discussion.
Negative skill is presented as useful skill. It is true, you could multiply the forecast by -1 and get a good forecast on average. The problem is that the skill is possibly negative due to a poorly modelled teleconnection and if there is an individual year when that teleconnection is not strong, multiplying by -1 could be the wrong thing to do. Better to assume negative skill is not useful even if it is significant.
Finally, the text often refers to "tails" of the distribution and "extremes" when in fact the data refers to anomalies exceeding 1 sigma, which is neither in the tail or an extreme. These words need to be removed from the text.
Minor comments:
l.9 "short lead years" is this years 4-10? If so, this should be explicit in the abstract, which needs to stand alone.l.126 The choice of lead years used in the study is not well motivated. Why 4-10? As a user I would want to know what next year brings and how the next five years will look on average. Do you have a user that is interested in years 4-10?
l.149 Would you not get a better estimate of the mean and standard deviation if you calculated it once over all members, rather than each one separately? (Assuming the members are interchangeable).
l.162 As you are looking at a quantity that has decadal variability there is persistence from one year to the next. This means you do not have independent samples as assumed by the Fisher method. Perhaps you should try block bootstrapping or another method that does account for this? If not, what is the lead 1 autocorrelation? Perhaps a time series of derived storm activity should be included.
1.195 "activity of a number n of years" would be clearer if written "activity of n years"
l.205 "skill in a circular area west of the british isles" and l.211 "skill over scandinavia" - both these regions have negative skill, which means the model is doing something wrong and potentially not responding to teleconnections correctly.
l.224 "anyhow" this word is unnecessary for the sentence and a bit informal for a scientific text.
l.241 "convincing explanation" how about initialisation shock? Lead years 2,3 and 4 are poor and 1 probably less good than it should be, which points to initialisation shock.
l.246 "We expect" a shift in the tail is usually caused by a shift by the whole distribution (and hence the mean), so you don't need a large ensemble size to detect shifts in the tail (Eade et al, 2012). Maybe you mean changes in the shape of the tail? You look at 1 sigma, which I would not describe as being in the tail.
l.251: "short and long lead year ranges" what does this mean? When does a lead year range go from being short to long? If you mean year 7 and years 4-10, be explicit and write that.
l.260 "skill emerges over the German Bight" It would help the reader to mark the German Bight on Figure 3.
l.271 "Overall" you are generalising here to all short and long averaging periods, which you cannot do as you have only looked at one of each. You also have not defined what is short and what is long - is the boundary at an averaging range of 2 years? 3 years?
l.302 "we conclude" figures that lead to conclusions should not be in the extended materials. I suggest you put the coin flip reference figures in the paper and persistence figures in the appendices.
l.319 "anyhow" see comment for line 224. Also "overwhelmingly" there is no room for emotional reactions in a scientific text
l.323 "ill-suited" you know the probability of falling into the upper category, can you not use this as a reference forecast?
l.329-338 Caveats belong in the conclusions section, not the results section.
l.345 "Brier Score" should be "Brier Skill Score". In addition, you also tested it against a 50% forecast.
l.361 This is the conclusions section, you may speculate upon a cause
Figure 1: Please add something to the maps to show where the German Bight is, not everyone knows. You could also mark out your three points used for the geostrophic wind calculation (or a on a later figure if the resolution is too coarse here).
Eade, R., Hamilton, E., Smith, D. M., Graham, R. J., and Scaife, A. A.(2012), Forecasting the number of extreme daily events out to a decade ahead, J. Geophys. Res., 117, D21110, doi:10.1029/2012JD018015.
Citation: https://doi.org/10.5194/egusphere-2022-288-RC3 -
AC3: 'Reply on RC3', Daniel Krieger, 19 Jul 2022
We sincerely thank Reviewer #3 for their constructive and insightful comments on our manuscript Skillful Decadal Prediction of German Bight Storm Activity. The comments greatly helped us to improve the manuscript and clarify key points.
Please find enclosed our response to the reviewer's comments.
-
AC3: 'Reply on RC3', Daniel Krieger, 19 Jul 2022
Peer review completion
Journal article(s) based on this preprint
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
212 | 68 | 8 | 288 | 6 | 5 |
- HTML: 212
- PDF: 68
- XML: 8
- Total: 288
- BibTeX: 6
- EndNote: 5
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
Sebastian Brune
Patrick Pieper
Ralf Weisse
Johanna Baehr
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(10606 KB) - Metadata XML