the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Robust global detection of forced changes in mean and extreme precipitation despite observational disagreement on the magnitude of change
Abstract. Detection and attribution (D&A) of forced precipitation change is challenging due to internal variability and limited spatial and temporal coverage of observational records. These factors result in a low signal-to-noise ratio of potential regional and even global trends. Here, we use a statistical method – ridge regression – to create physically interpretable fingerprints for detection of forced changes in mean and extreme precipitation with a high signal-to-noise ratio. The fingerprints are constructed using CMIP6 multi-model output masked to match coverage of three gridded precipitation observational datasets – GHCNDEX, HadEX3, and GPCC –, and are then applied to these observational datasets to assess the degree of forced change detectable in the real-world climate.
We show that the signature of forced change is detected in all three observational datasets for global metrics of mean and extreme precipitation. Forced changes are still detectable from changes in the spatial patterns of precipitation even if the global mean trend is removed from the data. This shows detection of forced change in mean and extreme precipitation beyond a global mean trend, and increases confidence in the detection method's power, as well as in climate models' ability to capture the relevant processes that contribute to large-scale patterns of change.
We also find, however, that detectability depends on the observational dataset used. Not only coverage differences but also observational uncertainty contribute to dataset disagreement, exemplified by times of emergence of forced change from internal variability ranging from 1998 to 2004 among datasets. Furthermore, different choices for the period over which the forced trend is computed result in different levels of agreement between observations and model projections. These sensitivities may explain apparent contradictions in recent studies on whether models under- or overestimate the observed forced increase in mean and extreme precipitation. Lastly, the detection fingerprints are found to rely primarily on the signal in the extratropical Northern Hemisphere, which is at least partly due to observational coverage, but potentially also due to the presence of a more robust signal in the Northern Hemisphere in general.
-
Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
-
Preprint
(3433 KB)
-
Supplement
(9191 KB)
-
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(3433 KB) - Metadata XML
-
Supplement
(9191 KB) - BibTeX
- EndNote
- Final revised paper
Journal article(s) based on this preprint
Interactive discussion
Status: closed
-
RC1: 'Comment on egusphere-2022-568', Anonymous Referee #1, 02 Sep 2022
This paper proposes a ridge regression approach to the detection and attribution of externally forced changes in mean and extreme precipitation. This is an interesting idea that certainly merits exploration, but before devoting a lot of time to understanding the details of the paper and the results that are obtained, I think it is necessary for the authors to better explain their method and to situate it within the pantheon of methods that are already available for detection and attribution.
Ridge regression is a technique that “regularizes” regression problems, such as that described in equation (1) of the paper, in which the predictor variables contained in matrix X are multi-colinear. In the generalized least squares formulation of the regression used in detection and attribution this matrix is composed of model simulated estimates of the responses to external forcing in the form of space-time patterns of change. Depending on variable, period considered, domain of interest and how data are processed, the expected space-time patterns of responses to different forcing factors (often called fingerprints) can be strongly correlated, which results in a  regression “design matrix” X that may be ill-conditioned. Ridge regression is a technique that can be used to overcome this problem, although I imagine at the cost of introducimg some bias into the estimated signal scaling coefficients β. Note that referring to these coefficients as “fingerprints” seems unusual to me.
The concept of regularization, however, also arises in a second way in the detection and attribution problem. Considering again equation (1), the generalized least squares approach (and also its total least squares extension) requires knowledge of the variance-covariance matrix of the residuals ε, which are regarded as resulting from natural internal climate variability. Thus, the variance-covariance matrix is generally estimated from unforced control simulations, using as many climate-model simulated realisations of ε as possible. Even though many climate-model simulated realizations of ε are now generally available, the estimated variance-covariance matrix may not be of full rank or may remain uncertain. Thus, it is also often regularized, using an approach similar to the regularization used in ridge regression, but applied to the noise term rather than the signal term of equation (1). See Ribes et al (2013a, doi:10.1007/s00382-013-1735-7, and 2013b, doi:10.1007/s00382-013-1736-6). Presumably one would want to regularize both aspects of the problem, and also take signal uncertainty into account as is done in the total least squares approach to the regression problem (see again Ribes et al., 2013a and 2013b, and also Allen and Stott, 2003, doi:10.1007/s00382-003-0313-9).
How the combined model represented by equations (1-3) relates to existing techniques, and now the noise that results from internal variability comes into play and is accounted for in their subsequent application in the paper is not made clear, and I think should be clarified before results can be considered.
Also, I think it is necessary for the authors to discuss whether the proposed methods, which basically use linear statistical models that therefore implicitly assume Gaussian, or near Gaussian errors, are suitable for the data to which they are applied. Indicators of extreme precipitation, such as Rx1day at individual grid boxes, are certainly not Gaussian.
A final general comment is that the relatively heavy of use of acronyms in this paper is not very reader friendly.
Citation: https://doi.org/10.5194/egusphere-2022-568-RC1 - AC1: 'Reply on RC1', Iris de Vries, 08 Sep 2022
-
RC2: 'Comment on egusphere-2022-568', Anonymous Referee #2, 06 Sep 2022
Overall comments:
This study conducts a signal detection analysis for global changes in mean and extreme precipitation using three observational datasets and CMIP6 multi-model outputs. The authors apply a ridge regression (RR) method to construct fingerprints, which helps increase a signal-to-noise ratio of precipitation change patterns. Results show a robust detection of anthropogenic signals in all observations for both mean and extreme precipitation even when removing global mean trends, further supporting the human-induced intensification of global hydrological cycle. I find this paper very well written with sufficient details provided about methods as well as various sensitivity tests and therefore suggest publication after addressing some minor issues.
Â
Major comments:
1. Although method details are provided, it would be useful to explain more clearly what are benefits of the attribution approaches employed, including ridge regression, EOF-based metric for target variable, and GMST-based signal estimation. All of these procedures seem to contribute to increase signal-to-noise ratio but how they do and what step is more important. The authors provide some associated results from sensitivity tests but an overall explanation of their method possibly with a schematic would be helpful for readers to understand the contribution of each step to the final signal detection.
Â
2. An important motivation of considering different periods and datasets is opposing conclusions by previous studies about model overestimation or underestimation of the observed trends. I am wondering if the authors can go further and compare their results with some previous studies. For instance, if studies based on the latter half of 20th century trends find model underestimation, the authors can assess their model trends for the same/similar periods. Another point here is that the present study uses absolute units of precipitation while most of previous studies considered relative changes or aggregated values. It would be good to discuss possible influences of this difference.
Â
3. The lower detectability in GHCNDEX observations are suggested to be due to the poorer spatial coverage. Regarding this issue, I would suggest using Rx5d. As I understand, Rx5d has larger spatial coverage than Rx1d and comparison with Rx1d-based results may provide a way to support the authors’ interpretation. Another way would be to compare detection results from using a selected model run but with different spatial coverages applied.
Â
Minor comments:
L8: Indicating analysis period or trend period with signal detection would be useful here.
L17-19, L58-64: Better comparisons can be made by applying the same periods as those used in previous studies. See my major comment above.
L20-21: Is this confirmed by repeating detection analysis using NH-extratropics only?
L34: “discrepancies with respect to observations”. Its meaning is unclear.
L69-71: Need to explain what the previous studies have found additionally using these “data-science methods”. Also, what’s the novelty of this study compared with them? Is it detection based on spatial pattern information alone?
L108-109: “Trend biases due to this structural difference … negligible”. But the cited reference considered south-east Australia only?
L201: How to define S when global means are removed?
L212: “CMIP6 ssp245” should be “CMIP6 historical”?
L227: “virtually identical”. adding spatial correlation would help with this.
L314-316: This suggests possible dependence of Rx1d FRE on temperature, resembling global warming slowdown due to PDO influence?
L331-332: “results … hold when the global mean is used as FR target”. Then what are benefits of using EOF-based metric for target variable?
L382-383: “accuracy of the CMIP6 climate models in simulating the processes …”. It’s unclear how the authors get this conclusion. Observation-model agreement in residual variability? More explanation would be useful.
L394-395: “(not shown)”. This looks important and I suggest showing them in the supplement.
L428: “value of RR-based fingerprint construction”. What happens in detection or SNR without applying RR? See my major comment above.
Â
Citation: https://doi.org/10.5194/egusphere-2022-568-RC2 - AC2: 'Reply on RC2', Iris de Vries, 12 Oct 2022
Interactive discussion
Status: closed
-
RC1: 'Comment on egusphere-2022-568', Anonymous Referee #1, 02 Sep 2022
This paper proposes a ridge regression approach to the detection and attribution of externally forced changes in mean and extreme precipitation. This is an interesting idea that certainly merits exploration, but before devoting a lot of time to understanding the details of the paper and the results that are obtained, I think it is necessary for the authors to better explain their method and to situate it within the pantheon of methods that are already available for detection and attribution.
Ridge regression is a technique that “regularizes” regression problems, such as that described in equation (1) of the paper, in which the predictor variables contained in matrix X are multi-colinear. In the generalized least squares formulation of the regression used in detection and attribution this matrix is composed of model simulated estimates of the responses to external forcing in the form of space-time patterns of change. Depending on variable, period considered, domain of interest and how data are processed, the expected space-time patterns of responses to different forcing factors (often called fingerprints) can be strongly correlated, which results in a  regression “design matrix” X that may be ill-conditioned. Ridge regression is a technique that can be used to overcome this problem, although I imagine at the cost of introducimg some bias into the estimated signal scaling coefficients β. Note that referring to these coefficients as “fingerprints” seems unusual to me.
The concept of regularization, however, also arises in a second way in the detection and attribution problem. Considering again equation (1), the generalized least squares approach (and also its total least squares extension) requires knowledge of the variance-covariance matrix of the residuals ε, which are regarded as resulting from natural internal climate variability. Thus, the variance-covariance matrix is generally estimated from unforced control simulations, using as many climate-model simulated realisations of ε as possible. Even though many climate-model simulated realizations of ε are now generally available, the estimated variance-covariance matrix may not be of full rank or may remain uncertain. Thus, it is also often regularized, using an approach similar to the regularization used in ridge regression, but applied to the noise term rather than the signal term of equation (1). See Ribes et al (2013a, doi:10.1007/s00382-013-1735-7, and 2013b, doi:10.1007/s00382-013-1736-6). Presumably one would want to regularize both aspects of the problem, and also take signal uncertainty into account as is done in the total least squares approach to the regression problem (see again Ribes et al., 2013a and 2013b, and also Allen and Stott, 2003, doi:10.1007/s00382-003-0313-9).
How the combined model represented by equations (1-3) relates to existing techniques, and now the noise that results from internal variability comes into play and is accounted for in their subsequent application in the paper is not made clear, and I think should be clarified before results can be considered.
Also, I think it is necessary for the authors to discuss whether the proposed methods, which basically use linear statistical models that therefore implicitly assume Gaussian, or near Gaussian errors, are suitable for the data to which they are applied. Indicators of extreme precipitation, such as Rx1day at individual grid boxes, are certainly not Gaussian.
A final general comment is that the relatively heavy of use of acronyms in this paper is not very reader friendly.
Citation: https://doi.org/10.5194/egusphere-2022-568-RC1 - AC1: 'Reply on RC1', Iris de Vries, 08 Sep 2022
-
RC2: 'Comment on egusphere-2022-568', Anonymous Referee #2, 06 Sep 2022
Overall comments:
This study conducts a signal detection analysis for global changes in mean and extreme precipitation using three observational datasets and CMIP6 multi-model outputs. The authors apply a ridge regression (RR) method to construct fingerprints, which helps increase a signal-to-noise ratio of precipitation change patterns. Results show a robust detection of anthropogenic signals in all observations for both mean and extreme precipitation even when removing global mean trends, further supporting the human-induced intensification of global hydrological cycle. I find this paper very well written with sufficient details provided about methods as well as various sensitivity tests and therefore suggest publication after addressing some minor issues.
Â
Major comments:
1. Although method details are provided, it would be useful to explain more clearly what are benefits of the attribution approaches employed, including ridge regression, EOF-based metric for target variable, and GMST-based signal estimation. All of these procedures seem to contribute to increase signal-to-noise ratio but how they do and what step is more important. The authors provide some associated results from sensitivity tests but an overall explanation of their method possibly with a schematic would be helpful for readers to understand the contribution of each step to the final signal detection.
Â
2. An important motivation of considering different periods and datasets is opposing conclusions by previous studies about model overestimation or underestimation of the observed trends. I am wondering if the authors can go further and compare their results with some previous studies. For instance, if studies based on the latter half of 20th century trends find model underestimation, the authors can assess their model trends for the same/similar periods. Another point here is that the present study uses absolute units of precipitation while most of previous studies considered relative changes or aggregated values. It would be good to discuss possible influences of this difference.
Â
3. The lower detectability in GHCNDEX observations are suggested to be due to the poorer spatial coverage. Regarding this issue, I would suggest using Rx5d. As I understand, Rx5d has larger spatial coverage than Rx1d and comparison with Rx1d-based results may provide a way to support the authors’ interpretation. Another way would be to compare detection results from using a selected model run but with different spatial coverages applied.
Â
Minor comments:
L8: Indicating analysis period or trend period with signal detection would be useful here.
L17-19, L58-64: Better comparisons can be made by applying the same periods as those used in previous studies. See my major comment above.
L20-21: Is this confirmed by repeating detection analysis using NH-extratropics only?
L34: “discrepancies with respect to observations”. Its meaning is unclear.
L69-71: Need to explain what the previous studies have found additionally using these “data-science methods”. Also, what’s the novelty of this study compared with them? Is it detection based on spatial pattern information alone?
L108-109: “Trend biases due to this structural difference … negligible”. But the cited reference considered south-east Australia only?
L201: How to define S when global means are removed?
L212: “CMIP6 ssp245” should be “CMIP6 historical”?
L227: “virtually identical”. adding spatial correlation would help with this.
L314-316: This suggests possible dependence of Rx1d FRE on temperature, resembling global warming slowdown due to PDO influence?
L331-332: “results … hold when the global mean is used as FR target”. Then what are benefits of using EOF-based metric for target variable?
L382-383: “accuracy of the CMIP6 climate models in simulating the processes …”. It’s unclear how the authors get this conclusion. Observation-model agreement in residual variability? More explanation would be useful.
L394-395: “(not shown)”. This looks important and I suggest showing them in the supplement.
L428: “value of RR-based fingerprint construction”. What happens in detection or SNR without applying RR? See my major comment above.
Â
Citation: https://doi.org/10.5194/egusphere-2022-568-RC2 - AC2: 'Reply on RC2', Iris de Vries, 12 Oct 2022
Peer review completion
Journal article(s) based on this preprint
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
324 | 160 | 17 | 501 | 36 | 3 | 6 |
- HTML: 324
- PDF: 160
- XML: 17
- Total: 501
- Supplement: 36
- BibTeX: 3
- EndNote: 6
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
Iris Elisabeth de Vries
Sebastian Sippel
Angeline Greene Pendergrass
Reto Knutti
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(3433 KB) - Metadata XML
-
Supplement
(9191 KB) - BibTeX
- EndNote
- Final revised paper