the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Causal deep learning models for studying the Earth system: soil moisture-precipitation coupling in ERA5 data across Europe
Abstract. The Earth system is a complex non-linear dynamical system. Despite decades of research, many processes and relations between Earth system variables are still poorly understood. Current approaches for studying relations in the Earth system may be broadly divided into approaches based on numerical simulations and statistical approaches. However, there are several inherent limitations to current approaches that are, for example, high computational costs, reliance on the correct representation of relations in numerical models, strong assumptions related to linearity or locality, and the fallacy of correlation and causality.
Here, we propose a novel methodology combining deep learning (DL) and principles of causality research in an attempt to overcome these limitations. The methodology combines the recent idea of training and analyzing DL models to gain new scientific insights in the relations between input and target variables with a theorem from causality research. This theorem states that a statistical model may learn the causal impact of an input variable on a target variable if suitable additional input variables are included. As an illustrative example, we apply the methodology to study soil moisture-precipitation coupling in ERA5 climate reanalysis data across Europe. We demonstrate that, harnessing the great power and flexibility of DL models, the proposed methodology may yield new scientific insights into complex, nonlinear and non-local coupling mechanisms in the Earth system.
-
Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
-
Preprint
(9627 KB)
-
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(9627 KB) - Metadata XML
- BibTeX
- EndNote
- Final revised paper
Journal article(s) based on this preprint
Interactive discussion
Status: closed
-
RC1: 'Comment on egusphere-2022-105', Matthew Knepley, 12 Sep 2022
This paper was intended to "propose a novel methodology combining deep learning (DL) and principles of causality research". However, I do not believe it does so. It reiterates a standard theorem from causal models describing a causally sufficient set for some node X of a probabilistic graphical model. Then the authors claim to choose carefully such a set. If it were possible to do so apriori, there would be no confounding and no need for the causality formalism. After choosing this set, the interpolation of the joint probability distribution with a neural network follows standard practice. Since there is no real use of the mathematical formalism of causality, this cannot justify publication. Moreover, since "An extensive discussion of our results on soil moisture-precipitation coupling in terms of physical processes (e.g. Seneviratne et al., 2010; Santanello et al., 2018) and a comparison with results from other studies (e.g. Seneviratne et al., 2010; Taylor et al., 2012; Guillod et al., 2015; Tuttle and Salvucci, 2016; Imamovic et al., 2017) are postponed to a second paper", no new physical results are presented. Thus I recommend that the paper be rejected, and the authors submit a paper with the new physical insights included.
In the paper itself, some claims could be better supported by evidence. The authors claim that simulations are always more expensive than their deep learning scheme, but no data is provided. Simulations at what resolution? Is the cost of DNN training included? More nuance here would be helpful. Derivatives calculated from the DNN solution are used to quantify sensitivities and errors, but how accurate are these estimates? On page 17 , the authors stat that "In our example, the null hypothesis was rejected at a confidence level of 99 %", however it is later stated that only two samples were taken. This seems misleading at best. Clarification of what is meant by the 99% confidence level in this case would be very helpful.Citation: https://doi.org/10.5194/egusphere-2022-105-RC1 -
AC1: 'Reply on RC1', Tobias Tesch, 16 Sep 2022
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2022/egusphere-2022-105/egusphere-2022-105-AC1-supplement.pdf
-
AC1: 'Reply on RC1', Tobias Tesch, 16 Sep 2022
-
RC2: 'Comment on egusphere-2022-105', Anonymous Referee #2, 22 Sep 2022
The work seems to bring causality research in AI (more specifically Pearl, 2009) into hydrologic analysis. While I am no expert on causality analysis, it occurs to me there is some novelty in the authors' valiant effort in venturing into this realm alone and presenting a stab for hydrology, but there are also some concerns regarding clearly defining the real merit of the method. If the authors call for more research in this direction, the limitations and potential should be carefully discussed. The grand goal of the paper was to "learn causality", but the reality is that this is still very difficult from purely data-driven basis. I personally appreciate such explorations and think this concept is new to hydrology. I think the paper can be considered for publication after some substantial revisions. However, the authors will have to carefully qualify the applicability and limitations of the technique.
The most important issue --- as far as I can read, the key appears to be defining a sufficient set, which requires lots of subjective decisions and prior assumptions. The authors included previous-day precipitation and previous-day soil moisture because they think these variables will influence today's soil moisture. Also included are precipitation, daily temperature, humidity, wind. By the time you are done providing the sufficient set, you already need to inject lots of knowledge. We might wonder why we still need to run this causality test in the first place. I do see the point -- some of the decisions can be based on prior knowledge while the main causality gradient of interest (is soil moisture leading to more rainfall) may be unclear from our prior knowledge. This raises two issues: (i) there is only a niche of questions where this approach is meaningful: where we know enough to identify a causal graph and a sufficient set, but do not know the answer to the main question. This niche does exist; (ii) it will be much harder to apply where the causality or even the important factors are unknown, so the sales language of "learning a causality link" does not fit reality and should be carefully qualified.
As an initial demonstration the study also lacked a control experiment. In other words, if you replace today's soil moisture with a potential highly-correlated confounder, will the analysis show it is non-causal? This has not been demonstrated.
The writing of the article is also problematic:
(i) there should be a simple logical explanations for Theorem 1. I mean, the mathematical form can be accurate but does not help many people to understand the logic. You should translate this into simple, ordinary language. I don't believe the underlying logic is that remote.(ii) the Methods and Results are intermingled in an unhelpful way. Try to have more clear sections with dedicated functions.(iii) By the time I reach section 4 I am totally tired and cannot understand the rather complicated logic. Can you make this simpler?
Many unclear places:
(iv) How does the UNet represent the causal links in Figure 2? To my understanding all the inputs were treated in the same way(i) define "blocking a path"(ii) line 204 "further input variables" like what?(iiii) Page 6 needs lot of plain-language explanations.(iv) don't understanding "By including antecedent precipitation as input variable, or, in other words, conditioning on antecedent precipitation, we canexclude this correlation from our analysis."Citation: https://doi.org/10.5194/egusphere-2022-105-RC2 -
AC2: 'Reply on RC2', Tobias Tesch, 30 Sep 2022
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2022/egusphere-2022-105/egusphere-2022-105-AC2-supplement.pdf
-
AC2: 'Reply on RC2', Tobias Tesch, 30 Sep 2022
Interactive discussion
Status: closed
-
RC1: 'Comment on egusphere-2022-105', Matthew Knepley, 12 Sep 2022
This paper was intended to "propose a novel methodology combining deep learning (DL) and principles of causality research". However, I do not believe it does so. It reiterates a standard theorem from causal models describing a causally sufficient set for some node X of a probabilistic graphical model. Then the authors claim to choose carefully such a set. If it were possible to do so apriori, there would be no confounding and no need for the causality formalism. After choosing this set, the interpolation of the joint probability distribution with a neural network follows standard practice. Since there is no real use of the mathematical formalism of causality, this cannot justify publication. Moreover, since "An extensive discussion of our results on soil moisture-precipitation coupling in terms of physical processes (e.g. Seneviratne et al., 2010; Santanello et al., 2018) and a comparison with results from other studies (e.g. Seneviratne et al., 2010; Taylor et al., 2012; Guillod et al., 2015; Tuttle and Salvucci, 2016; Imamovic et al., 2017) are postponed to a second paper", no new physical results are presented. Thus I recommend that the paper be rejected, and the authors submit a paper with the new physical insights included.
In the paper itself, some claims could be better supported by evidence. The authors claim that simulations are always more expensive than their deep learning scheme, but no data is provided. Simulations at what resolution? Is the cost of DNN training included? More nuance here would be helpful. Derivatives calculated from the DNN solution are used to quantify sensitivities and errors, but how accurate are these estimates? On page 17 , the authors stat that "In our example, the null hypothesis was rejected at a confidence level of 99 %", however it is later stated that only two samples were taken. This seems misleading at best. Clarification of what is meant by the 99% confidence level in this case would be very helpful.Citation: https://doi.org/10.5194/egusphere-2022-105-RC1 -
AC1: 'Reply on RC1', Tobias Tesch, 16 Sep 2022
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2022/egusphere-2022-105/egusphere-2022-105-AC1-supplement.pdf
-
AC1: 'Reply on RC1', Tobias Tesch, 16 Sep 2022
-
RC2: 'Comment on egusphere-2022-105', Anonymous Referee #2, 22 Sep 2022
The work seems to bring causality research in AI (more specifically Pearl, 2009) into hydrologic analysis. While I am no expert on causality analysis, it occurs to me there is some novelty in the authors' valiant effort in venturing into this realm alone and presenting a stab for hydrology, but there are also some concerns regarding clearly defining the real merit of the method. If the authors call for more research in this direction, the limitations and potential should be carefully discussed. The grand goal of the paper was to "learn causality", but the reality is that this is still very difficult from purely data-driven basis. I personally appreciate such explorations and think this concept is new to hydrology. I think the paper can be considered for publication after some substantial revisions. However, the authors will have to carefully qualify the applicability and limitations of the technique.
The most important issue --- as far as I can read, the key appears to be defining a sufficient set, which requires lots of subjective decisions and prior assumptions. The authors included previous-day precipitation and previous-day soil moisture because they think these variables will influence today's soil moisture. Also included are precipitation, daily temperature, humidity, wind. By the time you are done providing the sufficient set, you already need to inject lots of knowledge. We might wonder why we still need to run this causality test in the first place. I do see the point -- some of the decisions can be based on prior knowledge while the main causality gradient of interest (is soil moisture leading to more rainfall) may be unclear from our prior knowledge. This raises two issues: (i) there is only a niche of questions where this approach is meaningful: where we know enough to identify a causal graph and a sufficient set, but do not know the answer to the main question. This niche does exist; (ii) it will be much harder to apply where the causality or even the important factors are unknown, so the sales language of "learning a causality link" does not fit reality and should be carefully qualified.
As an initial demonstration the study also lacked a control experiment. In other words, if you replace today's soil moisture with a potential highly-correlated confounder, will the analysis show it is non-causal? This has not been demonstrated.
The writing of the article is also problematic:
(i) there should be a simple logical explanations for Theorem 1. I mean, the mathematical form can be accurate but does not help many people to understand the logic. You should translate this into simple, ordinary language. I don't believe the underlying logic is that remote.(ii) the Methods and Results are intermingled in an unhelpful way. Try to have more clear sections with dedicated functions.(iii) By the time I reach section 4 I am totally tired and cannot understand the rather complicated logic. Can you make this simpler?
Many unclear places:
(iv) How does the UNet represent the causal links in Figure 2? To my understanding all the inputs were treated in the same way(i) define "blocking a path"(ii) line 204 "further input variables" like what?(iiii) Page 6 needs lot of plain-language explanations.(iv) don't understanding "By including antecedent precipitation as input variable, or, in other words, conditioning on antecedent precipitation, we canexclude this correlation from our analysis."Citation: https://doi.org/10.5194/egusphere-2022-105-RC2 -
AC2: 'Reply on RC2', Tobias Tesch, 30 Sep 2022
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2022/egusphere-2022-105/egusphere-2022-105-AC2-supplement.pdf
-
AC2: 'Reply on RC2', Tobias Tesch, 30 Sep 2022
Peer review completion
Journal article(s) based on this preprint
Model code and software
Causal deep learning models for studying the Earth system: soil moisture-precipitation coupling in ERA5 data across Europe - Software Code Tobias Tesch, Stefan Kollet, Jochen Garcke https://doi.org/10.5281/zenodo.6385040
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
715 | 183 | 19 | 917 | 17 | 8 |
- HTML: 715
- PDF: 183
- XML: 19
- Total: 917
- BibTeX: 17
- EndNote: 8
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
Cited
1 citations as recorded by crossref.
Stefan Kollet
Jochen Garcke
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(9627 KB) - Metadata XML