Causal deep learning models for studying the Earth system: soil moisture-precipitation coupling in ERA5 data across Europe

Tesch, Tobias; Kollet, Stefan; Garcke, Jochen

doi:https://doi.org/10.5194/egusphere-2022-105

Preprints

https://doi.org/10.5194/egusphere-2022-105

Preprints

17 May 2022

| 17 May 2022

Causal deep learning models for studying the Earth system: soil moisture-precipitation coupling in ERA5 data across Europe

Tobias Tesch, Stefan Kollet, and Jochen Garcke

Abstract. The Earth system is a complex non-linear dynamical system. Despite decades of research, many processes and relations between Earth system variables are still poorly understood. Current approaches for studying relations in the Earth system may be broadly divided into approaches based on numerical simulations and statistical approaches. However, there are several inherent limitations to current approaches that are, for example, high computational costs, reliance on the correct representation of relations in numerical models, strong assumptions related to linearity or locality, and the fallacy of correlation and causality.

Here, we propose a novel methodology combining deep learning (DL) and principles of causality research in an attempt to overcome these limitations. The methodology combines the recent idea of training and analyzing DL models to gain new scientific insights in the relations between input and target variables with a theorem from causality research. This theorem states that a statistical model may learn the causal impact of an input variable on a target variable if suitable additional input variables are included. As an illustrative example, we apply the methodology to study soil moisture-precipitation coupling in ERA5 climate reanalysis data across Europe. We demonstrate that, harnessing the great power and flexibility of DL models, the proposed methodology may yield new scientific insights into complex, nonlinear and non-local coupling mechanisms in the Earth system.

Received: 26 Mar 2022 – Discussion started: 17 May 2022

Download & links

Preprint (PDF, 9627 KB)

Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
Preprint (9627 KB)

Download & links

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Journal article(s) based on this preprint

20 Apr 2023

| Highlight paper

Causal deep learning models for studying the Earth system

Tobias Tesch, Stefan Kollet, and Jochen Garcke

Geosci. Model Dev., 16, 2149–2166, https://doi.org/10.5194/gmd-16-2149-2023,https://doi.org/10.5194/gmd-16-2149-2023, 2023

Short summary Executive editor

Tobias Tesch et al.

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2022-105', Matthew Knepley, 12 Sep 2022

This paper was intended to "propose a novel methodology combining deep learning (DL) and principles of causality research". However, I do not believe it does so. It reiterates a standard theorem from causal models describing a causally sufficient set for some node X of a probabilistic graphical model. Then the authors claim to choose carefully such a set. If it were possible to do so apriori, there would be no confounding and no need for the causality formalism. After choosing this set, the interpolation of the joint probability distribution with a neural network follows standard practice. Since there is no real use of the mathematical formalism of causality, this cannot justify publication. Moreover, since "An extensive discussion of our results on soil moisture-precipitation coupling in terms of physical processes (e.g. Seneviratne et al., 2010; Santanello et al., 2018) and a comparison with results from other studies (e.g. Seneviratne et al., 2010; Taylor et al., 2012; Guillod et al., 2015; Tuttle and Salvucci, 2016; Imamovic et al., 2017) are postponed to a second paper", no new physical results are presented. Thus I recommend that the paper be rejected, and the authors submit a paper with the new physical insights included.

In the paper itself, some claims could be better supported by evidence. The authors claim that simulations are always more expensive than their deep learning scheme, but no data is provided. Simulations at what resolution? Is the cost of DNN training included? More nuance here would be helpful. Derivatives calculated from the DNN solution are used to quantify sensitivities and errors, but how accurate are these estimates? On page 17 , the authors stat that "In our example, the null hypothesis was rejected at a confidence level of 99 %", however it is later stated that only two samples were taken. This seems misleading at best. Clarification of what is meant by the 99% confidence level in this case would be very helpful.

Citation: https://doi.org/10.5194/egusphere-2022-105-RC1
- AC1: 'Reply on RC1', Tobias Tesch, 16 Sep 2022
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2022/egusphere-2022-105/egusphere-2022-105-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2022-105-AC1
RC2:
'Comment on egusphere-2022-105', Anonymous Referee #2, 22 Sep 2022

The work seems to bring causality research in AI (more specifically Pearl, 2009) into hydrologic analysis. While I am no expert on causality analysis, it occurs to me there is some novelty in the authors' valiant effort in venturing into this realm alone and presenting a stab for hydrology, but there are also some concerns regarding clearly defining the real merit of the method. If the authors call for more research in this direction, the limitations and potential should be carefully discussed. The grand goal of the paper was to "learn causality", but the reality is that this is still very difficult from purely data-driven basis. I personally appreciate such explorations and think this concept is new to hydrology. I think the paper can be considered for publication after some substantial revisions. However, the authors will have to carefully qualify the applicability and limitations of the technique.

The most important issue --- as far as I can read, the key appears to be defining a sufficient set, which requires lots of subjective decisions and prior assumptions. The authors included previous-day precipitation and previous-day soil moisture because they think these variables will influence today's soil moisture. Also included are precipitation, daily temperature, humidity, wind. By the time you are done providing the sufficient set, you already need to inject lots of knowledge. We might wonder why we still need to run this causality test in the first place. I do see the point -- some of the decisions can be based on prior knowledge while the main causality gradient of interest (is soil moisture leading to more rainfall) may be unclear from our prior knowledge. This raises two issues: (i) there is only a niche of questions where this approach is meaningful: where we know enough to identify a causal graph and a sufficient set, but do not know the answer to the main question. This niche does exist; (ii) it will be much harder to apply where the causality or even the important factors are unknown, so the sales language of "learning a causality link" does not fit reality and should be carefully qualified.

As an initial demonstration the study also lacked a control experiment. In other words, if you replace today's soil moisture with a potential highly-correlated confounder, will the analysis show it is non-causal? This has not been demonstrated.

The writing of the article is also problematic:

(i) there should be a simple logical explanations for Theorem 1. I mean, the mathematical form can be accurate but does not help many people to understand the logic. You should translate this into simple, ordinary language. I don't believe the underlying logic is that remote.(ii) the Methods and Results are intermingled in an unhelpful way. Try to have more clear sections with dedicated functions.(iii) By the time I reach section 4 I am totally tired and cannot understand the rather complicated logic. Can you make this simpler?

Many unclear places:

(iv) How does the UNet represent the causal links in Figure 2? To my understanding all the inputs were treated in the same way(i) define "blocking a path"(ii) line 204 "further input variables" like what?(iiii) Page 6 needs lot of plain-language explanations.(iv) don't understanding "By including antecedent precipitation as input variable, or, in other words, conditioning on antecedent precipitation, we canexclude this correlation from our analysis."

Citation: https://doi.org/10.5194/egusphere-2022-105-RC2
- AC2: 'Reply on RC2', Tobias Tesch, 30 Sep 2022
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2022/egusphere-2022-105/egusphere-2022-105-AC2-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2022-105-AC2

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2022-105', Matthew Knepley, 12 Sep 2022

This paper was intended to "propose a novel methodology combining deep learning (DL) and principles of causality research". However, I do not believe it does so. It reiterates a standard theorem from causal models describing a causally sufficient set for some node X of a probabilistic graphical model. Then the authors claim to choose carefully such a set. If it were possible to do so apriori, there would be no confounding and no need for the causality formalism. After choosing this set, the interpolation of the joint probability distribution with a neural network follows standard practice. Since there is no real use of the mathematical formalism of causality, this cannot justify publication. Moreover, since "An extensive discussion of our results on soil moisture-precipitation coupling in terms of physical processes (e.g. Seneviratne et al., 2010; Santanello et al., 2018) and a comparison with results from other studies (e.g. Seneviratne et al., 2010; Taylor et al., 2012; Guillod et al., 2015; Tuttle and Salvucci, 2016; Imamovic et al., 2017) are postponed to a second paper", no new physical results are presented. Thus I recommend that the paper be rejected, and the authors submit a paper with the new physical insights included.

In the paper itself, some claims could be better supported by evidence. The authors claim that simulations are always more expensive than their deep learning scheme, but no data is provided. Simulations at what resolution? Is the cost of DNN training included? More nuance here would be helpful. Derivatives calculated from the DNN solution are used to quantify sensitivities and errors, but how accurate are these estimates? On page 17 , the authors stat that "In our example, the null hypothesis was rejected at a confidence level of 99 %", however it is later stated that only two samples were taken. This seems misleading at best. Clarification of what is meant by the 99% confidence level in this case would be very helpful.

Citation: https://doi.org/10.5194/egusphere-2022-105-RC1
- AC1: 'Reply on RC1', Tobias Tesch, 16 Sep 2022
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2022/egusphere-2022-105/egusphere-2022-105-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2022-105-AC1
RC2:
'Comment on egusphere-2022-105', Anonymous Referee #2, 22 Sep 2022

The work seems to bring causality research in AI (more specifically Pearl, 2009) into hydrologic analysis. While I am no expert on causality analysis, it occurs to me there is some novelty in the authors' valiant effort in venturing into this realm alone and presenting a stab for hydrology, but there are also some concerns regarding clearly defining the real merit of the method. If the authors call for more research in this direction, the limitations and potential should be carefully discussed. The grand goal of the paper was to "learn causality", but the reality is that this is still very difficult from purely data-driven basis. I personally appreciate such explorations and think this concept is new to hydrology. I think the paper can be considered for publication after some substantial revisions. However, the authors will have to carefully qualify the applicability and limitations of the technique.

The most important issue --- as far as I can read, the key appears to be defining a sufficient set, which requires lots of subjective decisions and prior assumptions. The authors included previous-day precipitation and previous-day soil moisture because they think these variables will influence today's soil moisture. Also included are precipitation, daily temperature, humidity, wind. By the time you are done providing the sufficient set, you already need to inject lots of knowledge. We might wonder why we still need to run this causality test in the first place. I do see the point -- some of the decisions can be based on prior knowledge while the main causality gradient of interest (is soil moisture leading to more rainfall) may be unclear from our prior knowledge. This raises two issues: (i) there is only a niche of questions where this approach is meaningful: where we know enough to identify a causal graph and a sufficient set, but do not know the answer to the main question. This niche does exist; (ii) it will be much harder to apply where the causality or even the important factors are unknown, so the sales language of "learning a causality link" does not fit reality and should be carefully qualified.

As an initial demonstration the study also lacked a control experiment. In other words, if you replace today's soil moisture with a potential highly-correlated confounder, will the analysis show it is non-causal? This has not been demonstrated.

The writing of the article is also problematic:

(i) there should be a simple logical explanations for Theorem 1. I mean, the mathematical form can be accurate but does not help many people to understand the logic. You should translate this into simple, ordinary language. I don't believe the underlying logic is that remote.(ii) the Methods and Results are intermingled in an unhelpful way. Try to have more clear sections with dedicated functions.(iii) By the time I reach section 4 I am totally tired and cannot understand the rather complicated logic. Can you make this simpler?

Many unclear places:

(iv) How does the UNet represent the causal links in Figure 2? To my understanding all the inputs were treated in the same way(i) define "blocking a path"(ii) line 204 "further input variables" like what?(iiii) Page 6 needs lot of plain-language explanations.(iv) don't understanding "By including antecedent precipitation as input variable, or, in other words, conditioning on antecedent precipitation, we canexclude this correlation from our analysis."

Citation: https://doi.org/10.5194/egusphere-2022-105-RC2
- AC2: 'Reply on RC2', Tobias Tesch, 30 Sep 2022
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2022/egusphere-2022-105/egusphere-2022-105-AC2-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2022-105-AC2

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload

AR by Tobias Tesch on behalf of the Authors (30 Nov 2022) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (28 Dec 2022) by Richard Mills

RR by Chaopeng Shen (22 Jan 2023)

ED: Publish as is (08 Feb 2023) by Richard Mills

AR by Tobias Tesch on behalf of the Authors (25 Mar 2023) Manuscript

Journal article(s) based on this preprint

20 Apr 2023

| Highlight paper

Causal deep learning models for studying the Earth system

Tobias Tesch, Stefan Kollet, and Jochen Garcke

Geosci. Model Dev., 16, 2149–2166, https://doi.org/10.5194/gmd-16-2149-2023,https://doi.org/10.5194/gmd-16-2149-2023, 2023

Short summary Executive editor

Tobias Tesch et al.

Model code and software

Causal deep learning models for studying the Earth system: soil moisture-precipitation coupling in ERA5 data across Europe - Software Code Tobias Tesch, Stefan Kollet, Jochen Garcke https://doi.org/10.5281/zenodo.6385040

Tobias Tesch et al.

Viewed

Total article views: 917 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
715	183	19	917	17	8

HTML: 715
PDF: 183
XML: 19
Total: 917
BibTeX: 17
EndNote: 8

Views and downloads (calculated since 17 May 2022)

Month	HTML	PDF	XML	Total
May 2022	128	40	7	175
Jun 2022	61	20	0	81
Jul 2022	41	7	1	49
Aug 2022	57	16	1	74
Sep 2022	94	28	7	129
Oct 2022	63	14	2	79
Nov 2022	50	8	0	58
Dec 2022	34	5	0	39
Jan 2023	40	10	1	51
Feb 2023	59	20	0	79
Mar 2023	56	10	0	66
Apr 2023	32	5	0	37

Cumulative views and downloads (calculated since 17 May 2022)

Month	HTML	PDF	XML	Total
May 2022	128	40	7	175
Jun 2022	61	20	0	81
Jul 2022	41	7	1	49
Aug 2022	57	16	1	74
Sep 2022	94	28	7	129
Oct 2022	63	14	2	79
Nov 2022	50	8	0	58
Dec 2022	34	5	0	39
Jan 2023	40	10	1	51
Feb 2023	59	20	0	79
Mar 2023	56	10	0	66
Apr 2023	32	5	0	37

Viewed (geographical distribution)

Total article views: 844 (including HTML, PDF, and XML) Thereof 844 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 20 Apr 2023

Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Preprint (9627 KB)
Metadata XML

Many papers are currently being published applying deep learning to geoscientific applications. However, most of them only offer proof of concept results on highly idealised scenarios. This paper combines deep learning approaches with the structural causal models popularized by the work of Judea Pearl, and it applies this methodology to a real problem, analyzing soil moisture-precipitation coupling in climate reanalysis data. In contrast to many papers in this field, this promises actual insight in the scientific application of the work.

Short summary

A recent statistical approach for studying relations in the Earth system is to train deep learning (DL) models to predict Earth system variables given one or several others, and use interpretable DL to analyze the relations learned by the models. Here, we propose to combine the approach with a theorem from causality research to ensure that the deep learning model learns causal rather than spurious relations. As an example, we apply the method to study soil moisture-precipitation coupling.


Total:	0
HTML:	0
PDF:	0
XML:	0