<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" specific-use="SMUR" dtd-version="3.0" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">EGUsphere</journal-id>
<journal-title-group>
<journal-title>EGUsphere</journal-title>
<abbrev-journal-title abbrev-type="publisher">EGUsphere</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">EGUsphere</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub"></issn>
<publisher><publisher-name>Copernicus Publications</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5194/egusphere-2025-4650</article-id>
<title-group>
<article-title>Cause-effect discovery in Hydrometeorological Systems: Evaluation of Causal Discovery methods</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Yadav</surname>
<given-names>Vivek Kumar</given-names>
<ext-link>https://orcid.org/0009-0000-7156-4450</ext-link>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Peel</surname>
<given-names>Murray</given-names>
<ext-link>https://orcid.org/0000-0002-3255-3692</ext-link>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Fowler</surname>
<given-names>Keirnan</given-names>
<ext-link>https://orcid.org/0000-0002-1983-0253</ext-link>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Ryu</surname>
<given-names>Dongryeol</given-names>
<ext-link>https://orcid.org/0000-0002-5335-6209</ext-link>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Vishwakarma</surname>
<given-names>Bramha Dutt</given-names>
<ext-link>https://orcid.org/0000-0003-4787-8470</ext-link>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
</contrib-group><aff id="aff1">
<label>1</label>
<addr-line>Interdisciplinary Centre for Water Research, Indian Institute of Science, Bengaluru, 560012, India</addr-line>
</aff>
<aff id="aff2">
<label>2</label>
<addr-line>Faculty of Engineering and Information Technology, The University of Melbourne, Melbourne 3010, Victoria, Australia</addr-line>
</aff>
<pub-date pub-type="epub">
<day>05</day>
<month>11</month>
<year>2025</year>
</pub-date>
<volume>2025</volume>
<fpage>1</fpage>
<lpage>51</lpage>
<permissions>
<copyright-statement>Copyright: &#x000a9; 2025 Vivek Kumar Yadav et al.</copyright-statement>
<copyright-year>2025</copyright-year>
<license license-type="open-access">
<license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri"  xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p>
</license>
</permissions>
<self-uri xlink:href="https://egusphere.copernicus.org/preprints/2025/egusphere-2025-4650/">This article is available from https://egusphere.copernicus.org/preprints/2025/egusphere-2025-4650/</self-uri>
<self-uri xlink:href="https://egusphere.copernicus.org/preprints/2025/egusphere-2025-4650/egusphere-2025-4650.pdf">The full text article is available as a PDF file from https://egusphere.copernicus.org/preprints/2025/egusphere-2025-4650/egusphere-2025-4650.pdf</self-uri>
<abstract>
<p>Identifying the driver(s) of a process or phenomenon is central to understanding and predicting its future state. In complex hydrometeorological systems, a process can have multiple drivers dynamically coupled to the system across timescales. Thus, a robust method to identify drivers is imperative. In hydrological sciences, methods like multivariate regression and, more recently, Big Data machine-learning approaches rely on finding a &lt;em&gt;co&lt;/em&gt;-relation between variables, rather than identifying cause-effect relations. This study evaluates cause-effect discovery (Causal Discovery or CD) algorithms in hydrometeorological systems. Although earlier studies have made important contributions to exploring CD methods, they have primarily focused on bivariate methods in simple synthetic environments. Specifically, we evaluate the following four theoretically distinct multivariate CD algorithms, (i) TCDF (ii) VARLiNGAM, (iii) PCMCI+, and (iv) DYNOTEARS. We evaluate these algorithms within a large, complex simulated environment of the Global Land Data Assimilation System (GLDAS) where the drivers, reference truth, are known perfectly. We evaluate the drivers identified by CD methods against this reference truth and also contrast its results with the widely used method of &lt;em&gt;co&lt;/em&gt;-relation identification, Pearson&amp;rsquo;s Correlation Coefficient (PCC). The results show that CD methods identify fewer false drivers compared to PCC, across a range of K&amp;ouml;ppen-Geiger climate types. For example, PCC failed to distinguish true drivers from instantaneous and lagged cross-correlations, typically present in hydrometeorological systems. Whereas, CD methods eliminate a higher number of false instantaneous and lagged drivers. Thus, though PCC identifies the highest number of true drivers, it suffers from high false drivers. Overall, CD methods perform similar to or better than PCC, while PCMCI+ and DYNOTEARS performed the best. Further, we test whether time-series prediction models perform better when predictors are limited to those identified as causal by CD methods. Evaluation of surface soil moisture predictions during drought shows that CD-based models outperform PCC-based models and are more parsimonious. Thus, we demonstrate the effectiveness of using causal discovery to eliminate spurious relations and obtain a robust set of drivers for prediction and process understanding across different climate conditions. This study overviews, demonstrates and tests efficacy of CD methods in studying cause-effect relations in hydrometeorological systems. By exposing their capabilities and differences in a simulated environment, we hope to encourage their use in the real world and move beyond &lt;em&gt;co&lt;/em&gt;-relation.</p>
</abstract>
<counts><page-count count="51"/></counts>
</article-meta>
</front>
<body/>
<back>
</back>
</article>