<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" specific-use="SMUR" dtd-version="3.0" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">EGUsphere</journal-id>
<journal-title-group>
<journal-title>EGUsphere</journal-title>
<abbrev-journal-title abbrev-type="publisher">EGUsphere</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">EGUsphere</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub"></issn>
<publisher><publisher-name>Copernicus Publications</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5194/egusphere-2026-2433</article-id>
<title-group>
<article-title>The Calibrated Rapid Assimilation and Forecasting Technique (CRAFT) for Earth system and ecological modeling using machine learning and Bayesian estimation</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Robbins</surname>
<given-names>Zachary James</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Tiede</surname>
<given-names>Lucas</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Koven</surname>
<given-names>Charlie</given-names>
<ext-link>https://orcid.org/0000-0002-3367-0065</ext-link>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Knox</surname>
<given-names>Ryan</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>McDowell</surname>
<given-names>Nate</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Xu</surname>
<given-names>Chonggang</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
</contrib-group><aff id="aff1">
<label>1</label>
<addr-line>Los Alamos National Lab, Los Alamos, NM, USA</addr-line>
</aff>
<aff id="aff2">
<label>2</label>
<addr-line>Lawrence Berkley National Lab, Berkley, CA, USA</addr-line>
</aff>
<aff id="aff3">
<label>3</label>
<addr-line>Pacific Northwest National Lab, Richland, WA, USA</addr-line>
</aff>
<pub-date pub-type="epub">
<day>05</day>
<month>06</month>
<year>2026</year>
</pub-date>
<volume>2026</volume>
<fpage>1</fpage>
<lpage>21</lpage>
<permissions>
<copyright-statement>Copyright: &#x000a9; 2026 Zachary James Robbins et al.</copyright-statement>
<copyright-year>2026</copyright-year>
<license license-type="open-access">
<license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri"  xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p>
</license>
</permissions>
<self-uri xlink:href="https://egusphere.copernicus.org/preprints/2026/egusphere-2026-2433/">This article is available from https://egusphere.copernicus.org/preprints/2026/egusphere-2026-2433/</self-uri>
<self-uri xlink:href="https://egusphere.copernicus.org/preprints/2026/egusphere-2026-2433/egusphere-2026-2433.pdf">The full text article is available as a PDF file from https://egusphere.copernicus.org/preprints/2026/egusphere-2026-2433/egusphere-2026-2433.pdf</self-uri>
<abstract>
<p>Increasing the mechanistic complexity of Earth system and ecological model provides the opportunity for improved understanding with numerical experimentation. However, complexity additionally presents greater difficulty in constraining parameters with data. Determining plausible parameter combinations requires a method by which to incorporate data streams, field observations, and their uncertainty. Bayesian methods of integrating datasets are often limited by the computational limits in running these complex mechanistic models. Machine learning can effectively integrate data and physical models by constructing emulations for the finite simulations needed for parameterization. We present the CRAFT (Calibrated Rapid Assimilation and Forecasting Technique) framework for ecological model parameterization and test it using the mechanism rich ecosystem demographic model, FATES-HYDRO (testing 42 parameters and evaluating it for 6 outputs). This framework uses emulation and parameter reduction to construct more rapidly running emulators and test posterior parametric distribution given observational data. We assess whether this mechanism can emulate the model outputs, the variance across the parameter space, and in future prediction (simulations 2020&amp;ndash;2100,) using synthetic model runs. Overall random forest models had an out-of-sample accuracy of 92&amp;ndash;99 % in reconstructing observational periods and showed no-significant difference with the physical model for change in most parameters (283/288 parameter and output combinations). 95 % CI posterior ranges of parameters produced FATES-HYDRO runs that had an RMSE for gross primary productivity (GPP) of 3.748 g C month&lt;sup&gt;-1&lt;/sup&gt;, for evapotranspiration (ET) 1.33 mm H&lt;sub&gt;2&lt;/sub&gt;O month&lt;sup&gt;-1&lt;/sup&gt;, for soil moisture 0.005 m&lt;sup&gt;2 &lt;/sup&gt;m&lt;sup&gt;-2&lt;/sup&gt;, 0.381 MPa for maximum leaf water potential (LWP&lt;sub&gt;max&lt;/sub&gt;), 0.44 MPa for minimum leaf water potential (LWP&lt;sub&gt;min&lt;/sub&gt;), 1.80 m&lt;sup&gt;2&lt;/sup&gt; m&lt;sup&gt;-1&lt;/sup&gt; for runoff (RO) when compared to the synthetic data. Future simulations had a RMSE for GPP of 22.55 gC m&lt;sup&gt;-2 &lt;/sup&gt;month&lt;sup&gt;-1&lt;/sup&gt;, ET had a RMSE of 7.82 mm H&lt;sub&gt;2&lt;/sub&gt;O month&lt;sup&gt;-1&lt;/sup&gt;, RO had an RMSE of 88.80 mm H&lt;sub&gt;2&lt;/sub&gt;O month&lt;sup&gt;-1&lt;/sup&gt;, monthly leaf water potential had an RMSE of 0.145 MPa, soil water content at 20 cm had an RMSE of 0.0138 m&lt;sup&gt;2 &lt;/sup&gt;m&lt;sup&gt;-2&lt;/sup&gt; when compared to the synthetic dataset. Overall, we show the CRAFT framework as a rapid and accurate semi-automated method to assimilate data and calculate posterior distributions in complex physical models. This framework could accelerate our scientific discovery through rapid accuracy improvement in process-based modeling and more mathematically robust prediction with constrained uncertainty in model parameters.</p>
</abstract>
<counts><page-count count="21"/></counts>
<funding-group>
<award-group id="gs1">
<funding-source>U.S. Department of Energy</funding-source>
<award-id>Next Generation Ecosystem Experiments-Tropics: Triad National Security, LLC (“Triad”) Contract grant # 89233218CNA000001</award-id>
</award-group>
</funding-group>
</article-meta>
</front>
<body/>
<back>
</back>
</article>