Paleoclimate data assimilation with adaptive observation error inflation and adaptive localization
Abstract. Paleoclimate data assimilation methods significantly enhance the accuracy, spatiotemporal continuity, and global relevance of climate reconstructions by integrating Earth system models with proxy records. In this study, we further improve the algorithm by implementing two adaptive strategies—adaptive observation error inflation and adaptive localization—and systematically evaluate their performance in reconstructing temperature data over equatorial regions. For the adaptive observation error inflation experiments, two distinct methods were employed: the Adaptive observation error inflation (AOEI) method, which yields significant extreme improvements in specific regions but introduces notable local biases, and Huber Robust Estimation (HAOEI) method, which provides more robust and spatially consistent enhancement overall. In the adaptive localization experiments, observational density and correlation data were utilized to adjust the localization radius and weight matrix at each grid point. This approach effectively leverages sparse observational information, reduces spurious teleconnections, accurately reproduces the spatial structure of dominant climate variability modes, and optimizes the overall stability of the analyzed field.
Competing interests: At least one of the (co-)authors is a member of the editorial board of Geoscientific Model Development.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Paleoclimate data assimilation with adaptive observation error inflation and adaptive localization
Luo et al.
This manuscript proposes an adaptive observation error inflation and an adaptive localization for paleoclimate data assimilation. Since paleoclimate has large uncertainties for prior estimates and proxies, it is important to constrain the state by using the proxy information as much as possbile. Thus, covariance inflation and localization are essential components for paleoclimate data assimilation. The manuscript is well written. Please see my specific comments as below.
1. I think a flavor of EnKF is used for proxy data assimilation. Please give a brief introduction of the assimilation method, and how inflation and localization are applied within the assimilation framework.
2. Sections 2.1 and 2.2, compared to AOEI, HAOEI introduces a new parameter delta to scale the inflation, to avoid too aggressive inflation. However, HAOEI needs an empirically determined parameter delta, which adds additional uncertainties.
3. Although I personally can see the purpose of HAOEI, the foundation for HAOEI, or advantages of HAOEI over AOEI, needs more interpretation.
4. Section 2.3, for both localization radius L based on KDE and correlation estimated from time series of variables and proxies, it is unclear why these two techniques are proposed, and why they are superior to GC and other existing adaptive localization methods.
5. Moreover, as in my previous comment, there are empirical parameters introduced here too, as the bandwidth h, Lmin. Then more uncertainties are introduced, and it is hard to generalize the usage of such an adaptive method.
6. Section 3.2.1, the choice of the parameter delta and associated impacts on the assimilation results needs be discussed here.
Why the adaptive observation error inflation has impact on the region with large RMSEs around (20N 30E)?
7. Lines 213-216, do you use online or offline ensemble-based data assimilation? If it is offline, how does the imbalance occur?
8. Furthermore, paleoclimate data assimilation is quite different from the convective-scale DA, due to sparse proxies and large uncertainties. Please explain why a small localization lengthscale expected?