the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Quantifying the minimum ensemble size for asymptotic accuracy of the ensemble Kalman filter using the degrees of instability
Abstract. The ensemble Kalman filter (EnKF) is widely used for state estimation in chaotic dynamical systems, including the atmosphere and ocean. However, the required ensemble size for accurate state estimation remains unclear. In this study, we define filter accuracy based on its time-asymptotic performance relative to the observation noise. We then investigate the minimum ensemble size, m*, required to achieve this accuracy, linking it to the degrees of instability in the chaotic dynamics. Since the well-defined characteristic numbers of dynamical systems called the Lyapunov exponents (LEs) quantify the timeasymptotic exponential growth or decay rates of infinitesimal perturbations, we define the degrees of instability N+ by the number of positive LEs. In the EnKF, capturing such instabilities with limited ensemble is crucial for achieving long-term filter accuracy. Therefore, we propose an ensemble spin-up and downsizing method within data assimilation cycles. Numerical experiments applying the EnKF to the Lorenz 96 model show that the minimum ensemble size required for filter accuracy is estimated by m* = N+ +1. This study provides a practical estimate for the minimum ensemble size based on a priori information about the target dynamics, along with a method to achieve long-term accuracy.
Competing interests: Some authors are members of the editorial board of journal NPG.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.- Preprint
(908 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on egusphere-2025-5144', Anonymous Referee #1, 01 Dec 2025
-
AC1: 'Reply on RC1', Kota Takeda, 06 Jan 2026
We thank the editor and the reviewers for their careful reading of our manuscript and for their constructive comments. The suggestions have helped to improve the clarity and presentation of the paper.
Below, we provide point-by-point responses to all reviewer comments. Reviewer comments are reproduced in italics, followed by our responses.
In this manuscript, two goals are introduced: 1) the manuscript tries to demonstrate that the EnKF can asymptotically converge to a below observation errors as long as the ensemble member is more than the dimension of the unstable subspace. 2) the manuscript proposes an ensemble downsizing approach, which is supposed to accelerate the convergence of the ensemble Kalman filter (EnKF) with smaller ensemble size. However, I find this manuscript can be benefited from a more in-depth analysis, and therefore recommend for at least a major revision.
Response: We thank the reviewer for the careful assessment of the manuscript and for clearly summarizing its two main objectives. We agree that a more in-depth analysis can further strengthen the paper, and we address this point in detail in our responses to the specific comments below.
Major Comments
In L61 - 62, the manuscripts claims that it addresses the time-asymptotic accuracy of the ETKF instead of the time-averaged analysis error within a finite-time interval. However, I find it difficult to understand the differences between them. In the referenced literature, the time-average is typically calculated after a spin-up/burn-in time when the filters are converged, which is very similar to time-asymptotic accuracy. In fact, the results in Fig.2 are in general consistent with previous studies, e.g., with an observation error of 1, in Fig. 3 of Bocquet, M.: Ensemble Kalman filtering without the intrinsic need for inflation, Nonlin. Processes Geophys., 18, 735–750, https://doi.org/10.5194/npg-18-735-2011, 2011.
Response: Thank you for your accurate comment. Indeed, the time-asymptotic accuracy defined in Eq. (17) is closely related to the time-averaged RMSE after a spin-up period. Therefore, we will revise this statement. Instead, we will revise lines 61-62 to explain the distinction in criteria to define the minimum ensemble size between our approach using Eq. (17) and existing approaches such as Bocquet, M.: Ensemble Kalman filtering without the intrinsic need for inflation, Nonlin. Processes Geophys., 18, 735–750, https://doi.org/10.5194/npg-18-735-2011, 2011. Specifically, the key distinction concerns whether the observation noise level is fixed or treated as an asymptotic parameter. We aim to remove dependence on the choice of observation noise level to define the minimum ensemble size.
However, Fig.2 is still an interesting figure. The filter accuracy, as defined by Eq.(17), is violated for large observation errors. This accuracy only starts to appear from ensemble size larger than the unstable subspace (m = 14). These results show clear importance of the accuracy definition of Eq. (17) and the observation noises. Could the authors provide references and explanations of Eq.(17)?
Response: We will add appropriate references for Eq. (17). We use the notion of filter accuracy defined in Eq. (17) for the following two reasons.
(i) In the context of theoretical error analysis of filtering methods, including the EnKF, ‘time-asymptotic filter accuracy’ is commonly employed (Kelly et al., 2014; Kelly and Stuart, 2019; Takeda and Sakajo, 2024; Biswas and Branicki, 2024; Sanz-Alonso and Waniorek, 2025). Because this definition is based on asymptotic limits of time and observation noise level, it is well suited for rigorous mathematical analysis. Our study adopts this evaluation precisely with a view toward future theoretical developments.
(ii) The expected squared error is always greater than or equal to J times the squared RMSE, and the supremum over time is, in general, larger than the corresponding time-averaged error. Therefore, filter accuracy in the sense of Eq. (17) provides a more stringent criterion than the time-averaged RMSE. For this reason, we adopt this definition in our analysis (see Sect. 1 in Supplement for a comparison of error evaluation criteria).
It would be also of interests to have an explanation for the sensitivity to observation noises, which is absent from the manuscript. One possible explanation might be the observation noises affecting the linearity which is briefly demonstrate in the Appendix of in Marc Bocquet & Alberto Carrassi (2017) Four-dimensional ensemble variational data assimilation and the unstable subspace, Tellus A: Dynamic Meteorology and Oceanography, 69:1, 1304504, DOI: 10.1080/16000870.2017.1304504. If the linearity is the cause, would smaller observation interval allow for larger observation noises? Nevertheless, these results seem to suggest that to get accurate analysis, an ensemble size of 14 requires very stringent condition.
Response: We will incorporate a discussion on this point in the revised manuscript. In mathematical analyses, the assumption of the small observation noise is required for removing higher-order term in r in the analysis error and obtaining the bound of order r^2. An example of such an argument can be found, for instance, in Corollary 3.5 of Takeda and Sakajo (2024) Uniform Error Bounds of the Ensemble Transform Kalman Filter for Chaotic Dynamics with Multiplicative Covariance Inflation, SIAM/ASA J. Uncertainty Quantification, 12, 1315–1335, https://doi.org/10.1137/24M1637192, 2024. However, we may perform similar arguments with an assumption of the small observation interval instead of small observation noise.
The manuscript claims that the ensembled downsizing method can "mitigate the slow convergence of uncertainty". However, this is not clearly shown in available Figures. In the comparison between N_spinup = 0 and N_spinup = 720 in Fig.3, it is not clear that N_spinup=720 converges faster. Moreover, the figure presents m=15 which is different from m=14 as claimed in the conclusion section. However, this figure indeed shows only a smaller inflation factor is needed. However, it is not clear if this effect is related to the unstable-neutral subspace. Is it possible that the initial ensemble size mitigates the sampling errors? I wonder if it would be useful to investigate the alignment of error covariance matrix and Lyapunov vectors over time for the ensemble downsizing method.
Response: Thank you for your detailed comments. We will perform additional targeted numerical experiments to clarify these points.
(i) To clarify faster convergence with N_spinup=720, we will change Figure 3 by that with m=14 (one of panels will correspond to (a) in Figure 4).
(ii) To clarify the effect of the ensemble downsizing method related to the unstable subspace (which we are interested in), we will perform experiments with initial ensemble which has accurate mean close to the true state and inaccurate perturbations not aligned in the unstable subspace, and compare results for N_spinup=0, 720.These experiments can distinguish whether the ensemble downsizing method improves alignments of ensemble perturbations (as well as faster convergence to the true state).
A detailed quantitative analysis of the alignment between ensemble covariance eigenvectors and Lyapunov vectors is an important topic, but we consider it beyond the scope of the present manuscript and leave it for future work.
The motivation of F=16 is not very clear to me. Is it just used to validate the results for F=8?
Response: That is correct. This choice was made to confirm that the proposed approach remains effective even when N+ varies as a result of changes in the external forcing.
Minor Comments
L19: "For continuous-time non-autonomous dynamical system, ..., at least one of the LEs is zero". With this condition, this is still not completely correct for certain dissipative systems, I think.
Response: Thank you for your detailed comment. In the manuscript, we focus on autonomous systems. Therefore, we will rephrase the sentence to specify autonomous dynamical systems only.
L28: (ETKF, Bishop et al., 2001). If LaTex is used, try \citep[ETKF,][]{ref}. The same follows for cases in L140, L146.
Response: We will revise them accordingly.
L77, L117, L175: No bracket for the reference.
Response: We will revise them accordingly.
L106: Consistency for v_j(t) or v_j.
Response: We will revise the equation so that the notations are consistent.
L112: At least one exponent is zero
Response: We will revise the phrase accordingly.
L124: For Nx \in N number of variables, ...
Response: We will revise the phrase accordingly.
L128: used in data assimilation algorithms.
Response: We will change the preposition accordingly.
L165: What does this mean for "taking over the observation noises and initial ensemble"? Is the expectation computed over different observation noises?
Response: In practice, yes, we will revise the phrase. The expectation is defined theoretically as an expectation with respect to the underlying probability distributions of the observation noise and the initial ensemble. In numerical experiments, this expectation is approximated by averaging over different realizations of observation noise and different independently generated initial ensembles.
Figure 2 and Figure 6: in both x- and y-axis only 10^{-} is shown. caption: black line indicates the order of r^2
Response: We will revise the caption in Figure 2 and Figure 6.
Figure 3, 4: if spin-up time is 0, it is perhaps better to avoid 41 -> 14 in the legend.
Response: Thank you for proposing an improvement. We will avoid the redundant legends as you indicated
In summary, we will (i) clarify the conceptual distinction between time-asymptotic accuracy and time-averaged error, together with the treatment of the observation noise level, (ii) add relevant theoretical references, (iii) extend the numerical experiments to better demonstrate the effect of ensemble downsizing method, and (iv) revise figures and notation for clarity. We believe these revisions address the reviewer’s concern regarding the depth of analysis.
-
AC1: 'Reply on RC1', Kota Takeda, 06 Jan 2026
-
RC2: 'Comment on egusphere-2025-5144', Marc Bocquet, 04 Jan 2026
-
AC2: 'Reply on RC2', Kota Takeda, 14 Jan 2026
We thank the editor and reviewers for their careful reading of our manuscript and their constructive comments. These suggestions have significantly improved the clarity and presentation of the paper.
Below, we provide point-by-point responses to all reviewer comments. Reviewer comments are presented in italics, followed by our responses.
Overall, this is an interesting study. But it seems to lack novelty. The novelties that I could see are (i) the downsizing of the ensemble in the spin-up of the EnKF, (ii) using Eq. (17) to define asymptotic filter accuracy. But in both cases, I believe their introduction should be better justified, and their added value demonstrated. Concerning (i), in my experience whatever divergence the downsizing is meant to prevent can be addressed flawlessly with an adaptive inflation ensemble Kalman filter (EnKF) such as the EnKF-N (see below) in the Lorenz 96 context. Concerning (ii), the large majority of authors on similar topics use the mean root mean square error (RMSE) which is known to also be a measure of asymptotic filter accuracy in the Lorenz 96 context. I am not sure of the added value of the criterion based on Eq. (17) in the Lorenz 96 context. In any quick numerical tests that I performed, one implied the other (i.e., mean RMSE vs Eq. (17)). A configuration where the mean RMSE is proper but would violate your definition of filter accuracy Eq. (17) would be weird, and I was not able exhibit any. Can the authors prove me wrong?
Response: We thank the reviewer for the careful assessment of the manuscript and for clearly suggesting the two potential novelties (i) and (ii) of the current study. We will deeply compare our approaches with existing and standard approaches and clarify their differences through theoretical explanation and numerical evidence. We address these points in detail in our responses to the specific comments below.
Major remarks
The title is referring to a subject that has already been addressed by several other papers in the data assimilation and dynamical systems literature. Instead, the paper should point to its novelty, unless the paper is meant to be a topic overview (see below for a selection of references).
Response: We will revise the title to highlight the novelty of the current study within the existing literature.
Similarly, in the abstract the only sentence that may point to some novelty is Therefore, we propose an ensemble spin-up and downsizing method within data assimilation cycles (see below for a selection of references).
Response: We will revise the abstract to emphasize the novelty.
I believe that you have to make a disclaimer on the use of localisation. Most of the issues addressed in the manuscript are fixed by localisation in high-dimensional geophysical systems. I understand and support why one wants to carry on such an investigation without localisation. But this should quickly be mentioned once, as a disclaimer.
Response: Thank you for your suggestion. We aim to clarify the relationship between the degrees of freedom in dynamical systems and the required ensemble size, independent of localization effects. Incorporating localization is the next step. We will include a disclaimer explaining why we do not use localization.
Since using Eq. (17) as a filter accuracy criterion seems (to me) like the real novelty of the paper, you should better explain why you introduce it, how different it is from the criteria of accuracy used so far in the literature.
Response: We thank the reviewer for pointing this out. We agree that the introduction of Eq. (17) as a definition of filter accuracy is a central aspect of the present study, and we will clarify its motivation and its relation to existing accuracy criteria more explicitly in the revised manuscript. The main reason for introducing Eq. (17) as a filter accuracy criterion is to qualitatively distinguish whether the filter diverges. We will add further explanation of ‘filter accuracy’ in Eq. (17) based on the following two reasons.
(i) More stringent criterion than the mean RMSE: In general, it follows that . Therefore, filter accuracy in the sense of Eq. (17) provides a more stringent criterion than the time-averaged RMSE. For this reason, we adopt this definition in our analysis (see Sect. 1 in Supplement for a comparison of error evaluation criteria).
(ii) Treating observation noise level as asymptotic parameter: In contrast to many numerical studies where the observation noise level is fixed, Eq. (17) treats the noise amplitude r as an asymptotic parameter. This viewpoint is standard in theoretical analyses of filtering methods, including the EnKF (e.g., Kelly et al., 2014; Kelly and Stuart, 2019; Takeda and Sakajo, 2024; Biswas and Branicki, 2024; Sanz-Alonso and Waniorek, 2025). Because Eq. (17) is defined through asymptotic limits in both time and noise amplitude, it is well suited for rigorous mathematical analysis. The current study adopts this formulation with a view toward future theoretical developments.
You have to better explain the added value of the ensemble downsizing method, because spinning-up an ensemble with the Lorenz 96 model rarely poses any problem. Especially if one is using an adaptive inflation technique, such as Bocquet (2011); Bocquet et al. (2015); Raanes et al. (2019).
Response: We thank the reviewer for this comment. We agree that for the Lorenz–96 model with sufficiently large ensembles or with adaptive inflation, ensemble spin-up rarely poses a practical difficulty. However, the current study focuses on a borderline regime, where the ensemble size is close to the minimal dimension required to capture the unstable subspace (). In such cases, increasing the ensemble size is not possible for computational reasons. In this regime, although the ensemble mean may converge quickly, the alignment of ensemble perturbations with the unstable subspace can take a much longer time.
The proposed ensemble spin-up and downsizing method provides a constructive way to generate a small ensemble already aligned with unstable directions, thereby reducing the effective spin-up time in these borderline settings. Adaptive inflation methods mainly control the amplitude of uncertainty, whereas the downsizing approach targets the geometric structure of the ensemble. The two approaches are therefore complementary. We will clarify this point and provide supporting numerical results in the revised manuscript.
There are merits in considering Eq. (17) rather that say the mean RMSE. But it might be highly dependent on the chosen multiplicative scheme. If this the case, how to deal with such dependence on the inflation scheme in this context? This should be discussed.
Response: Thank you for the comment. The primary focus of the current study is to identify a lower bound on the ensemble size required to achieve filter accuracy under favorable conditions, rather than to optimize or analyze the inflation strategy itself. In this context, inflation is treated as a supporting mechanism that maintains sufficient ensemble spread, while the central object of interest is the geometric requirement that the ensemble captures the unstable subspace of the dynamics.
The dependence on inflation tuning and the use of adaptive inflation schemes (such as EnKF-N) are therefore complementary to, but beyond the scope of, the current study. We will clarify this scope and the dependence of inflation more explicitly in the revised manuscript.
Related remarks, suggestions, and typos
1. l.17-18: The degree of instability is quantified by the Lyapunov exponents (LE’s), which are defined as the exponential growth or decay rates of infinitesimal perturbations in the tangent space (Beckman and Ruler, 1985): This is not an accurate definition of the Lyapunov exponents.
Response: We will revise the sentence to provide the correct definition.
2. l.19-20: For continuous-time dynamical systems, such as ordinary differential equations, one of the Le’s is zero, corresponding to perturbations parallel to the vector field.: The existence of a zero Lyapunov exponent comes from the fact that such continuous in time dynamical system is autonomous, i.e. it does not explicitly depends on time (Haken, 1983). It stems from the time-translation invariance of the dynamics. It is critical to emphasise this point since many geophysical systems are not autonomous. Your current statement is wrong in general, and especially for most dynamics in the geosciences.
Response: Thank you for your detailed comment. In the manuscript, we focus on autonomous systems. Therefore, we will rephrase the sentence to specify autonomous dynamical systems only.
3. l.50: Bocquet et al. (2017); Bocquet and Carrassi (2017) are not papers about AUS but about the (ensemble) Kalman filters and smoothers, even though they are connected.
Response: We will rearrange the literature review properly.
4. l.53-55: Theoretical analyses for linear systems in (Bocquet et al., 2017; Bocquet and Carrassi, 2017) suggest that the rank of the ETKF covariance is asymptotically bounded by N+ due to the exponential decay of uncertainty in the stable subspace and the slower decay in the neutral subspace under some conditions.: You seem to downplay the results obtained in Gurumoorthy et al. (2017); Bocquet et al. (2017); Bocquet and Carrassi (2017). In particular, Bocquet et al. (2017) proved mathematically in the linear case that the ensemble should be of size N0 + 1 or greater. They went much farther than just suggestions or conjectures.
Response: We agree that Bocquet et al. (2017) provides a rigorous mathematical result for linear systems, proving that an ensemble size of at least N0 + 1 is required to properly approximate the error covariance. This theoretical result motivates the ensemble downsizing method considered in our study to maintain filter accuracy. We will revise the manuscript to clearly acknowledge this result, together with related numerical evidence.
5. l.62: However, these results do not address the time-asymptotic accuracy of the ETKF : On the contrary, Bocquet et al. (2017); Bocquet and Carrassi (2017); Grudzien et al. (2018a,b) are mainly focused on the time-asymptotic accuracy of the EnKF (and smoother as well). In particular, they use the mean RMSE which is a measure of asymptotic accuracy in an ergodic dynamics context. Bocquet et al. (2017) even demonstrate an analytic formula for the asymptotic (forecast or analysis) error covariance matrix of rank N0 in the linear case. Your statement seems to be wrong. What am I missing?
Response: Thank you for the accurate comment. We agree that our statement was incorrect. In an ergodic setting, the long-time mean RMSE (after an initial transient) is indeed a standard measure of time-asymptotic performance. Moreover, Bocquet et al. (2017) derive an analytic expression for the asymptotic error covariance (of rank N0) in the linear case. We will revise l.62 accordingly. In the revised manuscript, we will also clarify that Eq. (17) is introduced not to contrast “asymptotic vs time-averaged” per se, but for motivations explained above.
6. Hence, we conjecture that the minimum ensemble size for asymptotic accuracy of the ETKF is m* = N+ + 1where only the unstable directions are tracked by the forecast ensemble covariance. That has already been conjectured and in some cases proven, see e.g., Gurumoorthy et al. (2017); Bocquet et al. (2017); Bocquet and Carrassi (2017); Grudzien et al. (2018a,b). Your statement implies that you are the first to make such conjecture.
Response: We will revise the statement with appropriate citations.
7. l.78: Although our target model is the same as that in (Bocquet and Carrassi, 2017), our objective differs in that we focus on asymptotic accuracy and its dependence on the order of the observation noise: Bocquet and Carrassi (2017) also focus on asymptotic accuracy and they also report on the dependence onto the observation noise. So at this stage, I do not see any difference in both their objective and their focus.
Response: We agree that the sentence makes no difference. Our focus is to define the minimum ensemble size based on the dependency of asymptotic accuracy on the order of the observation noise, i.e., Eq. (17). On the other hand, Bocquet and Carrassi (2017) also focus on the dependency of asymptotic accuracy on the ensemble size and the order of the observation noise separately. Hence, their study investigates the minimum ensemble size for a fixed ensemble size.
8. l.93: f should be bold.
Response: We will revise the equation.
9. l.97: Eq.(3) should be Equation (3).
Response: We will revise the sentence accordingly.
10. l.112: . . . regardless of x0 in an invariant subset of RNx : is a verb missing here? I believe that I understand the sentence, but I am not so sure.
Response: We will revise the sentence to clarify the mathematical statement.
11. l.117-118: See (Kuptsov and Parlitz, 2012; Carrassi et al., 2022) for a more comprehensive introduction to LEs and their associated vectors. A very solid reference, especially because it was written by theoreticians of the geofluids, is Legras and Vautard (1996).
Response: Thank you for the valuable information. We will cite the reference appropriately.
12. l.128: . . . used to . . . : used for?
Response: We will revise the preposition.
13. p.6: Please use bold symbol for matrices (as you did for vectors) since this is the widely adopted convention for data assimilation papers, especially with journals connected to the geosciences.
Response: Thank you for the suggestion. We will use bold symbol for matrices in the revised manuscript.
14. Algorithm 2: To be rigorous, you need to tell that the SVD algorithm returns singular values and vectors by decreasing order.
Response: Thank you for the accurate comment. We will revise accordingly.
15. l.195: In this section, we assimilate observations every nobs = 5 integration steps.: This is a bit awkward. Why don’t you just write that the time interval is 0.05 which is the standard value used in such experiments.
Response: Thank you for the comment. We will rewrite the representation accordingly.
16. We set r=10-1 : The very often used value is r=1. Why this specific choice? I tested numerically the EnKF-N with r=1 and m=15, and I found it to achieve filter accuracy as defined by Eq. (17).
Response: Thank you for confirming the experiment by yourself. We will respond to this at comment#20 below.
17. p.11, Fig. 2: You would obtain very similar results for the mean RMSE, questioning the relevance (or added value) of the criterion Eq. (17).
Response: Thank you for the comment. We will respond to this at comment#25 below.
18. This property guarantees that the squared analysis error remains of order r2 : This seems like a tautology.
Response: We will remove the sentence since it provides no additional information.
19. The dashed line indicates the order of r2 : I don’t see the dashed line in the figure.
Response: Thank you for pointing out. The correct word is ‘black line’. We will revise it accordingly.
20. l.12, Figure 3: I used the EnKF-N and have no issues spinning-up the ETKF with r=0.1 and m=15. So what would be the point of Fig. 3? What is the corresponding mean RMSE which is very common in such study? I obtain with the EnKF-N 0.017 for the mean RMSE (one single run, not tuning at all).
Response: We agree that the purpose of Fig. 3 is unclear. Therefore, we will remove it from the revised manuscript.
21. p.10: I am not sure to see the point of the experiments of Fig. 4, i.e with r=10-4 With the EnKF-N, I obtain in the same setup a mean RMSE of 0.46 × 10-4 which is on par with your best run (α=1.5).
Response: The point of Fig. 4 is to show that (i) the RMSE converges to small values without the ensemble spin-up, but, (ii) the convergent speed is very slow with m = N+ + 1. We will add numerical experiments to show that (iii) the convergent speed is very fast with the ensemble spin-up and downsizing to m = N+ + 1 = 14. We will arrange these experiments and figures in the revised manuscript to present the added value of the ensemble spin-up and downsizing method.
22. p.11: What is the point of the experiments with? You could justify them.
Response: This experiment aims to confirm that the proposed approach remains effective even when N+ varies as a result of changes in the external forcing.
23. Figures 2 and 6: some of the y-axis labels are missing.
Response: Thank you for the comment. We will define notations used in labels of Figures 2 and 6 appropriately.
24. l.226: We proposed an ensemble downsizing method for the EnKF to generate an ensemble aligned with the unstable subspace of the dynamics: Fine, but what is the added value of such scheme? Using an adaptive multiplicative inflation EnKF such as the EnKF-N does the job even without tuning the inflation.
Response: As Fig. 4, the convergence speed with m=14 is very slow even with the optimal inflation factor if we do not use the ensemble spin-up and downsizing method. We expect that a similar behavior would be observed for adaptive inflation schemes such as EnKF-N.
25. l.227-228: . . . we verified our conjecture that the minimum ensemble size for asymptotic accuracy is m* = N+ + 1, where is the number of positive LEs: This conjecture has been already formulated and verified in the literature, and in some cases mathematically proven. If you meant that your statement specifically applies to the criterion Eq. (17), you have to be clearer and additionally explained why Eq. (17) would be different from using the mean RMSE as a filter accuracy indicator. I even believe that in the context of the Lorenz 96 model, one implies the other. And the few numerical tests that I performed in support of this review concur.
Response: We agree with the reviewer that the conjecture m* = N+ + 1 has already been formulated and, in some cases, rigorously established in the literature under accuracy criteria based on the mean RMSE or related quantities. We did not intend to claim originality of the conjecture itself.
Instead, our contribution is to revisit this conjecture specifically under the filter accuracy criterion defined by Eq. (17). While Eq. (17) is generally more stringent than criteria based on the mean RMSE our numerical experiments indeed indicate that, for the Lorenz–96 model considered here, both criteria lead to the same minimal ensemble size. We will clarify this point explicitly in the revised manuscript.
The added value of Eq. (17) lies not in changing the numerical outcome for this particular model, but in providing a formulation that treats the observation noise level as an asymptotic parameter, thereby allowing a clear qualitative separation between filter accuracy and filter divergence. This formulation aligns naturally with existing mathematical literature and facilitates future rigorous analysis. We will revise the text to make this scope and positioning clearer and to properly acknowledge prior results.
-
AC2: 'Reply on RC2', Kota Takeda, 14 Jan 2026
Model code and software
KotaTakeda/enkf_ensemble_downsizing Kota Takeda https://doi.org/10.5281/zenodo.17319854
Interactive computing environment
Jupyter Notebook in Binder Kota Takeda https://mybinder.org/v2/gh/KotaTakeda/enkf_ensemble_downsizing/binder-test?urlpath=%2Fdoc%2Ftree%2Ftest.ipynb
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 287 | 56 | 26 | 369 | 63 | 69 |
- HTML: 287
- PDF: 56
- XML: 26
- Total: 369
- BibTeX: 63
- EndNote: 69
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
In this manuscript, two goals are introduced: 1) the manuscript tries to demonstrate that the EnKF can asymptotically converge to a below observation errors as long as the ensemble member is more than the dimension of the unstable subspace. 2) the manuscript proposes an ensemble downsizing approach, which is supposed to accelerate the convergence of the ensemble Kalman filter (EnKF) with smaller ensemble size. However, I find this manuscript can be benefited from a more in-depth analysis, and therefore recommend for at least a major revision.
1. In L61 - 62, the manuscripts claims that it addresses the time-asymptotic accuracy of the ETKF instead of the time-averaged analysis error within a finite-time interval. However, I find it difficult to understand the differences between them. In the referenced literature, the time-average is typically calculated after a spin-up/burn-in time when the filters are converged, which is very similar to time-asymptotic accuracy. In fact, the results in Fig.2 are in general consistent with previous studies, e.g., with an observation error of 1, in Fig. 3 of Bocquet, M.: Ensemble Kalman filtering without the intrinsic need for inflation, Nonlin. Processes Geophys., 18, 735–750, https://doi.org/10.5194/npg-18-735-2011, 2011.
However, Fig.2 is still an interesting figure. The filter accuracy, as defined by Eq.(17), is violated for large observation errors. This accuracy only starts to appear from ensemble size larger than the unstable subspace (m = 14). These results show clear importance of the accuracy definition of Eq. (17) and the observation noises. Could the authors provide references and explanations of Eq.(17)? It would be also of interests to have an explanation for the sensitivity to observation noises, which is absent from the manuscript. One possible explanation might be the observation noises affecting the linearity which is briefly demonstrate in the Appendix of in Marc Bocquet & Alberto Carrassi (2017) Four-dimensional ensemble variational data assimilation and the unstable subspace, Tellus A: Dynamic Meteorology and Oceanography, 69:1, 1304504, DOI: 10.1080/16000870.2017.1304504. If the linearity is the cause, would smaller observation interval allow for larger observation noises? Nevertheless, these results seem to suggest that to get accurate analysis, an ensemble size of 14 requires very stringent condition.
2. The manuscript claims that the ensembled downsizing method can "mitigate the slow convergence of uncertainty". However, this is not clearly shown in available Figures. In the comparison between N_spinup = 0 and N_spinup = 720 in Fig.3, it is not clear that N_spinup=720 converges faster. Moreover, the figure presents m=15 which is different from m=14 as claimed in the conclusion section. However, this figure indeed shows only a smaller inflation factor is needed. However, it is not clear if this effect is related to the unstable-neutral subspace. Is it possible that the initial ensemble size mitigates the sampling errors? I wonder if it would be useful to investigate the alignment of error covariance matrix and Lyapunov vectors over time for the ensemble downsizing method.
3. The motivation of F=16 is not very clear to me. Is it just used to validate the results for F=8?
Minor comments:
L19: "For continuous-time non-autonomous dynamical system, ..., at least one of the LEs is zero". With this condition, this is still not completely correct for certain dissipative systems, I think.
L28: (ETKF, Bishop et al., 2001). If LaTex is used, try \citep[ETKF,][]{ref}. The same follows for cases in L140, L146.
L77, L117, L175: No bracket for the reference.
L106: Consistency for v_j(t) or v_j.
L112: At least one exponent is zero
L124: For Nx \in N number of variables, ...
L128: used in data assimilation algorithms.
L165: What does this mean for "taking over the observation noises and initial ensemble"? Is the expectation computed over different observation noises?
Figure 2 and Figure 6: in both x- and y-axis only 10^{-} is shown. caption: black line indicates the order of r^2
Figure 3, 4: if spin-up time is 0, it is perhaps better to avoid 41 -> 14 in the legend.