Iterative run-time bias corrections in an atmospheric GCM (LMDZ v6.3)
Abstract. Run-time bias corrections of atmospheric circulation models can be based on nudging (newtonian relaxation) to an atmospheric reanalysis. In this case, the time increments of selected state variables are modified by adding the nudging terms obtained with an uncorrected version the model. This is a well-known method to improve the models’ representation of large-scale circulation patterns. In this work, we propose and evaluate a variant of this method, consisting of iterative nudging: the corrected model is itself nudged towards the reanalysis, and the resulting nudging terms are added to the initial ones to calculate the new, iterated correction terms. This procedure can be iterated an arbitrary number of times. Evaluating the LMDZ atmospheric general circulation model (AGCM) for a varying number of iterations of nudging to the ERA5 reanalysis for the period 1981–2000, we show that the simulated large-scale circulation patterns over the period 2001–2020 are consistently improved when the bias correction procedure is iterated compared to the non-iterated original procedure. However, while there is a clear benefit of one or two iterations of the bias correction method, signs of over-correction appear after about three iterations.
This paper considers a class of runtime model corrections that seek to directly improve the climatological annual cycle of selected prognostic variables in an atmospheric climate model by applying cyclostationary corrections to their governing equations. Such state-independent Empirical Runtime Bias Corrections (ERBCs) are distinct from state-dependent runtime model corrections, which aim to improve aspects of the model formulation itself (e.g. Watt-Meyer et al., 2021) rather than the model’s basic state.
The paper focuses specifically on the derivation of the cyclostationary ERBC correction terms. At present, two approaches exist for deriving these bias corrections. The first is the “classical” nudging-based runtime bias correction method, initially derived by Guldberg et al. (2005). The second, referred to as Climatological Adaptive Bias Correction (CABCOR), was introduced more recently by Scinocca and Kharin (2024) and has been shown in direct comparisons to produce significantly larger bias reductions than the classical approach.
The present study investigates whether an iterative application of the classical approach “can lead to a more perfect bias reduction”. In addressing this question, the authors examine the relative roles of in-sample and out-of-sample validation in determining the number of iterations applied to the classical method. The study makes an interesting contribution to the ERBC literature. My recommendation is major revision primarily due to the need to more clearly relate the study’s main conclusions to the existing literature on both the classical and CABCOR approaches. My detailed comments follow.
John Scinocca
Major Points
1. Dependence of results on value of tau in the classical method
In Scinocca and Kharin (2024), hereafter referred to as SK24, a systematic analysis was presented in Section 3 of how the properties of the nudging applied in the N_0 calibration simulations, most notably the value of the nudging timescale tau (and, more generally, the spatial filtering of the nudging tendencies), affect the amount of bias reduction achieved in the corresponding ERBC model simulations (i.e. C_0). The results of that analysis were summarized in Fig. 1 of SK24. For convenience, I reproduce that figure here. Panel a shows the global bias reduction of the prognostic fields corrected in the nudging runs (R-ADAPT in that study or N_0 here), while panel b shows the associated global bias reduction in the ERBC model simulations (R-ERBC in that study or C_0 here) as a function of tau and the spatial filtering applied to the nudging tendencies. As the present study does not employ spatial filtering, the relevant information corresponds to the information along a horizontal line at the top of this figure (i.e. grid-point nudging).
In the absence of spatial filtering, SK24 identified an optimal nudging timescale of approximately tau = 3d (for the N_0 runs) that yields the smallest biases in the corrected ERBC model simulations (C_0). As discussed in the final three paragraphs of Section 3 of SK24, this optimal value lies between weaker nudging, which limits the effectiveness of the bias correction, and stronger nudging, which introduces artifacts associated with the action of unbalanced motions. There, SK24 argued that, for the classical approach, a separation between balanced and unbalanced dynamics cannot be achieved in principle, and can only be approximated in practice through appropriate choices of the temporal (and spatial) properties of the relaxation applied in the nudging runs.
While the specific optimal value of tau will be model dependent, the analysis presented in SK24 suggests that there may be scope to further reduce biases in the C_0 correction runs of the present study through an adjustment of tau toward longer timescales. Given that the authors explicitly consider both the efficiency of the process (e.g. Section 4.5) and the pursuit of “more perfect” bias reductions, it would therefore seem important to identify the optimal value of tau for their configuration, denoted here as tau_opt. This could likely be accomplished with a relatively small number of additional experiments and would provide a useful baseline against which the performance of the iterative approach could be assessed.
Consideration of the role of tau_opt in this study raises several important questions:
a) Does the reduction of biases achieved using the iterative approach with tau = 1 d exceed the reduction obtained in a C_0 simulation using tau_opt?
b) If tau_opt were used instead of tau=1d within the iterative framework, would the approach still yield a meaningful additional improvement? The authors show in Fig. 1 that the effective strength of the nudging decreases with each successive calibration simulation N_i. Reducing tau from 1 d to the optimal value (approximately 3 d) similarly weakens the nudging. This raises the possibility that the iterative approach may, at least in part, represent a more computationally expensive means of approaching the same optimal bias reduction that could be achieved more directly through an appropriate choice of tau in the classical method.
Clarifying these issues would seem essential for assessing the practical utility of the iterative approach proposed in this study.
2. Out-of-sample validation.
An important aspect of the present study is the evaluation of the effectiveness of the ERBC model outside the period used to derive the bias correction. The authors have used this approach to identify the point at which bias reduction in the out-of-sample period ceases and iterations should be stopped – even if the correction continues to reduce biases when validated during the calibration period (e.g. ll 213-216).
If the whole system were stationary, then this out-of-sample validation approach would be appropriate. However, the period 1981-2020 contains some of the strongest historical climate-change forcings. So it is not strictly stationary. For example, in Krinner et al. (2020), idealized experiments were performed to evaluate the efficacy of the ERBC approach under time-evolving climate change forcings. There, it was found that the bias reduction achieved through ERBC evolved with time (e.g. Fig. 1 of that study). In principle, the out-of-sample validation period (2021-2020) has more of a climate-change signal than the calibration period (1981-2000). It is expected, therefore, that there will be some degradation of the impact of the ERBC during the validation period due to the change in external forcings. While the climate-change signal might be small during this period, so too is the degradation of the bias reduction induced by the ERBC after 2-3 iterations.
This is a central aspect of the study’s method. It is not clear how this can be evaluated within the study’s experimental setup. At minimum, the authors should to include a discussion on this point and place caveats on their conclusions about terminating the number of iterations.
3. The relationship of ERBC and Model tuning.
The final step in the development of all climate models is a tuning exercise in which all free model physics parameters are assigned values based on a process which generally minimizes model biases during the historical period. ERBC is typically applied to a finalized model (what is called the free model in this study). Different finalized models have different inherent annual-cycle climatological biases. In this sense, the model physics was tuned to operate optimally in the presence of such biases. When ERBC is applied to the model, say on u, v, and T, it reduces the biases in the finalized model by construction. In principle, this will push the behaviour of physical parameterizations out of optimal performance. In practice, the extent to which this is problematic depends on the magnitude of biases in the finalized model, their overlap with specific physical processes of interest, and the ability of the ERBC to significantly reduce those biases.
There is a large cold bias throughout the troposphere of free (finalized) LMDZ model (Fig. 5), which parameterizations such as convection have been tuned to compensate for. Significantly reducing this bias by applying ERBC to T would throw such compensation out of balance. It is not unexpected then, that the convection in runs of LMDZ with T correction would alter convective activity in a negative way as indicated by the authors (l 177). Other models with less of a temperature bias in their finalized models might not suffer the same degradation in convective behaviour and so benefit from T runtime bias correction.
This model tuning argument would seem to be the better explanation for why some models benefit from T runtime correction while others do not. The authors, however, attempt to argue that there is a conceptual reason for limiting the runtime bias corrections to the dynamics (winds) to avoid interaction with the model physics. Model physics formulations are often developed and validated offline against observed inputs of winds, temperature, and specific humidity. The performance of the parameterizations should not inherently suffer when biases in such inputs are reduced. They suffer because the values of their free parameters were set in the presence of model biases in the finalized model configuration. The authors should offer up this explanation in Section 4.1 and where appropriate throughout the paper.
4. Relevance to SK24
ll 280–281. “While tests of the ‘direct compensation’ method recently proposed by Scinocca and Kharin (2024) yielded unsatisfying results with the LMDZ AGCM, …”. This statement is problematic.
The CABCOR method itself is not model dependent, in the same sense that the classical nudging-based method is not model dependent. Irrespective of the model, they both reduce annual-cycle climatological biases by construction. In SK24, a systematic analysis demonstrated that, for a fixed climate model version, the CABCOR approach consistently produced significantly larger bias reductions in the ERBC model than the classical nudging approach, including in cases where the classical approach employed optimal relaxation parameters. This was shown to be a robust methodological result rather than a model-specific outcome.
Stating that the CABCOR approach yielded “unsatisfying results” relative to the classical approach for a single LMDZ model version, therefore, appears to be at odds with the primary conclusions of SK24. If the authors would like to refute the results of SK24 by asserting that CABCOR performs poorly relative to the classical method, this claim must be substantiated and explained in much greater detail. Alternatively, if the authors do not intend to refute the results of SK24, they should clearly clarify what is meant by “unsatisfying results,” how this assessment was made, and why it is not inconsistent with the findings reported in that study.
Minor Points