Effectiveness of Multivariate Bias Correction in Hydrology and Agriculture: A Systematic Review
Abstract. Climate impact assessments in hydrology and agriculture often rely heavily on outputs from Global Climate Models (GCMs). However, a fundamental scale mismatch exists between the coarse resolution of GCMs and the fine-scale, multivariate data required by impact models. While Multivariate Bias Correction (MBC) methods have emerged as a solution to restore inter-variable dependencies (e.g., the correlation between precipitation and temperature), it remains unclear whether statistical improvements in climate data translate into more accurate impact projections. This study presents a systematic review of 39 peer-reviewed articles to evaluate the added value of MBC across hydrological and agricultural domains.
Our synthesis reveals a critical "validation gap" where superior statistical performance does not consistently yield improved impact simulations. We identify a divergence in added value dictated by the characteristic response time scales of the receiving systems. Agricultural models, which are often sensitive to immediate, daily compounded extremes (e.g., heat stress during low soil moisture), demonstrate a clear benefit from MBC. In contrast, general rainfall-runoff models often function as spatiotemporal integrators, acting as low-pass filters that dampen high-frequency incoherence; consequently, simpler univariate methods frequently perform equally well for bulk streamflow simulation. Furthermore, we highlight the risks of non-stationarity, where methods calibrated to historical correlations may fail under future climate regimes. We conclude that future method development must pivot from purely statistical refinement to more process-aware, regime-dependent frameworks. The ultimate goal is to produce methods capable of addressing non-stationarity and determining when – or if – multivariate correction adds value over simpler univariate approaches.
This manuscript presents a systematic review of multivariate bias correction (MBC) methods for climate-model outputs used in hydrology and agriculture. A main contribution is the identification of a “validation gap”: methods that improve inter-variable dependence in climate data do not always improve downstream impact simulations. The review highlights bias non-stationarity, degradation of temporal autocorrelation, challenges in high-dimensional settings, and the possibility that impact-model results can be influenced by the structure of the impact model itself rather than by the climate correction method alone. The paper highly worth publishing in HESS. I just put some minor revision:
1) Please resolve all numerical inconsistencies in the review flow. The manuscript alternates between 60 vs 63 unique studies and 39 vs 40 included studies. These numbers must match across the text, Figure 2, Table 1, and Section 2.
2) In several places, claims are written too categorically. Phrases such as “clear superiority,” “overwhelmingly positive,” or “100% of relevant studies” should be softened or accompanied by exact denominators and caveats about sample size and study heterogeneity.
3) The manuscript would benefit from one compact table that separates studies by domain, validation type, number of variables corrected, and whether the comparison included a univariate benchmark. That would make the synthesis more transparent.
4) On the hydrology side, it would be helpful to distinguish more explicitly between streamflow simulation, snow/hydroclimate process simulation, and flood/drought hazard applications, since the value of MBC appears to differ across these subdomains.
5) Please check wording around method classes. Some readers may find the distinction between “marginal/dependence,” “all-in-one,” and “successive conditional” too brief; one more sentence on advantages and drawbacks of each class would improve accessibility.
6) There are a few places where the paper appears to conflate review evidence with author interpretation. For example, the “low-pass filter” explanation is good, but should be introduced as a synthesis or conceptual interpretation rather than as a directly demonstrated universal result.
7) Because this is aimed at an interdisciplinary audience, a short glossary of major acronyms such as MBCn, dOTC, MRNBC, MRQNBC, and R2D2 would improve readability, despite Table 3 already helping with this.
8) I do strongly recommend that the authors cite the recent study, “Analysis of historical global warming impacts on climatological trends for the partially gauged Hirmand River Basin based on multiple data products and bias correction methods,” as it is relevant to the manuscript’s discussion of bias-correction performance, trend preservation, and hydroclimatic applications in a data-scarce basin.