the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
A Novel Classifier-Guided Ensemble Framework for Global Terrestrial Evapotranspiration Estimates
Abstract. Evapotranspiration (ET) is a key hydrological and meteorological variable, serving as the critical nexus between water and energy exchanges. However, accurate estimation of global ET remains a challenging task, as process-based ET algorithms are often inadequate to capture the nonlinear relationship among environmental factors, and the application of data-driven ET algorithms is hindered by sparse and uncertain ET observations. In this study, we developed a novel ensemble framework that integrates three existing ET models (process-based algorithm, machine learning-based ET model, and hybrid model), aiming to provide high-precision terrestrial ET estimates. The framework is guided by an additional classifier that can achieve dynamic per-pixel model selection, thus fully utilizing the spatiotemporal dynamics of each model's distinct advantages in mapping global ET and avoiding the typical underestimation of high values by ensemble methods. Comprehensive validation of the model was carried out using in-situ ET observations from the FLUXNET2015 dataset, catchment-scale water balance ET dataset, and six global-scale ET products, including comparisons to individual base models and another Attention-Based ensemble model. The quantitative comparisons across statistical metrics (RMSE, MAE, R2, KGE) indicate that our ensemble model outperforms other evaluated models, especially in extreme samples. Meanwhile, the introduction of classifier can not only significantly enhance the algorithmic robustness and generalizability, but also allow us to gain a basic understanding of the mechanisms behind model selection by interpretability analysis. The study demonstrated the effectiveness of the proposed framework in enhancing ET estimation robustness, thereby providing a valuable reference for the estimation of other similar variables.
- Preprint
(4324 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on egusphere-2025-4782', Anonymous Referee #1, 12 Jan 2026
- AC1: 'Reply on RC1', Weiguang Wang, 12 Feb 2026
-
RC2: 'Comment on egusphere-2025-4782', Anonymous Referee #2, 16 Jan 2026
This study uses a Classifier-Guided Ensemble model to integrate process-based algorithm, ML-based model, and Hybrid model to identify the best model for a given case and subsequently estimate evapotranspiration (ET). The results demonstrate that the selected dominant model outperforms the individual constituent models. And the contribution of input parameters is also analyzed, which identifies the key drivers across all 3 model types. Overall, this study offers a tool to select the better model for a specific location among different models. Here, I have some comments and suggestions for the authors to consider in improving the manuscript, as well as some questions:
- Line 142: The manuscript mentioned “ET as the sum of daytime and nighttime components”, but the calculation methods for these two components are not clearly described. Please provide more details on how daytime and nighttime ET are estimated and combined.
- Lines 163-164: How did you extract the relatively reliable data from global ET products?
- Lines 168-169: The manuscript mentioned “data from other pixels were excluded to reduce uncertainty”, what is the meaning of other pixels? Do you mean the pixels from other finer products but within the same pixel already settled?
- Lines 171-173: You get the dominant model for both time and spatial scales. If for the same spatial, the dominant model varies frequently with time, what is your strategy for forecasting? Do you select the dominant model for a specific location with more better-results?
- Line 233: The variables from ERA5-land product were used for training. Were the input variables for the other five models derived from the same data source? If not, were inter-dataset comparisons conducted? If differences among variables are substantial, the training process may introduce model bias.
- Figure 5: The colors of Classifier-Guided and Hybrid model are very similar in the axis plots. Using more distinct colors would improve the clarity and comparability of model performance.
- Lines 557-558: The sentence appears to be incomplete and should be revised for clarity.
Citation: https://doi.org/10.5194/egusphere-2025-4782-RC2 -
AC2: 'Reply on RC2', Weiguang Wang, 12 Feb 2026
We thank the reviewer for taking the time to review our work and for providing valuable comments to improve the manuscript.
Please refer to the attached PDF for point-by-point responses to comments.
-
RC3: 'Reply on AC2', Anonymous Referee #2, 12 Feb 2026
Thank you for your detailed revisions and responses to my previous comments. I appreciate the effort you have put into improving the manuscript, and overall, I am satisfied with the revisions.
I only have one minor remaining question. For forecasting, there won't be any available measurements to determine the dominant model among the forecasting results from different models. Under this condition, your model should only rely on the previously training data to do the forecast. From your response, you mentioned that a dynamic strategy for choosing the dominant model for a specific location and time will be used for forecasting. I believe you used the strategy for validation, but I am not clear about the strategy. Has this strategy been described in the your manuscript?
Citation: https://doi.org/10.5194/egusphere-2025-4782-RC3 - AC3: 'Reply on RC3', Weiguang Wang, 14 Feb 2026
-
RC3: 'Reply on AC2', Anonymous Referee #2, 12 Feb 2026
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 353 | 181 | 37 | 571 | 130 | 240 |
- HTML: 353
- PDF: 181
- XML: 37
- Total: 571
- BibTeX: 130
- EndNote: 240
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
Overall, I was very impressed with the written manuscript and the scientific rigor of the analysis. The introduction and methods clearly describe the complex ML, ensemble, and processed based ET estimates as well as training and validation analysis. The results showed high accuracy, and the figures were well developed and easily visualized. I have minor comments below:
Line 100: Precipitation was not used as an input covariate. Please provide an explanation for why this was not included.
Line 110: You do a great job outlining the models available with Autogluon. In line 110, you end the list with “etc.” – leading me to believe that there are even more algorithms available. In line 111, can you please list the specific ML algorithms you used?
Line 110: You say “Autogluon can combine them” – can you provide more specifics on this (perhaps just changing the phrasing) – did Autogluon combine them in your research OR autogluon can combine them – but you did not in your research. I think simply stating “Autogluon combined all the algorithms mentioned above” would suffice.
Figure 1: This is a great workflow figure – and helps visualize the process.
Line 164: You say you used 6 well known ET products – can you please cite them after the sentence in line 164? Or perhaps say “refer to section 3.3”
Figure 5: Can you include the sites and land cover types of the three lowest RMSE in Line 350 paragraph. It would provide additional detail than “the majority of land covers”. I think even reference table 3 in the paragraph of Line 350 would be helpful.
Table 2: This seems repetitive and unnecessary. It is unclear how it is different than Figure 4.
The overall importance of the paper is not clear. Are the global ET estimates available for public use – if so, is a link available to the dataset and what is the spatial and temporal resolution? If the estimates are available – I recommend making it clear and provide specific details and explanations about the use of the data.
On the other hand, is the paper simply a methodologic paper meant to explain a novel method for estimating global ET –although you explain the research, it would be hard for another researcher to reproduce for local or global ET estimates. If so, I recommend making it clear that the data are meant to remain proprietary, but the manuscript simply provides novel methodology.