the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Predictive Performances of Machine Learning– and Deep Learning–Based Univariate and Multivariate Reservoir Inflow Predictions in the Chao Phraya River Basin
Abstract. This study demonstrated the predictability of Machine Learning (ML)– and Deep Learning (DL)–based univariate and multivariate predictions of reservoir inflows of Bhumibol (BB) and Sirikit (SK), two major dams in the Chao Phraya River Basin. XGBoost, tree–based ensemble–, and LSTM, deep neural network–based algorithms were selected for development of daily and monthly prediction models. For univariate prediction, the inflows of the BB and SK dams were predicted separately using two individual models. In contrast, for multivariate prediction, a single model was developed to simultaneously predict the inflows of both the BB and SK dams facilitating the integrated decision–making processes. Across all prediction scenarios, ML– and DL–based models demonstrated superior performances in predicting daily reservoir inflows for BB and SK dams compared to monthly predictions, achieving NSE values of 0.86 and 0.77, respectively. Since modeling with LSTM algorithm can effectively handle larger datasets, this enables single multivariate prediction model to predict closer results to those individual univariate models performed by XGBoost and LSTM for BB and SK prediction. XGBoost models mostly outperformed LSTM when tested on the datasets for both daily and monthly univariate predictions. Among all prediction scenarios, underprediction of low reservoir inflows and overprediction of high reservoir inflows by both univariate and multivariate models were consistently existed. Therefore, extracting specific and informative insights from the results of each model type, forecasting horizon, and algorithms used can significantly enhance decision–making support for both real–time reservoir operation and long–term reservoir management planning.
- Preprint
(1157 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 15 Apr 2025)
-
RC1: 'Comment on egusphere-2025-16', Anonymous Referee #1, 24 Mar 2025
reply
This manuscript explores the application of two widely known data-driven algorithms—XGBoost and LSTM—in both univariate and multivariate modes for daily and monthly inflow predictions at two key reservoirs in the Chao Phraya River Basin. The topic is timely and relevant in the context of AI-driven hydrological forecasting. However, the manuscript, in its current form, fails to meet the scientific standards and novelty threshold expected by Hydrology and Earth System Sciences. The work is largely confirmatory, methodologically simplistic, and lacks both theoretical depth and critical interpretation. It represents an incremental application of well-established techniques without significant advancement in methodology, theory, or hydrological insight. Below are my detailed comments:
1. Despite the claim of contributing to reservoir inflow forecasting through multivariate models, the study does not introduce any methodological innovation. The application of XGBoost and LSTM, both extensively used in hydrology, adds no novelty unless combined with a new model architecture, uncertainty treatment, explainability component, or integration with process-based models. The experimental setting is rudimentary, and the results primarily confirm what has already been established in dozens of prior studies. Moreover, the assertion that multivariate prediction of inflows has rarely been studied is not substantiated and contradicts recent literature. The references cited are selective and outdated, omitting more advanced hybrid or physics-informed ML approaches currently under development in the hydrological community.
2. The manuscript fails to clearly define its scientific objectives or hypotheses. The rationale behind comparing univariate and multivariate approaches is weakly stated and not embedded in a theoretical or operational framework. The problem formulation is generic and reads more like a technical report than a scientific investigation.
3. The literature review is overly descriptive and lacks synthesis. It resembles an annotated bibliography rather than a critical narrative. Foundational works on multivariate time series modeling, ensemble learning, recent benchmarks on hybrid models, and the emerging field of physics-informed ML in hydrology are all missing. Furthermore, no discussion is provided on model explainability, uncertainty quantification, or generalization capacity, all of which are central themes in the current hydrological ML research agenda.
4. The methodology exhibits some critical flaws:
- No hyperparameter optimization strategy is described beyond brute-force listing of combinations.
- Feature selection is based solely on Pearson correlation, ignoring non-linear dependencies or mutual information approaches.
- The study does not address overfitting or generalization. Despite LSTM being known for susceptibility to overfitting, no regularization, dropout, or model selection strategy is employed.
- No benchmark model is used for reference, which is standard in HESS-level contributions.
5. The manuscript presents no discussion of data quality, treatment of missing values, stationarity, or outlier detection.
6. The results section is overly descriptive, listing metrics without proper analysis or critical discussion. Additionally, the model performances reported are relatively modest, especially for monthly inflow prediction, yet are uncritically presented as acceptable.
7. The discussion does not provide new hydrological or methodological insight. There is no exploration of why certain models perform better under given conditions, nor any effort to relate findings to hydrological processes. The difference in performance between the two dams, for instance, is acknowledged but not explained.
8. The implications for operational decision-making—often emphasized in the introduction—are not convincingly revisited.
9. The conclusions are largely a restatement of the results, without any critical reflection or forward-looking perspective. The authors do not acknowledge the substantial limitations of their study—particularly the lack of generalization, interpretability, and robustness of the models.
10. The manuscript suffers from structural repetition and verbosity. Some figures (e.g., radar plots) are poorly designed and do not enhance interpretability.
Citation: https://doi.org/10.5194/egusphere-2025-16-RC1
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
129 | 26 | 7 | 162 | 4 | 5 |
- HTML: 129
- PDF: 26
- XML: 7
- Total: 162
- BibTeX: 4
- EndNote: 5
Viewed (geographical distribution)
Country | # | Views | % |
---|---|---|---|
United States of America | 1 | 26 | 18 |
Thailand | 2 | 20 | 14 |
China | 3 | 17 | 12 |
undefined | 4 | 7 | 5 |
Japan | 5 | 7 | 5 |
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
- 26