A hybrid model based on Boruta feature selection and neural network for forecasting hydrological drought
Abstract. Accurate hydrological drought prediction is vital for water management. This study proposes a hybrid model combining Boruta feature selection, convolutional neural network (CNN), and bidirectional long short-term memory (BiLSTM) methods, to predict hydrological drought in the Huaihe River Basin of China. The Boruta algorithm selected key predictors from 31 potential drought-influencing factors. By comparing the established model Boruta-CNN-BiLSTM with other models, including Boruta-CNN-LSTM, Boruta-CNN-XGBoost, Boruta-BiLSTM, Boruta-LSTM, and Boruta-XGBoost, the results show Boruta significantly enhances all models. The Boruta-CNN-BiLSTM model has achieved the highest accuracy across 28 basin grid regions, exhibiting the largest performance gains. Furthermore, the prediction performance of the model is mainly influenced by factors such as precipitation, volumetric soil water (0–7 cm), volumetric soil water (7–28 cm) and surface net solar radiation. The model's prediction performance is most affected by precipitation, followed by volumetric soil water (0–7 cm), volumetric soil water (7–28 cm), and surface net solar radiation has the least impact. It provides enhanced support for basin-scale drought risk assessment and water resources management.
The manuscript presents a relevant and well-structured contribution to hydrological drought prediction by proposing a hybrid Boruta–CNN–BiLSTM framework. The integration of feature selection with deep learning is timely and aligns well with current research trends in hydroinformatics and data-driven modeling.
One of the main strengths of the study is the systematic combination of feature selection (Boruta) and hybrid deep learning architectures, which addresses a common limitation in drought prediction models, the presence of redundant or irrelevant predictors. The use of 31 potential predictors and their reduction through Boruta provides a clear methodological advantage and improves model interpretability
Overall, the manuscript is methodologically sound, clearly organized, and relevant for both scientific and applied drought prediction contexts. I will put some minor comments:
The manuscript is strong and suitable for publication after minor revisions. The suggested comments mainly aim to improve clarity, positioning, and broader impact rather than requiring major methodological changes.