Integrating Physical-Based Xinanjiang Model and Deep Learning for Interpretable Streamflow Simulation: A Multi-Source Data Fusion Approach across Diverse Chinese Basins
Abstract. The simulation of streamflow is a complex task due to its intricate formation process. Existing single models struggle to accurately capture the stochastic, non-stationary, and nonlinear dynamics of basin streamflow in changing environments. This study integrated process-driven hydrological mechanism models with data-driven deep learning models, considering various factors like hydrology, meteorology, environment, and the interconnected effects of upstream and downstream rivers, to form an interpretable hybrid streamflow simulation model. The study collected multiple external variables to better understand the hydrologic system complexity and used the Maximum Information Coefficient (MIC) to analyze their relationship with streamflow. Subsequently, the Xinanjiang (XAJ) model with physical mechanisms was employed, alongside the TCN-GRU model integrating Temporal Convolutional Network (TCN) and Gated Recurrent Unit (GRU), for separate streamflow simulations. Furthermore, a combined method was employed, achieving nonlinear ensemble through Random Forest (RF), resulting in the hybrid XAJ-TCN-GRU model. This model shows promising results in simulating streamflow in four different basins in China, achieving high Nash-Sutcliffe Efficiency (NSE) values with 0.991, 0.971, 0.984, and 0.986 for the Wuding River, Chu River, Jianxi River, and Qingyi River respectively. In terms of streamflow simulation, flood simulating, and interval simulation, this model outperforms other benchmark models. Additionally, the study quantified the contributions of each hydro-meteorological variable to the long-term streamflow trend using mean absolute SHAP values (SHAPABS), Feature Importance (FI), and Permutation Feature Importance (PFI), thereby enhancing the model's external interpretability. The results of this study are of significant importance for optimizing water resource management and mitigating flood disasters.