Preprints
https://doi.org/10.5194/egusphere-2026-1834
https://doi.org/10.5194/egusphere-2026-1834
30 Apr 2026
 | 30 Apr 2026
Status: this preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

Fusing Satellite Embeddings to Improve Streamflow Reconstruction Across River Networks

Haomei Lin, Peirong Lin, Fenghe Zhang, Louise Slater, Yuan Yang, Ming Pan, Qiming Qin, and Aizhong Hou

Abstract. Reconstructing streamflow across river networks is increasingly challenging in the context of heavily modified land surface conditions. Here we present a Data Integration model with Satellite Embeddings (DISE), a reach-scale residual-learning framework that integrates Google Satellite Embeddings (SE; compact learned vector representations of satellite imagery) from the AlphaEarth Foundation Model with a recently developed discharge simulation (GRADES-hydroDL) by learning corrections toward gauge observations. We evaluate DISE at 41 gauging stations in the Yangtze River Basin using leave-one-station-out cross-validation, with embeddings aggregated over each reach’s contributing subcatchment. Simulations incorporating SE consistently outperform the GRADES-hydroDL baseline, with mean aggregation emerging as the most balanced strategy. Improvements are most pronounced for magnitude and bias: compared to GRADES-hydroDL, median KGE increases from 0.485 to 0.594 and median NSE from 0.301 to 0.533, while correlation gains remain modest, suggesting SE primarily help the model capture streamflow volume and variability rather than timing. Control experiments further show that SE enhance spatial generalization beyond both meteorological forcings and traditional hydro-environmental reach attributes (RiverATLAS): compared to the base configuration without spatial context, adding SE alone increases median KGE from 0.473 to 0.594; when SE are further added on top of RiverATLAS, median KGE increases from 0.497 to 0.567. Once SE are included, adding RiverATLAS can even slightly reduce performance. Embedding-driven gains weaken where streamflow is governed by processes not directly visible from surface imagery, particularly complex reservoir operations. Nevertheless, SE can still provide useful information when forcing-based corrections are limited. These results demonstrate that SE provide analysis-ready, information-rich representations of land surface heterogeneity that measurably strengthen streamflow reconstruction across river networks. DISE offers a scalable pathway to inject high-resolution Earth observation context into river-network modeling, improving predictions in basins where conventional forcings and hydro-environmental descriptors are often insufficient.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Haomei Lin, Peirong Lin, Fenghe Zhang, Louise Slater, Yuan Yang, Ming Pan, Qiming Qin, and Aizhong Hou

Status: open (until 11 Jun 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Haomei Lin, Peirong Lin, Fenghe Zhang, Louise Slater, Yuan Yang, Ming Pan, Qiming Qin, and Aizhong Hou
Haomei Lin, Peirong Lin, Fenghe Zhang, Louise Slater, Yuan Yang, Ming Pan, Qiming Qin, and Aizhong Hou
Metrics will be available soon.
Latest update: 30 Apr 2026
Download
Short summary
Accurate streamflow reconstruction is crucial for water resources assessment. Traditional models struggle with land-surface changes and human impacts. This study proposes a residual learning framework that leverages Google’s high-resolution satellite embeddings—compact Earth observation data—to improve hydrologic simulations. Our results show that this AI-driven spatial context enhances river modeling accuracy, offering a scalable solution for data-scarce regions.
Share