Preprints
https://doi.org/10.5194/egusphere-2026-1527
https://doi.org/10.5194/egusphere-2026-1527
24 Apr 2026
 | 24 Apr 2026
Status: this preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

A Basin-Aware Global Framework for Computationally Efficient Surface Water Inundation Prediction

Arik M. Tashie, Isaac D. Gerg, Evan Koester, Carlos D. Hoyos, Eduardo Galindo, and David J. Farnham

Abstract. Predicting surface water inundation at regional to global scales presents a fundamental tension: bespoke local models achieve high accuracy but require proprietary data and are difficult to scale, while globally trained systems offer broad coverage but demand substantial computational infrastructure and may lack flexibility for regional customization. We present the Basin-Aware Global Inundation Modeling framework (BAGIM), which addresses this gap by combining globally available, freely accessible datasets with basin-scale calibration to capture regional hydrological specificity. We evaluate six model architectures across eight geographically diverse basins to test three hypotheses: (1) that hydrologically meaningful feature engineering is more impactful than architectural complexity, (2) that basin-scale training mitigates regional biases in global datasets, and (3) that basin-aware models can generalize to extreme events beyond the training distribution. Our experiments demonstrate that tree-based ensembles (XGBoost, Random Forest) consistently outperform more complex deep learning architectures, achieving median F1 scores of approximately 0.5 against OPERA DSWx-S1 reference data, performance that approaches the inherent uncertainty ceiling imposed by disagreement among remote sensing products themselves in settings with small, shallow, and intermittent water bodies. We find that features commonly assumed essential for operational flood forecasting (i.e., coincident river-basin streamflow, Height Above Nearest Drainage, and elevation) are neither sufficient nor strictly necessary for reliable prediction, with well-engineered meteorological and terrain features achieving comparable performance without explicit streamflow inputs. This challenges a core assumption underlying many current operational flood forecasting systems. Cross-basin transfer experiments reveal limited transferability, reinforcing the importance of basin-aware calibration. Further, models trained exclusively on non-extreme events produce directionally correct predictions for out-of-sample extremes, though with conservative bias (higher precision, lower recall). We suggest that a design philosophy prioritizing feature engineering and regional calibration over architectural complexity enables accessible deployment without sacrificing predictive skill.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Arik M. Tashie, Isaac D. Gerg, Evan Koester, Carlos D. Hoyos, Eduardo Galindo, and David J. Farnham

Status: open (until 05 Jun 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Arik M. Tashie, Isaac D. Gerg, Evan Koester, Carlos D. Hoyos, Eduardo Galindo, and David J. Farnham
Arik M. Tashie, Isaac D. Gerg, Evan Koester, Carlos D. Hoyos, Eduardo Galindo, and David J. Farnham

Viewed

Total article views: 71 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
56 12 3 71 1 2
  • HTML: 56
  • PDF: 12
  • XML: 3
  • Total: 71
  • BibTeX: 1
  • EndNote: 2
Views and downloads (calculated since 24 Apr 2026)
Cumulative views and downloads (calculated since 24 Apr 2026)
Latest update: 25 Apr 2026
Download
Short summary
Surface water extent reflects interactions of meteorology, terrain, and land cover that vary regionally. We present a basin-aware framework using open global data to predict inundation at 30-m resolution. Evaluating six architectures across eight diverse basins, we find that hydrologically meaningful feature engineering outweighs model complexity, tree-based ensembles match or exceed deep learning without GPU infrastructure, and basin-scale calibration mitigates regional biases in global data.
Share