Real-time flood forecasting with Machine Learning using scarce rainfall-runoff data

Defontaine, Théo; Ricci, Sophie; Lapeyre, Corentin J.; Marchandise, Arthur; Le Pape, Etienne

doi:https://doi.org/10.5194/egusphere-2023-2621

Théo Defontaine, Sophie Ricci, Corentin J. Lapeyre, Arthur Marchandise, and Etienne Le Pape

Abstract. Flooding is the most devastating natural hazard that our society must adapt to worldwide, especially as the severity and the occurrence of flood events intensify with climate change. Several initiatives have joined efforts in monitoring and modelling river hydrodynamics, in order to provide Decision Support System services with accurate flood prediction at extended forecast lead times. This work presents how fully data-driven machine learning models predict discharge with better performance and extended lead-time, with respect to the current empirical Lag and Route model used operationally at the local flood forecasting services for the Garonne River in Toulouse. The database is composed of discharge and rainfall data, upstream of Toulouse, for 36 flood events over the past 15 years (40 k data points). This scarce data set is used to train a Linear Regression, a Gradient Boosting Regressor and a MultiLayer Perceptron in order to forecast the discharge in Toulouse at 6-hour and 8-hour lead times. We showed that the machine learning approach outperforms the empirical Lag and Route for 6-hour lead-time. It also provides a reliable solution for extended lead times and saves the implementation of a new empirical Lag and Route model. It was demonstrated that the scarcity and the heterogeneity of the data heavily weigh on the learning strategy and that the layout of the learning and validation sets should be adapted to the presence of outliers. It was also shown that the addition of rainfall data increases the predictive performance of machine learning models, especially for longer lead times. Different strategies for rainfall data preprocessing were investigated. This study concludes that, with the present test case, time-averaged rain information should be favored over instantaneous or time varying data.

Received: 06 Nov 2023 – Discussion started: 31 Jan 2024

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Download & links

Country	#	Views	%
United States of America	1	330	31
China	2	117	11
Germany	3	80	7
France	4	69	6
United Kingdom	5	42	4


Total:	0
HTML:	0
PDF:	0
XML:	0

Real-time flood forecasting with Machine Learning using scarce rainfall-runoff data

Viewed

Viewed (geographical distribution)

Cited

1 citations as recorded by crossref.