Preprints
https://doi.org/10.5194/egusphere-2023-1872
https://doi.org/10.5194/egusphere-2023-1872
31 Aug 2023
 | 31 Aug 2023
Status: this preprint is open for discussion.

Potential of Machine learning techniques compared to MIKE-SHE model for drain flow predictions in tile-drained agricultural areas of Denmark

Hafsa Mahmood, Ty P. A. Ferré, Raphael J. M. Schneider, Simon Stisen, Rasmus R. Frederiksen, and Anders V. Christiansen

Abstract. Temporal drain flow dynamics and understanding of their underlying controlling factors are important for water resource management in tile-drained agricultural areas. The use of physics-based water flow models to understand tile drained systems is common. These models are complex, with large parameter sets and require high computational effort. The primary goal of this study was to examine whether simpler, more efficient machine learning (ML) models can provide acceptable solutions.

The specific aim of our study was to assess the potential of ML tools for predicting drain flow time series in multiple catchments subject to a range of climatic and landscape conditions. The investigation is based on unique data containing time series of daily drain flow in multiple field scale drain sites in Denmark. The data include: climate (precipitation, potential evapotranspiration, temperature); geological properties (clay fraction, first sand layer thickness, first clay layer thickness); and topographical indexes (curvature, Topographical wetness indexes, Topographical position index, elevation). Both static and dynamic variables are used in the prediction of drain flows. The ML algorithm extreme gradient boosting (XGBoost) and convolutional neural network (CNN) were examined, and the results were compared with a physics-based distributed model (MIKE-SHE).

The results show that XGBoost performs similarly to the physics-based MIKE-SHE models, and both outperform CNN. Both ML models required significantly less effort to build, train, and run than MIKE-SHE. In addition, the ML models support efficient feature importance analysis. This showed that climatic variables were important for CNN models and XGBoost. The results support the use of ML models for hydrologic applications with sufficient data for training. Further, the insights offered by the feature importance analysis may support further data collection and developments of physics-based models when existing data are insufficient to support ML approaches.

Hafsa Mahmood et al.

Status: open (until 07 Nov 2023)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse

Hafsa Mahmood et al.

Hafsa Mahmood et al.

Viewed

Total article views: 195 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
141 47 7 195 5 5
  • HTML: 141
  • PDF: 47
  • XML: 7
  • Total: 195
  • BibTeX: 5
  • EndNote: 5
Views and downloads (calculated since 31 Aug 2023)
Cumulative views and downloads (calculated since 31 Aug 2023)

Viewed (geographical distribution)

Total article views: 175 (including HTML, PDF, and XML) Thereof 175 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 03 Oct 2023
Download
Short summary
Temporal drain flow dynamics and understanding of their underlying controlling factors are important for water resource management in tile-drained agricultural areas. This study examine whether simpler, more efficient machine learning (ML) models can provide acceptable solutions compared to traditional physics based models. We predicted drain flow time series in multiple catchments subject to a range of climatic and landscape conditions.