Preprints
https://doi.org/10.5194/egusphere-2025-3539
https://doi.org/10.5194/egusphere-2025-3539
08 Sep 2025
 | 08 Sep 2025

Validation Strategies for Deep Learning-Based Groundwater Level Time Series Prediction Using Exogenous Meteorological Input Features

Fabienne Doll, Tanja Liesch, Maria Wetzel, Stefan Kunz, and Stefan Broda

Abstract. Due to the growing reliance on machine learning (ML) approaches for predicting groundwater levels (GWL), it is important to examine the methods used for performance estimation. A suitable performance estimation method provides the most accurate estimate of the accuracy the model would archive on completely unseen test data to provide a solid basis for model selection decisions. This paper investigates the suitability of different performance evaluation strategies, namely blocked cross-validation (bl-CV), repeated out-of-sample validation (repOOS), and out-of-sample validation (OOS), for evaluating one-dimensional convolutional neural network (1D-CNN) models for predicting groundwater level (GWL) using exogenous meteorological input data. Unlike previous comparative studies, which mainly focused on autoregressive models, this work uses a non-autoregressive approach based on exogenous meteorological input features without incorporating past groundwater levels for groundwater level prediction. A dataset of 100 GWL time series was used to evaluate the performance of the different validation methods. The study concludes that bl-CV provides the most representative performance estimates of actual model performance compared to the other two performance evaluation methods examined. The most commonly used OOS validation yielded the most uncertain performance estimate in this study. The results underscore the importance of carefully selecting a performance estimation strategy to ensure that model comparisons and adjustments are made on a reliable basis.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share

Journal article(s) based on this preprint

07 Apr 2026
Validation strategies for deep learning-based groundwater level time series prediction using exogenous meteorological input features
Fabienne Doll, Tanja Liesch, Maria Wetzel, Stefan Kunz, and Stefan Broda
Geosci. Model Dev., 19, 2657–2675, https://doi.org/10.5194/gmd-19-2657-2026,https://doi.org/10.5194/gmd-19-2657-2026, 2026
Short summary
Fabienne Doll, Tanja Liesch, Maria Wetzel, Stefan Kunz, and Stefan Broda

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2025-3539', Anonymous Referee #1, 27 Sep 2025
    • AC1: 'Reply on RC1', Fabienne Doll, 18 Nov 2025
  • RC2: 'Comment on egusphere-2025-3539', Anonymous Referee #2, 04 Oct 2025
    • AC2: 'Reply on RC2', Fabienne Doll, 18 Nov 2025
  • RC3: 'Comment on egusphere-2025-3539', Anonymous Referee #3, 06 Oct 2025
    • AC3: 'Reply on RC3', Fabienne Doll, 18 Nov 2025

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2025-3539', Anonymous Referee #1, 27 Sep 2025
    • AC1: 'Reply on RC1', Fabienne Doll, 18 Nov 2025
  • RC2: 'Comment on egusphere-2025-3539', Anonymous Referee #2, 04 Oct 2025
    • AC2: 'Reply on RC2', Fabienne Doll, 18 Nov 2025
  • RC3: 'Comment on egusphere-2025-3539', Anonymous Referee #3, 06 Oct 2025
    • AC3: 'Reply on RC3', Fabienne Doll, 18 Nov 2025

Peer review completion

AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
AR by Fabienne Doll on behalf of the Authors (16 Dec 2025)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (30 Dec 2025) by Dan Lu
RR by Anonymous Referee #3 (11 Jan 2026)
ED: Publish as is (27 Jan 2026) by Dan Lu
AR by Fabienne Doll on behalf of the Authors (10 Feb 2026)  Manuscript 

Post-review adjustments

AA – Author's adjustment | EA – Editor approval
AA by Fabienne Doll on behalf of the Authors (10 Feb 2026)   Author's adjustment   Manuscript
EA: Adjustments approved (10 Feb 2026) by Dan Lu

Journal article(s) based on this preprint

07 Apr 2026
Validation strategies for deep learning-based groundwater level time series prediction using exogenous meteorological input features
Fabienne Doll, Tanja Liesch, Maria Wetzel, Stefan Kunz, and Stefan Broda
Geosci. Model Dev., 19, 2657–2675, https://doi.org/10.5194/gmd-19-2657-2026,https://doi.org/10.5194/gmd-19-2657-2026, 2026
Short summary
Fabienne Doll, Tanja Liesch, Maria Wetzel, Stefan Kunz, and Stefan Broda
Fabienne Doll, Tanja Liesch, Maria Wetzel, Stefan Kunz, and Stefan Broda

Viewed

Total article views: 1,991 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
1,661 289 41 1,991 39 38
  • HTML: 1,661
  • PDF: 289
  • XML: 41
  • Total: 1,991
  • BibTeX: 39
  • EndNote: 38
Views and downloads (calculated since 08 Sep 2025)
Cumulative views and downloads (calculated since 08 Sep 2025)

Viewed (geographical distribution)

Total article views: 1,899 (including HTML, PDF, and XML) Thereof 1,899 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 11 Apr 2026
Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Short summary
With the growing use of machine learning for groundwater level (GWL) prediction, proper performance estimation is crucial. This study compares three validation strategies—blocked cross-validation (bl-CV), repeated out-of-sample (repOOS), and out-of-sample (OOS)—for 1D-CNN models using meteorological inputs. Results show that bl-CV offers the most reliable performance estimates, while OOS is the most uncertain, highlighting the need for careful method selection.
Share