Preprints
https://doi.org/10.5194/egusphere-2026-277
https://doi.org/10.5194/egusphere-2026-277
07 Apr 2026
 | 07 Apr 2026
Status: this preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

UK-Flow15-QC: A quality control framework for better river flow data in hydrological research

Felipe Fileni, Hayley J. Fowler, Elizabeth Lewis, Fiona McLay, Gemma Coxon, David Archer, Emma Bruce, Longzhi Yang, Matt Fry, Hollie Cooper, and Ollie Swain

Abstract. The significant increase in computing power over the past 70 years has progressively enabled the use of extensive datasets for hydrological modelling. The colossal scale of these datasets, i.e., over one million timesteps per station for a 30-year record at 15-min resolution, makes implementing effective quality control (QC) particularly challenging. In this study, we present a national-scale, open-source quality-control framework tailored for the UK’s 15-minute river flow dataset, UK-Flow15, which is described in Part 1 of this paper series. The framework combines manual visual inspection of anomalies with automated detection of statistical artefacts, incorporating both established and novel procedures. In particular, we introduce methods to evaluate high-flow events by comparing them with rainfall records and flow observations from neighbouring catchments. Application of the framework within a UK dataset reveals that while many stations maintain generally reliable records, over 20 % exhibit visually identifiable issues such as truncations, discontinuities, or missing data. Automated checks indicate that most (78 %) stations contain at least isolated segments of suspicious behaviour. Our high-flow event validation procedures confirm most peak flows, but also flag a small proportion of events as potentially spurious due to a lack of consistency with nearby flow (10.5 %) or rainfall (14.5 %) support. We further demonstrate that data quality has a measurable impact on hydrological modelling, with catchments containing flagged anomalies producing the least reliable simulations in terms of NSE and High-Flow Bias. By making flagged data and metadata openly accessible, the framework enables users to make informed decisions about data suitability. This work highlights the critical importance of rigorous QC in sub-daily hydrology and provides a scalable tool to support the development of more reliable, high-resolution hydrological data.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Felipe Fileni, Hayley J. Fowler, Elizabeth Lewis, Fiona McLay, Gemma Coxon, David Archer, Emma Bruce, Longzhi Yang, Matt Fry, Hollie Cooper, and Ollie Swain

Status: open (until 19 May 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Felipe Fileni, Hayley J. Fowler, Elizabeth Lewis, Fiona McLay, Gemma Coxon, David Archer, Emma Bruce, Longzhi Yang, Matt Fry, Hollie Cooper, and Ollie Swain

Model code and software

UK-Flow15-QC F. Fileni https://github.com/felipef93/UK-Flow15-QC

Felipe Fileni, Hayley J. Fowler, Elizabeth Lewis, Fiona McLay, Gemma Coxon, David Archer, Emma Bruce, Longzhi Yang, Matt Fry, Hollie Cooper, and Ollie Swain
Metrics will be available soon.
Latest update: 07 Apr 2026
Download
Short summary
This study develops and applies a national framework to identify and flag issues in UK river flow data by combining systematic visual checks with automated tests. We demonstrate the importance of visual inspection, refine and sensitivity-test traditional quality-control methods, and develop new high-flow checks tailored to high-resolution data. We then quantify the frequency of these issues and demonstrate that, if overlooked, they can impact scientific results and lead to misleading conclusions
Share