Preprints
https://doi.org/10.5194/egusphere-2023-656
https://doi.org/10.5194/egusphere-2023-656
19 Apr 2023
 | 19 Apr 2023

A Random Forest approach to quality-checking automatic snow-depth sensor measurements

Giulia Blandini, Francesco Avanzi, Simone Gabellani, Denise Ponziani, Hervé Stevenin, Sara Ratto, Luca Ferraris, and Alberto Viglione

Abstract. State-of-the-art snow sensing technologies currently provide an unprecedented amount of data from both remote sensing satellites and ground sensors, but their assimilation into dynamic models is bounded to data quality, which is often low − especially in mountain, high-elevation, and unattended regions where snow is the predominant land-cover feature. To maximize the value of snow-depth measurements, we developed a Random Forest classifier to automatize the quality assurance/quality control (QA/QC) procedure of near-surface snow depth measurements collected through ultrasonic sensors, with particular reference to differentiate snow cover from grass or bare ground data and to detecting random errors (e.g., spikes). The model was trained and validated using a split-sample approach of an already manually classified dataset of 18 years of data from 43 sensors in Aosta Valley (north-western Italian Alps), and then further validated using 3 years of data from 27 stations across the rest of Italy (with no further training or tuning). The F1 score was used as scoring metric, being it the most suited to describe the performances of a model in case of a multi-class imbalanced classification problem. The model proved to be both robust and reliable in the classification of snow cover vs. grass/bare ground in Aosta Valley (F1 values above 90 %), yet less reliable in rare random-error detection, mostly due to the dataset imbalance (samples distribution: 46.46 % snow, 49.21 % grass/bare ground, 4.34 % error). No clear correlation with snow-season climatology was found in the training dataset, which further suggests robustness of our approach. The application across the rest of Italy yielded F1 scores on the order of 90 % for snow and grass/bare ground, thus confirming results from the testing region and corroborating model robustness and reliability, with again a less skillful classification of random errors (values below 5 %). This machine learning algorithm of data quality assessment will provide more reliable snow ground data, enhancing their use in snow models.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Journal article(s) based on this preprint

15 Dec 2023
A random forest approach to quality-checking automatic snow-depth sensor measurements
Giulia Blandini, Francesco Avanzi, Simone Gabellani, Denise Ponziani, Hervé Stevenin, Sara Ratto, Luca Ferraris, and Alberto Viglione
The Cryosphere, 17, 5317–5333, https://doi.org/10.5194/tc-17-5317-2023,https://doi.org/10.5194/tc-17-5317-2023, 2023
Short summary
Giulia Blandini, Francesco Avanzi, Simone Gabellani, Denise Ponziani, Hervé Stevenin, Sara Ratto, Luca Ferraris, and Alberto Viglione

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2023-656', Anonymous Referee #1, 13 Jun 2023
    • AC1: 'Reply on RC1', Giulia Blandini, 22 Sep 2023
  • RC2: 'Comment on egusphere-2023-656', Anonymous Referee #2, 04 Sep 2023
    • AC2: 'Reply on RC2', Giulia Blandini, 22 Sep 2023

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2023-656', Anonymous Referee #1, 13 Jun 2023
    • AC1: 'Reply on RC1', Giulia Blandini, 22 Sep 2023
  • RC2: 'Comment on egusphere-2023-656', Anonymous Referee #2, 04 Sep 2023
    • AC2: 'Reply on RC2', Giulia Blandini, 22 Sep 2023

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
ED: Publish subject to minor revisions (review by editor) (07 Oct 2023) by Guillaume Chambon
AR by Giulia Blandini on behalf of the Authors (18 Oct 2023)  Author's response   Author's tracked changes   Manuscript 
ED: Publish as is (01 Nov 2023) by Guillaume Chambon
AR by Giulia Blandini on behalf of the Authors (03 Nov 2023)

Journal article(s) based on this preprint

15 Dec 2023
A random forest approach to quality-checking automatic snow-depth sensor measurements
Giulia Blandini, Francesco Avanzi, Simone Gabellani, Denise Ponziani, Hervé Stevenin, Sara Ratto, Luca Ferraris, and Alberto Viglione
The Cryosphere, 17, 5317–5333, https://doi.org/10.5194/tc-17-5317-2023,https://doi.org/10.5194/tc-17-5317-2023, 2023
Short summary
Giulia Blandini, Francesco Avanzi, Simone Gabellani, Denise Ponziani, Hervé Stevenin, Sara Ratto, Luca Ferraris, and Alberto Viglione
Giulia Blandini, Francesco Avanzi, Simone Gabellani, Denise Ponziani, Hervé Stevenin, Sara Ratto, Luca Ferraris, and Alberto Viglione

Viewed

Total article views: 594 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
434 140 20 594 13 10
  • HTML: 434
  • PDF: 140
  • XML: 20
  • Total: 594
  • BibTeX: 13
  • EndNote: 10
Views and downloads (calculated since 19 Apr 2023)
Cumulative views and downloads (calculated since 19 Apr 2023)

Viewed (geographical distribution)

Total article views: 598 (including HTML, PDF, and XML) Thereof 598 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 18 Sep 2024
Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Short summary
Automatic snow depth data are a valuable source of information for hydrologists, but they also tend to be noisy. To maximize the value of these measurements for real-world applications, we developed an automatic procedure to differentiate snow cover from grass or bare ground data, as well as to detect random errors. This procedure can enhance snow ground data quality , thus providing more reliable data for snow models.