Imagery classification of stream stage to support ephemeral stream monitoring

Ogle, Sarah E.; McGurk, Garrett; Jensen, Anahita; Ralph, Fred Martin; Levy, Morgan C.

doi:10.5194/egusphere-2025-2297

Preprints

https://doi.org/10.5194/egusphere-2025-2297

Preprints

20 Aug 2025

| 20 Aug 2025

Imagery classification of stream stage to support ephemeral stream monitoring

Sarah E. Ogle, Garrett McGurk, Anahita Jensen, Fred Martin Ralph, and Morgan C. Levy

Abstract. Intermittent rivers and ephemeral streams (IRES) constitute a large fraction of global river networks, provide important ecosystem services, and are increasing in number with climate change. Yet, observing stage and calculating discharge in IRES can be technologically and methodologically challenging. To address this problem, we develop a method to classify relative stage categories from field camera imagery, creating a time series of categorical flow states without the need for direct stage measurements. Specifically, we employ a Logistic Regression model to classify conditions of no water, low water levels, or high water levels for an ephemeral stream located in the upper Russian River watershed of California (U.S.). We trained our algorithm using hourly field camera images from 2017–2023, and validated the image classifications with 15-minute continuous stage observations. We then used image classifications to perform quality control on the continuous stage time series. Next, we compared the image classifications to publicly accessible modeled discharge from the NOAA National Water Model CONUS Retrospective Dataset. We discuss how in-situ monitoring including field cameras and the classification of field camera imagery, combined with surface meteorology and soil moisture observations, provides detailed hydrologic information important for understanding how climate change affects IRES. Because the image classification approach is transferable to other ephemeral stream sites equipped only with field cameras, this methodology provides a low-cost option for observing relative stage on sparsely-measured IRES that can augment existing hydrologic modeling used by water managers.

Received: 16 May 2025 – Discussion started: 20 Aug 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 29626 KB)

Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
Preprint (29626 KB)

Download & links

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Journal article(s) based on this preprint

06 Feb 2026

Image-based classification of stream stage to support ephemeral stream monitoring

Sarah E. Ogle, Garrett McGurk, Anahita Jensen, Fred Martin Ralph, and Morgan C. Levy

Hydrol. Earth Syst. Sci., 30, 709–742, https://doi.org/10.5194/hess-30-709-2026,https://doi.org/10.5194/hess-30-709-2026, 2026

Short summary

Sarah E. Ogle, Garrett McGurk, Anahita Jensen, Fred Martin Ralph, and Morgan C. Levy

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2025-2297', Anonymous Referee #1, 25 Aug 2025
I find this article to be generally well-written, well-structured, and applies a transferrable methodology to classify stream conditions in ephemeral streams in a single study watershed. The authors support their claims and provide adequate figures to support their argument.
I did find that some of the discussion sections strayed beyond the scope of the study described in the introduction section to discuss other features of the watershed and ephemeral streams more broadly. The paper would be strengthened by focusing on its central contribution.
I did find that a limitation of the study was that it focused on a subset of images from a single site. The methodology was demonstrated and its performance evaluated against predictions from the National Water Model, but statements about its transferability to other locations or are undercut by the limited nature of the data.
I have minor comments regarding clarity and a few considerations not in the original text but overall find the article a suitable contribution to HESS:
Page 7, line 155: What defines “environmental damage”? Tampering? Batteries dying?

Page 9, Lines 173-180: The National Water Model (NWM) is trained/calibrated to gage flows, how close is the closest calibration site? In figure A1 looks like it is on the East Fork of Russian River, so not on the stream you are monitoring. Worth pointing out in this section.

Figure 5: You need axis labels indicating which axis is predicted and which is observed.

Page 9, line 197: Indeed, cropping vegetation may be helpful here – if there is a mediterranean climate, vegetation dynamics and streamflow ephemerality are both highly seasonal the model could learn more from the banks (which could make up more of the image) than the channel where intended.

Page 10, line 204: You only labelled 12.8% of the total images you had available – this is acceptable but is a relatively small dataset for training or reporting performance (your testing set is 3.9% of your total image dataset) that will represent the population. This is a limitation of the study, since as you note the lighting can be very different at different times of day/year. Ideally you have a big enough testing set to represent performance at each class during different lighting (and vegetation and channel) conditions.

Page 10, line 203-210: Random sampling was used in the selection of images for training/testing, which is acceptable, but this means the performance is only representative of historical conditions coincident with the label dataset. The performance reported in this paper is not representative of model prediction on new unseen imagery. This point is worth noting to make sure the reader knows what the model performance represents.

Page 12, lines 247-250: Why were these manual weights selected?

Page 14, line 298: Is there a reference for the 0.028 m³ s^-1threshold for NWM flow? How sensitive are your results to this selection? The selection of the threshold appears arbitrary at the moment.

Figure 7: Why are there negative stage values? And why are there purple high water dots in panel 7 when stage is reported negative? Is that supposed to be a diagnostic tool for quality assurance of the stage data, which leads to the record in panel b? The paragraph in the main text where Figure 7 is mentioned does not walk the reader through this. Also in Figure 7 are the stage observations without any dots times where there was no imagery or times where the imagery classification was deemed not high confidence? Consider adding shading to indicate “no imagery available” and another color of dot to indicate “no high confidence prediction” or something similar so the absence of data is clear.

Page 28, line 446-448 and Page 29, line 464-465: Is there a citation or the claim of not having enough imagery to train a CNN? The Gupta et al. and Noto et al. studies you cite have about as much imagery as you do. You report ~4,700 images, which is more than at 2 of Gupta et al. 's sites.

Section 4.4: This section largely diverges from the central contribution of the study (a methodology to classify images) and into a lot of site-specific information that is largely conjecture about processes and reads as redundant to the prior section (4.3). This section could be eliminated.

Page 33, line 604: Is there a citation for the claim that “these efforts have struggled to translate to IRES”? Neither study cited included nonperennial streams.

Section 4.6: This section is only loosely connected with the central contribution of the paper (image classification model) and is material that could be included in the introductory material. This section could be eliminated.

Conclusion section is missing: It is traditional to summarize the paper’s contribution in a conclusion section, one is missing here.
Citation: https://doi.org/10.5194/egusphere-2025-2297-RC1
- AC1: 'Reply on RC1', Sarah Ogle, 16 Oct 2025
  
  Thank you for your thoughtful review of this article. I have attached our response as a PDF.
  
  Citation: https://doi.org/10.5194/egusphere-2025-2297-AC1
RC2:
'Comment on egusphere-2025-2297', Anonymous Referee #2, 26 Sep 2025

This manuscript presents a timely, and valuable study that addresses a critical challenge in hydrology: monitoring intermittent rivers and ephemeral streams (IRES). The application of a relatively simple logistic regression model to classify flow states from field camera imagery is both pragmatic and innovative. The methodology is clearly described, the results are robust and convincingly presented, and the discussion thoughtfully places the work in the broader context of IRES monitoring, climate change, and water management. The integration of image classifications for quality control of stage data is a particularly strong and practical contribution. The manuscript is generally well-written and structured. I believe it represents a significant contribution to the field and is a strong candidate for publication after revisions. In the end, I've attached a document with some suggestions that may improve the manuscript.

Citation: https://doi.org/10.5194/egusphere-2025-2297-RC2
- AC2: 'Reply on RC2', Sarah Ogle, 16 Oct 2025
  
  Thank you for your thoughtful review of this article. I have attached our response as a PDF.
  
  Citation: https://doi.org/10.5194/egusphere-2025-2297-AC2

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2025-2297', Anonymous Referee #1, 25 Aug 2025
I find this article to be generally well-written, well-structured, and applies a transferrable methodology to classify stream conditions in ephemeral streams in a single study watershed. The authors support their claims and provide adequate figures to support their argument.
I did find that some of the discussion sections strayed beyond the scope of the study described in the introduction section to discuss other features of the watershed and ephemeral streams more broadly. The paper would be strengthened by focusing on its central contribution.
I did find that a limitation of the study was that it focused on a subset of images from a single site. The methodology was demonstrated and its performance evaluated against predictions from the National Water Model, but statements about its transferability to other locations or are undercut by the limited nature of the data.
I have minor comments regarding clarity and a few considerations not in the original text but overall find the article a suitable contribution to HESS:
Page 7, line 155: What defines “environmental damage”? Tampering? Batteries dying?

Page 9, Lines 173-180: The National Water Model (NWM) is trained/calibrated to gage flows, how close is the closest calibration site? In figure A1 looks like it is on the East Fork of Russian River, so not on the stream you are monitoring. Worth pointing out in this section.

Figure 5: You need axis labels indicating which axis is predicted and which is observed.

Page 9, line 197: Indeed, cropping vegetation may be helpful here – if there is a mediterranean climate, vegetation dynamics and streamflow ephemerality are both highly seasonal the model could learn more from the banks (which could make up more of the image) than the channel where intended.

Page 10, line 204: You only labelled 12.8% of the total images you had available – this is acceptable but is a relatively small dataset for training or reporting performance (your testing set is 3.9% of your total image dataset) that will represent the population. This is a limitation of the study, since as you note the lighting can be very different at different times of day/year. Ideally you have a big enough testing set to represent performance at each class during different lighting (and vegetation and channel) conditions.

Page 10, line 203-210: Random sampling was used in the selection of images for training/testing, which is acceptable, but this means the performance is only representative of historical conditions coincident with the label dataset. The performance reported in this paper is not representative of model prediction on new unseen imagery. This point is worth noting to make sure the reader knows what the model performance represents.

Page 12, lines 247-250: Why were these manual weights selected?

Page 14, line 298: Is there a reference for the 0.028 m³ s^-1threshold for NWM flow? How sensitive are your results to this selection? The selection of the threshold appears arbitrary at the moment.

Figure 7: Why are there negative stage values? And why are there purple high water dots in panel 7 when stage is reported negative? Is that supposed to be a diagnostic tool for quality assurance of the stage data, which leads to the record in panel b? The paragraph in the main text where Figure 7 is mentioned does not walk the reader through this. Also in Figure 7 are the stage observations without any dots times where there was no imagery or times where the imagery classification was deemed not high confidence? Consider adding shading to indicate “no imagery available” and another color of dot to indicate “no high confidence prediction” or something similar so the absence of data is clear.

Page 28, line 446-448 and Page 29, line 464-465: Is there a citation or the claim of not having enough imagery to train a CNN? The Gupta et al. and Noto et al. studies you cite have about as much imagery as you do. You report ~4,700 images, which is more than at 2 of Gupta et al. 's sites.

Section 4.4: This section largely diverges from the central contribution of the study (a methodology to classify images) and into a lot of site-specific information that is largely conjecture about processes and reads as redundant to the prior section (4.3). This section could be eliminated.

Page 33, line 604: Is there a citation for the claim that “these efforts have struggled to translate to IRES”? Neither study cited included nonperennial streams.

Section 4.6: This section is only loosely connected with the central contribution of the paper (image classification model) and is material that could be included in the introductory material. This section could be eliminated.

Conclusion section is missing: It is traditional to summarize the paper’s contribution in a conclusion section, one is missing here.
Citation: https://doi.org/10.5194/egusphere-2025-2297-RC1
- AC1: 'Reply on RC1', Sarah Ogle, 16 Oct 2025
  
  Thank you for your thoughtful review of this article. I have attached our response as a PDF.
  
  Citation: https://doi.org/10.5194/egusphere-2025-2297-AC1
RC2:
'Comment on egusphere-2025-2297', Anonymous Referee #2, 26 Sep 2025

This manuscript presents a timely, and valuable study that addresses a critical challenge in hydrology: monitoring intermittent rivers and ephemeral streams (IRES). The application of a relatively simple logistic regression model to classify flow states from field camera imagery is both pragmatic and innovative. The methodology is clearly described, the results are robust and convincingly presented, and the discussion thoughtfully places the work in the broader context of IRES monitoring, climate change, and water management. The integration of image classifications for quality control of stage data is a particularly strong and practical contribution. The manuscript is generally well-written and structured. I believe it represents a significant contribution to the field and is a strong candidate for publication after revisions. In the end, I've attached a document with some suggestions that may improve the manuscript.

Citation: https://doi.org/10.5194/egusphere-2025-2297-RC2
- AC2: 'Reply on RC2', Sarah Ogle, 16 Oct 2025
  
  Thank you for your thoughtful review of this article. I have attached our response as a PDF.
  
  Citation: https://doi.org/10.5194/egusphere-2025-2297-AC2

Peer review completion

AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload

ED: Reconsider after major revisions (further review by editor and referees) (06 Nov 2025) by Alberto Guadagnini

AR by Sarah Ogle on behalf of the Authors (11 Dec 2025) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (15 Dec 2025) by Alberto Guadagnini

RR by Anonymous Referee #1 (19 Dec 2025)

RR by Anonymous Referee #2 (23 Dec 2025)

ED: Publish subject to minor revisions (review by editor) (28 Dec 2025) by Alberto Guadagnini

AR by Sarah Ogle on behalf of the Authors (06 Jan 2026) Author's response Author's tracked changes Manuscript

ED: Publish as is (07 Jan 2026) by Alberto Guadagnini

AR by Sarah Ogle on behalf of the Authors (14 Jan 2026) Author's response Manuscript

Journal article(s) based on this preprint

06 Feb 2026

Image-based classification of stream stage to support ephemeral stream monitoring

Sarah E. Ogle, Garrett McGurk, Anahita Jensen, Fred Martin Ralph, and Morgan C. Levy

Hydrol. Earth Syst. Sci., 30, 709–742, https://doi.org/10.5194/hess-30-709-2026,https://doi.org/10.5194/hess-30-709-2026, 2026

Short summary

Sarah E. Ogle, Garrett McGurk, Anahita Jensen, Fred Martin Ralph, and Morgan C. Levy

Viewed

Total article views: 2,214 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
1,936	246	32	2,214	45	50

HTML: 1,936
PDF: 246
XML: 32
Total: 2,214
BibTeX: 45
EndNote: 50

Views and downloads (calculated since 20 Aug 2025)

Month	HTML	PDF	XML	Total
Aug 2025	570	28	5	603
Sep 2025	1,206	16	3	1,225
Oct 2025	61	42	9	112
Nov 2025	24	22	6	52
Dec 2025	34	50	6	90
Jan 2026	31	73	2	106
Feb 2026	10	14	1	25
Mar 2026	1	0	1
Apr 2026	0

Cumulative views and downloads (calculated since 20 Aug 2025)

Month	HTML	PDF	XML	Total
Aug 2025	570	28	5	603
Sep 2025	1,206	16	3	1,225
Oct 2025	61	42	9	112
Nov 2025	24	22	6	52
Dec 2025	34	50	6	90
Jan 2026	31	73	2	106
Feb 2026	10	14	1	25
Mar 2026	1	0	1
Apr 2026	0

Viewed (geographical distribution)

Total article views: 2,180 (including HTML, PDF, and XML) Thereof 2,180 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 11 Apr 2026

Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Preprint (29626 KB)
Metadata XML

Short summary

Intermittent streams are vital to ecosystems and water supply but are hard to monitor and increasingly affected by climate change. To address this, we used field camera images from 2017–2023 at a stream in northern California to train a machine learning model that classifies streamflow as dry, low, or high. This low-cost method enables monitoring of changing intermittent stream conditions and supports water management in data-scarce regions.


Total:	0
HTML:	0
PDF:	0
XML:	0

Imagery classification of stream stage to support ephemeral stream monitoring

Journal article(s) based on this preprint

Interactive discussion

Interactive discussion

Peer review completion

Suggestions for revision or reasons for rejection

Journal article(s) based on this preprint

Viewed

Viewed (geographical distribution)