the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Imagery classification of stream stage to support ephemeral stream monitoring
Abstract. Intermittent rivers and ephemeral streams (IRES) constitute a large fraction of global river networks, provide important ecosystem services, and are increasing in number with climate change. Yet, observing stage and calculating discharge in IRES can be technologically and methodologically challenging. To address this problem, we develop a method to classify relative stage categories from field camera imagery, creating a time series of categorical flow states without the need for direct stage measurements. Specifically, we employ a Logistic Regression model to classify conditions of no water, low water levels, or high water levels for an ephemeral stream located in the upper Russian River watershed of California (U.S.). We trained our algorithm using hourly field camera images from 2017–2023, and validated the image classifications with 15-minute continuous stage observations. We then used image classifications to perform quality control on the continuous stage time series. Next, we compared the image classifications to publicly accessible modeled discharge from the NOAA National Water Model CONUS Retrospective Dataset. We discuss how in-situ monitoring including field cameras and the classification of field camera imagery, combined with surface meteorology and soil moisture observations, provides detailed hydrologic information important for understanding how climate change affects IRES. Because the image classification approach is transferable to other ephemeral stream sites equipped only with field cameras, this methodology provides a low-cost option for observing relative stage on sparsely-measured IRES that can augment existing hydrologic modeling used by water managers.
- Preprint
(29626 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 07 Oct 2025)
-
RC1: 'Comment on egusphere-2025-2297', Anonymous Referee #1, 25 Aug 2025
reply
I find this article to be generally well-written, well-structured, and applies a transferrable methodology to classify stream conditions in ephemeral streams in a single study watershed. The authors support their claims and provide adequate figures to support their argument.
I did find that some of the discussion sections strayed beyond the scope of the study described in the introduction section to discuss other features of the watershed and ephemeral streams more broadly. The paper would be strengthened by focusing on its central contribution.
I did find that a limitation of the study was that it focused on a subset of images from a single site. The methodology was demonstrated and its performance evaluated against predictions from the National Water Model, but statements about its transferability to other locations or are undercut by the limited nature of the data.
I have minor comments regarding clarity and a few considerations not in the original text but overall find the article a suitable contribution to HESS:
- Page 7, line 155: What defines “environmental damage”? Tampering? Batteries dying?
- Page 9, Lines 173-180: The National Water Model (NWM) is trained/calibrated to gage flows, how close is the closest calibration site? In figure A1 looks like it is on the East Fork of Russian River, so not on the stream you are monitoring. Worth pointing out in this section.
- Figure 5: You need axis labels indicating which axis is predicted and which is observed.
- Page 9, line 197: Indeed, cropping vegetation may be helpful here – if there is a mediterranean climate, vegetation dynamics and streamflow ephemerality are both highly seasonal the model could learn more from the banks (which could make up more of the image) than the channel where intended.
- Page 10, line 204: You only labelled 12.8% of the total images you had available – this is acceptable but is a relatively small dataset for training or reporting performance (your testing set is 3.9% of your total image dataset) that will represent the population. This is a limitation of the study, since as you note the lighting can be very different at different times of day/year. Ideally you have a big enough testing set to represent performance at each class during different lighting (and vegetation and channel) conditions.
- Page 10, line 203-210: Random sampling was used in the selection of images for training/testing, which is acceptable, but this means the performance is only representative of historical conditions coincident with the label dataset. The performance reported in this paper is not representative of model prediction on new unseen imagery. This point is worth noting to make sure the reader knows what the model performance represents.
- Page 12, lines 247-250: Why were these manual weights selected?
- Page 14, line 298: Is there a reference for the 0.028 m3 s-1 threshold for NWM flow? How sensitive are your results to this selection? The selection of the threshold appears arbitrary at the moment.
- Figure 7: Why are there negative stage values? And why are there purple high water dots in panel 7 when stage is reported negative? Is that supposed to be a diagnostic tool for quality assurance of the stage data, which leads to the record in panel b? The paragraph in the main text where Figure 7 is mentioned does not walk the reader through this. Also in Figure 7 are the stage observations without any dots times where there was no imagery or times where the imagery classification was deemed not high confidence? Consider adding shading to indicate “no imagery available” and another color of dot to indicate “no high confidence prediction” or something similar so the absence of data is clear.
- Page 28, line 446-448 and Page 29, line 464-465: Is there a citation or the claim of not having enough imagery to train a CNN? The Gupta et al. and Noto et al. studies you cite have about as much imagery as you do. You report ~4,700 images, which is more than at 2 of Gupta et al. 's sites.
- Section 4.4: This section largely diverges from the central contribution of the study (a methodology to classify images) and into a lot of site-specific information that is largely conjecture about processes and reads as redundant to the prior section (4.3). This section could be eliminated.
- Page 33, line 604: Is there a citation for the claim that “these efforts have struggled to translate to IRES”? Neither study cited included nonperennial streams.
- Section 4.6: This section is only loosely connected with the central contribution of the paper (image classification model) and is material that could be included in the introductory material. This section could be eliminated.
- Conclusion section is missing: It is traditional to summarize the paper’s contribution in a conclusion section, one is missing here.
Citation: https://doi.org/10.5194/egusphere-2025-2297-RC1
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
998 | 32 | 5 | 1,035 | 21 | 25 |
- HTML: 998
- PDF: 32
- XML: 5
- Total: 1,035
- BibTeX: 21
- EndNote: 25
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1