the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
River intermittency: mapping and upscaling of water occurrence using unmanned aerial vehicle, Random Forest and remote sensing landscape attributes
Abstract. Although intermittent rivers exist naturally, global changes have a direct influence on streamflow permanence. Measurements and modelling in temporary rivers are still scarce and yet, essential for prediction and understanding of scarcity scenarios. Thus, this work aims to map and model the spatio-temporal dynamics of an intermittent river. The study area is the Umbuzeiro River in the Brazilian Semiarid (∼100 km), whose spatially coherent streamflow occurs exclusively in the wettest months during the rainy season. We conducted twelve UAV surveys between March and November 2022 in selected river reaches. With the imagery from UAV surveys, we classified river reaches into "Wet", "Transition", "Dry" or "Not Determined" with visual inspection of 1.0 m reaches. In order to explain the observed patterns, we analysed 40 candidate predictors based on static and dynamic landscape attributes. Among these, altitude, drainage area, distance from dams, and dynamic predictors proved to be most informative in Random Forest models. We selected three Random Forest models based on the different dynamic predictors. The models differ in the source and type of dynamic predictor used to capture the temporal dynamics: (a) series of Sentinel MNDWI; (b) series of Planetscope NDVI; and (c) accumulated precipitation (30 days). All model variants successfully mimicked river intermittency with an accuracy of around 80 % for both test and training. Models (a) and (b) captured the temporal dynamics in model extrapolation to the whole river. When analysing the spatial distribution of intermittency, models (a) and (c) better identified areas more prone to "Wet" or "Transition" classes. This way, model (a) was identified as the most successful in simulating intermittency both temporally and spatially. The use of Sentinel MNDWI in model (a) aggregates enough spatial information, so the model can better simulate water occurrence classes. The findings presented here emphasize the possibility of using this index even in narrow temporary rivers. The results provide insight into the hydrological diversity of semi-arid rivers and are, therefore, important to understand their role in water availability.
- Preprint
(38962 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on egusphere-2025-884', Anonymous Referee #1, 30 Jul 2025
-
AC1: 'Reply on RC1', Nazaré Suziane Soares, 26 Sep 2025
For easier reference, please refer to the attached color PDF. In the PDF, the reviewer’s comments are in black, and our responses are in blue. On the website, our answers are displayed in bold paragraphs, or in italics when providing a direct transcription of the manuscript.
Reviewer’s comment:General Information
The study investigates spatial and temporal dynamics of river intermittency in the Umbuzeiro River (Brazilian semiarid region) using a combination of UAV surveys, remote sensing, and Random Forest (RF) modeling. The authors assess water presence classes: “Wet”, “Transition”, “Dry”, and “Not Determined”, across different times and river segments. They evaluate three RF model variants based on different dynamic predictors: Sentinel MNDWI, Planetscope NDVI, and 30-day accumulated precipitation.This is a well-structured and technically rigorous study that addresses a critical gap in intermittent river monitoring in semi-arid environments. It provides an important proof of concept for combining UAV-based mapping and machine learning for ecohydrological research. The results are promising and provide a methodological blueprint for future upscaling efforts, though there is room for refinement in automation, validation, and generalization.
Despite the interest in the topic, I believe some changes in the paper are needed before it can be considered ready for publication, listed in the specific comments.
Response: We thank the reviewer for their thoughtful and constructive feedback. We are pleased that the reviewer recognized the relevance of our study and the potential contribution of our methodology to ecohydrological research in semi-arid intermittent rivers. We appreciate the positive assessment of our approach, particularly the integration of UAV-based mapping and machine learning, as well as the acknowledgment of its promise for future upscaling efforts.
We have carefully considered the reviewer’s comments and have made the suggested revisions to improve the manuscript, particularly in relation to automation, validation, and generalization, as detailed in our responses to the specific comments below.
Specific Comments
- LL121-123:"The "Not Determined" class included those reaches where it was not possible to discern”. It may be useful to evaluate the integration of soil moisture indices or evapotranspiration estimates as additional dynamic predictors.
We appreciate the reviewer’s suggestion to explore additional predictors such as soil moisture and evapotranspiration, which are indeed relevant variables for understanding flow intermittency and ecohydrological dynamics in dryland river systems. We agree that these parameters have the potential to enhance predictive models and contribute to a more comprehensive understanding of the hydrological processes at play.
However, in the context of our study, such data are not available at the spatial and temporal resolution required for integration into our current framework. In semi-arid and data-scarce regions like our study area, in situ measurements of soil moisture and evapotranspiration are typically unavailable, and reliance on remotely sensed or modeled estimates—while valuable at broader scales—can introduce additional uncertainty, particularly when applied at fine spatial scales.
We emphasize the value of fine-scale hydrological monitoring in data-scarce semi-arid environments, such as the AEB (Aiuaba Experimental Basin), where long-term field-based studies have provided critical insights into local runoff generation processes (Figueiredo et al., 2016) and soil moisture dynamics (Costa et al., 2013). These efforts demonstrate that small-scale, in situ monitoring can reveal ecohydrological processes and spatial patterns that are not captured by coarse-resolution remote sensing products.
Given these limitations, our methodological choice was to prioritize field-based and directly observable predictors, such as vegetation cover and geomorphological attributes, which could be reliably mapped and validated at high resolution. We believe that this approach reduces uncertainty and strengthens the applicability of the model in small-scale, heterogeneous environments. We agree that future studies could benefit from incorporating additional hydrological variables as data availability improves, we added a brief discussion of this point in section 4.6 Work limitations of the revised manuscript.
- L143: "These indices are are summarized in Table 1" .The two verbs reported need to be corrected.
Thank you for pointing this out. The duplicated verb has been corrected in the revised manuscript. The sentence now reads: “These indices are summarized in Table 1.”
- L189-192: "Damming structures are mapped all along the Umbuzeiro River by using Planetscope images so as to visually locate each dam." The river path and dam structures are mapped manually using high-resolution imagery and field data. In particular, the geolocation of the dams was also performed manually, which limits scalability and reproducibility for larger or other basins. Could automated surveying using satellite data be considered?
We appreciate the reviewer’s observation regarding the scalability and reproducibility of manually mapped features. In our study, the mapping of small dams was performed manually based on high-resolution UAV imagery and the authors' prior field knowledge. While a full validation campaign was beyond the scope of this work, a subset of the mapped structures—approximately 10%—was verified through field visits and drone imagery, providing some level of confirmation of their existence and location.
We acknowledge, however, that manual mapping can be a limiting factor for broader applications. To address this concern, we note that our method is flexible and can be applied using publicly available global datasets. In particular, the Global Surface Water Explorer (Pekel et al., 2016) offers valuable information on water occurrence and seasonality and can serve as a proxy for identifying surface water bodies and small reservoirs in other regions. Variables derived from this dataset were already incorporated into our model as predictors, which supports the potential for scaling the approach beyond our study area.
A brief discussion was added to section 4.6 Work limitations of the revised manuscript highlighting this methodological flexibility and the use of global datasets.
- L257: "4.1 Observed water intermittency: UAV imagery". From the text, it appears that the UAV-based classification is visual and not verified with in-situ hydrological measurements or ground truth sampling. The lack of objective thresholds could introduce bias in the classification of “Transition” and “Wet” classes. Probably longer UAV survey campaigns, covering more years, could better capture interannual variability.
We thank the reviewer for this insightful comment. The UAV-based classification was indeed based on visual interpretation, supported by field observations and local knowledge of the study area. Although no streamflow measurements or water level sensors were available, presence/absence of ponded water in specific river reaches was verified during field visits conducted during the UAV survey.
We acknowledge that the lack of objective thresholds—such as fixed depth criteria—introduces some degree of subjectivity, especially in distinguishing between “Wet” and “Transition” conditions. Additionally, we agree that longer-term UAV monitoring campaigns, capturing multiple hydrological years, would enhance the ability to detect interannual variability and improve the robustness of the classification.
These aspects will be acknowledged as limitations in section 4.6 Work limitations of the revised manuscript, along with a brief reflection on the trade-offs involved in using visual interpretation in data-scarce environments.
- LL326-328: "Temporal variations along the year can also be observed in comparison to monthly precipitation." What is the period used to calibrate the model to identify the seasonality of events? The model using 30-day precipitation (Model c) underperforms in capturing seasonal transitions. This approach limits the model’s ability to generalize across varying wet/dry years.
We thank the reviewer for this important observation. Model c was trained using data from five UAV campaigns conducted over a single hydrological year, encompassing a range of seasonal conditions—from wetter months with ponded water to dry season and the rewetting period in November. The model used precipitation accumulated over the 30 days prior to each UAV campaign as a proxy for short-term hydrological conditions.
While this variable helped capture intra-annual variability and presented good accuracy for testing datasets, we agree that the use of a single-year time series and an accumulation window limit the model’s ability to fully represent seasonal transitions and interannual variability. This is particularly relevant in semi-arid environments, where the timing and intensity of rainfall events vary significantly from year to year. The limited performance of Model c in capturing transitional classes may reflect these constraints.
We acknowledge this as a limitation of our current dataset, and a brief note was added to section 4.6 Work limitations of the revised manuscript suggesting that future work incorporating multi-year datasets and other precipitation accumulation periods may improve the model’s generalization to other hydrological years.
- LL351-353: "Model (a) is even more specific in this respect and indicates mainly areas in the lowest part of the basin. The identification of areas prone to wetter conditions is very important even in the smallest of scales because they can be key areas for river ecology, for instance". Although model (a) achieved the best results, the use of Sentinel MNDWI data may not be generalizable to narrower or canopy-covered watercourses, especially in forested catchments.
We thank the reviewer for raising this important point. We agree that the use of MNDWI derived from Sentinel-2 imagery has known limitations when applied to narrow channels or watercourses under dense canopy cover, where spectral signals from water may be obstructed or diluted by surrounding vegetation.
Additionally, we note that areas with dense canopy cover, where the riverbed is obscured and water detection is not possible, were classified as “Not Determined” in our approach. This category itself may indicate zones of persistent canopy cover, which can be related to local hydrological conditions, such as groundwater influence or permanent pools beneath vegetation. Thus, even in forested stretches, the vegetation structure can provide indirect information about wetness patterns.
Nonetheless, we recognize that in more heavily forested or topographically complex catchments, alternative data sources or higher-resolution sensors may be needed to accurately identify surface water. We included a brief note in section 4.6 Work limitations of the revised manuscript acknowledging this limitation and suggesting caution when applying this approach to other regions with different land cover characteristics.
- L378: "for spectral indexes based on UAV data". It's better to use indeces. "Indexes" is commonly used to refer to alphabetical lists in books, for example. In contrast, "indeces" is used in more technical, scientific, and mathematical contexts.
Thank you for this observation. We agree with the reviewer’s suggestion and have replaced “indexes” with “indices” in the revised manuscript to better align with scientific terminology.
In response to the reviewer’s suggestions, we added the following text to the section 4.6 Work Limitations to the revised version of the manuscript:
“Although high-resolution UAV imagery enabled detailed visual classification of flow permanence classes, the method relied on expert interpretation without direct hydrological measurements such as streamflow or water level data. This introduces potential subjectivity, particularly in distinguishing between “Wet” and “Transition” conditions. Field observations were conducted during the UAV campaigns and helped inform classification, but were limited to qualitative assessments of ponded water, as no surface flow was observed during the study year.”
“We based our analysis on data from a single hydrological year. While this approach allowed for the capture of short-term hydroclimatic variability, it limits the model’s capacity to generalize across years with different rainfall patterns. Future studies including multi-year data and alternative temporal windows could help address this limitation.”
“Additionally, we acknowledge the potential value of integrating other dynamic hydrological variables, such as soil moisture and evapotranspiration, into predictive models of flow intermittency. These variables are relevant for ecohydrological modeling, particularly in dryland environments. However, in the context of this study, such data were unavailable at the spatial and temporal resolutions required to support fine-scale modeling.”
“As for our choice of predictors, we recognize that spatial resolution and ease of access vary greatly among them. The selected predictors are consistent among models; but great importance is given to dam identification. Both “distance from” and “distance to” dams ranked as highly important in model performance, and they already represent half the number of static predictors. In our study area, we identify many different types of damming structures —ranging from small rural weirs to larger reservoirs— future studies could benefit from classifying them into distinct functional or structural categories.” (This paragraph was already present, but it was rewritten)
“The manual mapping of small dams enabled a more realistic representation of water retention in the basin. However, dam mapping may also limit applicability of our model to other regions as it requires considerable manual effort and, potentially, familiarity with the study area. That said, the methodology is adaptable: similar analyses can be replicated using global datasets such as the Global Surface Water Explorer (Pekel et al., 2016), which provides historical water occurrence based on Landsat imagery.”
"Finally, although Sentinel-2 data used in Model a yielded strong results in our study area, the performance of its spectral indices—particularly MNDWI—may decline in more heavily forested or topographically complex catchments. In these cases, alternative data sources or higher-resolution sensors may be needed to accurately identify surface water. However, it may be that dense vegetation can also serve as an indirect indicator of groundwater presence or surface wetness, and these segments were conservatively labeled as “Not Determined” in our classification.”
- L394: "Conclusion". Probably the conclusion could be more detailed and not a simple summary of the study done; they could report some details shown in the graphs and tables.
We thank the reviewer for this suggestion. We agree that the conclusion section can benefit from a more detailed discussion that goes beyond summarizing the study. In the revised manuscript, we will expand the conclusion to include references to key findings illustrated in figures and tables, as well as highlight the broader implications of the results for ecohydrological monitoring and the scalability of the proposed method. The revised Conclusion section incorporates the following insights:
“The "Wet" and "Dry" classes follow the rainy season dynamics, and the longer wet patches are present in the most downstream section. The "Transition" class is very heterogeneous because it represents areas with mixed information: such as wet/dry patches with algae and sparce vegetation. During the rainy season, the vegetation has full and dense canopies, that is why the "Not Determined" class, i.e. the river reaches where we cannot see the riverbed from the UAV-imagery, can be found more frequently during the rainy season and in narrower river stretches. This feature represents a major source of uncertainty, limiting the available data acquired through optical remote sensing.”
“The modelling framework developed in this study contributes to a broader understanding of flow intermittency as a spatially complex and highly dynamic process over time. The integration of high-resolution predictors, especially related to dam presence, landscape attributes and satellite indices, offers a scalable and adaptable approach for mapping wetness conditions in other dryland river systems. These insights are particularly relevant in the context of increasing climate variability and water stress, as they point to key landscape features that can be targeted for monitoring or management. Our results demonstrate that even in the absence of extensive hydrometric data, meaningful patterns can be derived from the careful integration of remote sensing and field-based observations.”
References for works used in this response:
Costa, C. A. G., Lopes, J. W. B., Pinheiro, E. A. R., Araújo, J. C. de, & Gomes Filho, R. R. (2013). Spatial behaviour of soil moisture in the root zone of the Caatinga biome. Revista Ciência Agronômica, 44(4), 685–694.De Figueiredo, J. V., de Araújo, J. C., Medeiros, P. H. A., & Costa, A. C. (2016). Runoff initiation in a preserved semiarid Caatinga small watershed, Northeastern Brazil. Hydrological Processes, 30(13), 2390-2400.
Pekel, J. F., Cottam, A., Gorelick, N., & Belward, A. S. (2016). High-resolution mapping of global surface water and its long-term changes. Nature, 540(7633), 418-422.
- LL121-123:"The "Not Determined" class included those reaches where it was not possible to discern”. It may be useful to evaluate the integration of soil moisture indices or evapotranspiration estimates as additional dynamic predictors.
-
AC1: 'Reply on RC1', Nazaré Suziane Soares, 26 Sep 2025
-
RC2: 'Comment on egusphere-2025-884', Anonymous Referee #2, 27 Aug 2025
This study addresses a critical gap in intermittent river monitoring in the Umbuzeiro River, Brazil, which I consider significant with implications for other arid areas and for anticipating future climate-change impacts. The integration of UAV surveys, Random Forest classification, and remote sensing–based landscape attributes is a valuable approach. However, the manuscript in its current form suffers from presentation and clarity issues that obscure the scientific contributions. Substantial revision is needed to improve communication, sharpen the articulation of the scientific questions more process-oriented interpretation of results in the context of Earth surface processes.
Major Comments
- Scientific framing
- While the methodological framework is carefully outlined, the manuscript must articulate the scientific question more strongly. The introduction hints at this, but the discussion largely focuses on model outcomes without interpretation in terms of physical or hydrological processes. For example: “distance from the last dam is the most important predictor” does not require a complex Random Forest model to identify. Similarly, Section 4.2 notes that elevation and drainage area are important predictors, but these are already well established. This journal is not a primarily GIS/remote sensing journal and so I encourage the authors to discuss What is the added value of this modeling exercise? Does it allow optimization of resolution? What unexpected findings emerged? What are the implications for other arid environments or climate-driven changes in intermittency?
- Presentation quality
- The manuscript follows the standard structure, but the communication of ideas and concepts requires improvement. At present, several sections read like figure captions rather than explanatory text (e.g., line 219). Statements should be fine-tuned for accuracy and consistency (e.g., lines 109–111).
- Section 3.1 Modelling workflow requires rewriting: the repeated use of the generic term “data” is too vague. Please specify clearly whether you are referring to UAV-derived classifications or predictor variables.
- Figures require improvement. For example, continuous variables such as monthly precipitation should be plotted as interpolated lines with points not bar plots. Figure captions must be complete and self-contained.
- Methods
- Although all equations are listed, the conceptual meaning of the metrics (e.g., balanced accuracy) is not unpacked, remember your audience. Please provide intuitive descriptions and, if possible, a conceptual diagram.
- Please clarify why Table 2 lists 25 predictors, but Fig. 8 refers to 40.
- Results and interpretation
- Some results are not fully explained. Why does Model (c) predict >75% dry in March, peak wet season, but not in November, second wet season? Why is Model (a), with 5-day frequency predictors, described as the most successful in simulating intermittency when it has the coarsest temporal resolution?
- Please interpret results in physical terms, not only in terms of model metrics.
- References and context
- The references are strongly weighted toward recent Brazilian studies. Please include some of the classic papers on ephemeral and intermittent streams.
- The paper should also acknowledge interannual variability in flow connectivity, which is critical for interpreting results beyond a single year.
Minor Comments
Lines 109–111: Be consistent in describing rainy season timing. April–July are not “the first months of the year.”
Line 134–135: Do you need to direct readers to the Sentinel user guide?
Lines 135 & 161: Clarify the role of Google Earth Engine (GEE) for Sentinel and MapBiomas.
Line 140: Specify what was done in R.
Line 165: Where in Figure 1 is this shown?
Line 172: Please provide information on data access (can it be downloaded and where?).
Line 203: Suggested rewording: “containing the type of variable…”
Line 207: Cite Table 2 after “candidate predictors.”
Line 219: Reword to “The specific predictors employed in each model are summarized in Table 2.”
Line 360: Section 4.6 Add acknowledgment of interannual variability and how the intermittency might vary
Line 419: Avoid anthropomorphizing landscapes; specify which aspects of river metabolism are relevant.
Figures
- Fig. 1: Add discharge/precipitation panel to establish arid context, I know precipitation data is presented in Figure 9
- Fig. 2: Show/explain how exactly the UAV surveys are used
- Fig. 3: Quantify misfit between flow accumulation (DEM) and visual mapping. Adjust blue line color for visibility.
- Fig. 4: Add discharge/precipitation to support text (lines 109–110).
- Fig. 5: Missing panel label (a). Clarify “1.0 m reaches,” as 1 m seems implausibly small.
- Fig. 7: Clarify why parameters with high Mean Decrease Accuracy were selected.
- Fig. 8: Please clarify why Table 2 lists 25 predictors, but Fig. 8 refers to 40.
- Fig. 9: Explain Model (c) behavior, see Major comments.
- Fig. 10: Clarify how “High” and “Low” were calculated. Can values be normalized and expressed as percentages?
Tables
- Table 2: Organize predictors by model (a–c), consistent with Figs. 7, 9, and 10. Replace “constant” as that is not a frequency
- Table 3: Define balanced accuracy conceptually, I know the equation is given, and present Train/Test values in a clearer layout.
Data & codes
Please include the data and codes to comply with ESurf's FAIR policies
Citation: https://doi.org/10.5194/egusphere-2025-884-RC2 -
AC2: 'Reply on RC2', Nazaré Suziane Soares, 26 Sep 2025
For easier reference, please refer to the attached color PDF. In the PDF, the reviewer’s comments are in black, and our responses are in blue. On the website, our answers are displayed in bold paragraphs, or in italics when providing a direct transcription of the manuscript. We encountered difficulties inserting images directly into the comments, so all revised graphs are included in the attached PDF.
Reviewer’s comment: This study addresses a critical gap in intermittent river monitoring in the Umbuzeiro River, Brazil, which I consider significant with implications for other arid areas and for anticipating future climate-change impacts. The integration of UAV surveys, Random Forest classification, and remote sensing–based landscape attributes is a valuable approach. However, the manuscript in its current form suffers from presentation and clarity issues that obscure the scientific contributions. Substantial revision is needed to improve communication, sharpen the articulation of the scientific questions more process-oriented interpretation of results in the context of Earth surface processes.
Response: We sincerely thank the reviewer for the positive evaluation of our study and for highlighting its relevance to dryland hydrology and ecohydrological monitoring. We also appreciate the constructive suggestions aimed at improving the manuscript’s clarity and scientific framing. We acknowledge that aspects of the presentation—particularly the articulation of our research questions and the interpretation of our results in light of geomorphological and hydrological processes—can be improved. In the revised manuscript, we:
- Refine the introduction to clearly articulate the research questions and the knowledge gap addressed;
- Improve the flow and clarity of the text, with particular attention to the explanation of methods and model interpretation;
- Strengthen the discussion by linking results more explicitly to hydrological and landscape processes, such as flow regulation by dams and topographic accumulation.
These revisions will help to better communicate the scientific contributions of the study and its applicability to broader contexts, including climate-change-related shifts in river regimes. We thank the reviewer again for the thoughtful and helpful feedback.
Major Comments
- Scientific framing
• While the methodological framework is carefully outlined, the manuscript must articulate the scientific question more strongly. The introduction hints at this, but the discussion largely focuses on model outcomes without interpretation in terms of physical or hydrological processes. For example: “distance from the last dam is the most important predictor” does not require a complex Random Forest model to identify. Similarly, Section 4.2 notes that elevation and drainage area are important predictors, but these are already well established. This journal is not a primarily GIS/remote sensing journal and so I encourage the authors to discuss What is the added value of this modeling exercise? Does it allow optimization of resolution? What unexpected findings emerged? What are the implications for other arid environments or climate-driven changes in intermittency?We thank the reviewer for this thoughtful and constructive comment. We agree that the manuscript would benefit from a clearer articulation of the scientific question and a stronger framing of the results in terms of hydrological and geomorphological processes. In the revised version, we revisit the introduction to explicitly define the knowledge gap we aim to address — namely, the limited understanding of fine-scale spatial patterns of intermittency in intermittent rivers, and the role of multiple landscape controls in shaping these patterns. An excerpt added to the Introduction section is presented below:
“This study addresses a critical knowledge gap on spatial patterns of intermittency in a river. We investigate how to effectively map these patterns by incorporating field observations and landscape attributes, along with other variables, into the modeling process. Although the general role of topographic and climatic drivers is well established, little is known about their interaction with human modifications, such as farmer dams, to influence the presence of water on river reaches. Furthermore, methodological approaches capable of integrating multiple data sources (e.g., UAV, landscape attributes, and machine learning) are still limited, particularly in intermittent rivers where wet patches are hard to map.”
“Here, we explore how different environmental and anthropogenic variables contribute to the occurrence of water in intermittent reaches. When we combine field-based classifications with Random Forest modelling, we investigate not only the prediction accuracy of different data sources, but also the relative importance of physical attributes and land use drivers on river wetness patterns.”While predictors such as elevation or drainage area are indeed well established in hydrological modeling, our goal was to explore how these and other variables (e.g., dam proximity, vegetation cover) interact in a high-resolution spatial context, and how machine learning methods like Random Forest can help quantify the relative importance of such predictors across heterogeneous, data-scarce environments. The added value of the model lies in its ability to integrate multi-source data (UAV, satellite, field observations), operate at a fine spatial resolution, and be used as a framework for extrapolation in ungauged systems.
In the discussion section, we also expand the interpretation of our findings with a focus on physical processes — for instance, the role of upstream impoundments in controlling downstream drying patterns, or how topographic accumulation may reflect water retention. Although some key predictors were expected, others offer insights into local regulation mechanisms that might be overlooked in coarser-scale analyses. We also reflect on the potential implications of our approach for studying climate-driven shifts in river intermittency in other basins. To address the reviewer’s comment, we will expand the Discussion section with the following excerpts:
4.2 Identification of important predictors
“Model performance patterns align with known physical drivers of river intermittency. For example, river reaches with small contributing areas were more frequently classified as dry. Conversely, regions with bigger contributing tend to sustain surface water for longer periods.”
“Although predictors related to landscape attributes are well-established in hydrology, they gain new value here through their interaction with other spatial variables at finer resolution. For example, areas with high accumulation but located downstream of dams often remain dry, suggesting that topography alone does not explain surface wetness in human-modified catchments.”
“This consistent selection of “distance to the next dam” and “distance from the last dam” as top predictors highlights the strong influence of anthropogenic flow regulation on surface water distribution. While the relevance of dams in altering flow regimes is widely acknowledged, our findings provide a fine-scale perspective on how even small impoundments can generate abrupt shifts in flow permanence along the network. This is particularly relevant in semi-arid regions, where it is common to use small-scale storage and retention of surface water. However, as river networks become more intermittent due to climate-driven changes, this can be a reality in all climates.”
“Water accumulation in intermittent river seems to be governed by processes captured by landscape attributes (including human-made interferences) and satellite indices. The consistent selection of these predictors across models reinforces them as controls and indicators to water retention.”The revised Conclusion section incorporate the following insights:
“The modelling framework developed in this study contributes to a broader understanding of flow intermittency as a spatially complex and highly dynamic process over time. The integration of high-resolution predictors, especially related to dam presence, landscape attributes and satellite indices, offers a scalable and adaptable approach for mapping wetness conditions in other dryland river systems. These insights are particularly relevant in the context of increasing climate variability and water stress, as they point to key landscape features that can be targeted for monitoring or management. Our results demonstrate that even in the absence of extensive hydrometric data, meaningful patterns can be derived from the careful integration of remote and field-based observations.” - Presentation quality
• The manuscript follows the standard structure, but the communication of ideas and concepts requires improvement. At present, several sections read like figure captions rather than explanatory text (e.g., line 219). Statements should be fine-tuned for accuracy and consistency (e.g., lines 109–111).We thank the reviewer for this helpful observation. We agree that certain parts of the manuscript required revisions to improve clarity and narrative flow. In response, we made the following changes:
Line 219:
Original: "The rightmost column in Table 2 shows which predictor is employed in each model."
Revised: “The specific predictors employed in each model are summarized in Table 2."Lines 109–111:
Original: "We perform UAV surveys every month during the rainy season (Apr–Jul), and then again in November of 2022 (Fig. 4). Since the rainy season usually occurs during the first months of the year (Soares et al., 2024), satellite images are normally not useful during this period due high cloudiness."
Revised: "We perform UAV surveys monthly during the rainy season (Apr to Jul), and once more in November 2022 (Fig. 4). As noted by Soares et al. (2024), the rainy season typically occurs in the first half of the year in this region. The high frequency of cloud cover during the rainy season typically limits the availability of usable satellite imagery, reinforcing the need for UAV-based data collection during this period."In addition to the specific revisions mentioned above, we carefully reviewed the manuscript to improve overall clarity and ensure that all sections communicate ideas effectively, avoiding figure-caption-like phrasing and enhancing consistency throughout.
• Section 3.1 Modelling workflow requires rewriting: the repeated use of the generic term “data” is too vague. Please specify clearly whether you are referring to UAV-derived classifications or predictor variables.We appreciate the reviewer’s observation. In response, we carefully revised Section 3.1 to avoid using the term “data” in a generic way. We now clearly specify the type of data being referenced throughout the section — distinguishing between UAV-derived classifications and predictor variables. The section now reads:
“Workflow follows the flowchart shown in Fig. 2. There are three main steps: (1) UAV data collection and mapping, (2) model training, and (3) application to the whole river. Step 1 consists of the river course visual mapping. Then, we conduct UAV surveys multiple times in river reaches, producing high-resolution orthomosaics. We use the images to classify 1 meter reaches in water occurrence classes, which are used as ground-truth data for model training and evaluation. In step 2, input data from predictor variables is used during the training of Random Forest classification models in order to obtain the same class as in the observed data (UAV-derived classification). Training includes the use of static and dynamic landscape attributes as candidate predictors to explain the observed patterns of water occurrence. In a recursive process, we compute the importance of the candidate predictors (or features). We test three models based on predictor subgroups and select the most important predictors in each model The Random Forest models are retrained with the selected predictors in order to obtain the best models. In step 3, the models are applied to the entire river.”
• Figures require improvement. For example, continuous variables such as monthly precipitation should be plotted as interpolated lines with points not bar plots. Figure captions must be complete and self-contained.
We thank the reviewer for the suggestion. While we understand the recommendation to use line plots for continuous variables, we chose to represent monthly precipitation using bar plots because it is an aggregated variable over discrete time intervals (months), rather than a continuously measured variable. This is a common approach in hydrology and climatology to visually emphasize the magnitude of rainfall in each month.
That said, we acknowledge the importance of clarity and completeness in figure presentation. We will review all figure captions to ensure they are self-contained and provide sufficient information for interpretation without referring back to the main text.
- Methods
• Although all equations are listed, the conceptual meaning of the metrics (e.g., balanced accuracy) is not unpacked, remember your audience. Please provide intuitive descriptions and, if possible, a conceptual diagram.Thank you for this valuable comment. We agree that the inclusion of intuitive descriptions enhances the accessibility of the manuscript, particularly for readers who may be less familiar with performance metrics. We have now added brief conceptual explanations for each metric presented — including overall accuracy, balanced accuracy, precision, sensitivity and specificity — highlighting their relevance in the context of imbalanced classes:
“To evaluate performance per class, a one-vs-rest strategy was adopted: each water occurrence class was treated as the positive class in turn, with all remaining classes grouped as negative. This approach allows for consistent metric calculation across imbalanced classes. A conceptual diagram is presented in Fig. 6.”
Figure included in the PDF
“Overall accuracy (Eq. 1) reflects the proportion of correctly classified instances over the entire dataset and is useful for assessing general performance. However, it may be biased in datasets with imbalanced class distributions. To address this, we also report balanced accuracy (Eq. 4), which equally weights performance across classes by averaging sensitivity (true positive rate) and specificity (true negative rate). This metric provides a more reliable measure of performance when class sizes differ significantly. Together, these metrics allow us to evaluate how well each model discriminates among the different water occurrence classes, considering both general correctness and class-specific sensitivity.”
Now Figure 6. Confusion matrix used to compute classification performance metrics. Each color represents each of the possible combinations between predicted and observed classes: TP = true positives, TN = true negatives, FP = false positives, FN = false negatives. In this example, the “positive” class is “Wet”. This process was repeated for each class.As can be observed, we included a conceptual diagram (now Figure 6) that visually illustrates how the main metrics are derived from the confusion matrix. We also maintain the equations in the text to help the reader understand the calculation process.
• Please clarify why Table 2 lists 25 predictors, but Fig. 8 refers to 40.
Thank you for pointing out this important distinction. Some of the variables presented in Table 2 refer to predictor groups or data sources, each of which may yield multiple individual variables used in the modeling. For example, the land use/land cover predictors is derived from two classification schemes — one with 8 classes and another with 7 — and for each scheme, we compute the class frequency within the surrounding area as separate predictor variables.
As a result, although Table 2 presents 25 predictors, the full set expands to 40 individual predictor variables, which are shown in now Fig. 8. We have clarified this point in the manuscript to avoid confusion:
“While 25 predictors are individually listed, two of them correspond to land use and land cover (LULC) classifications, which include 8 and 7 classes respectively. The frequency of each class is treated as a separate predictor, resulting in a total of 40 variables used in the models.”
- Results and interpretation
• Some results are not fully explained. Why does Model (c) predict >75% dry in March, peak wet season, but not in November, second wet season? Why is Model (a), with 5-day frequency predictors, described as the most successful in simulating intermittency when it has the coarsest temporal resolution?
• Please interpret results in physical terms, not only in terms of model metrics.Thank you for this important comment. We revised the discussion to incorporate physical interpretations of model results beyond statistical performance.
Specifically, we now relate model outputs to hydrological and ecological processes that govern intermittency. For example, we highlight how predictors such as vegetation indices and topographic convergence correspond to physical controls on water retention. We also discuss how the timing of rainfall may explain temporal discrepancies in model predictions — especially between March and November.
The unexpectedly high proportion of predicted dry conditions in March — typically part of the peak wet season — appears to reflect limitations in the predictor variables used, particularly those derived from simpler datasets (e.g. Model (c) with accumulated precipitation (30 days) as the dynamic variable). While March is climatologically wetter, short-term hydrological conditions (e.g., rainfall deficits or delays in runoff response) can vary substantially. In contrast, the November period, although later in the year, coincided with isolated rainfall events that triggered surface water reappearance — something that may be better captured by the rainfall used in Model (c).
As for Model (a), although it uses predictor variables with a 5-day revisit frequency, it consistently performed better in simulating intermittent dynamics. This is likely because the satellite image captures better the momentary condition, such as recent water presence or vegetation response, which are more stable and informative for distinguishing intermittent patterns than precipitation-only predictors.
We clarified these interpretations in the discussion section to better reflect the physical reasoning behind the model behaviors. Excerpts added to the discussion section are presented below:
4.4 Umbuzeiro River: temporal extrapolation with models
“Although March is part of the rainy season, the pattern of drying predicted by Model (c) may reflect lagged hydrological responses not captured by the accumulated precipitation for that month. In contrast, the prediction for November coincided with isolated rainfall events that triggered surface water reappearance — something that may be better captured by the accumulated rainfall.”
“Since Model (c) only uses rainfall data as dynamic predictor, it was expected to follow the seasonal behavior of water occurrence. However, the poor performance of this model in its application to the whole river indicates that it cannot extrapolate local-trained conditions considering the landscape attributes together with precipitation.”
“Interestingly, the stronger performance of Model (a) and (b) suggests that satellite-driven variables may better capture ecologically meaningful signals of intermittency, possibly due to their ability to represent spectral landscape responses. It is also possible to observe the complementary convergence of vegetation and water-related satellite indices and landscape attributes, which reinforce each other as physical controls (or responses) related to water retention.” - References and context
• The references are strongly weighted toward recent Brazilian studies. Please include some of the classic papers on ephemeral and intermittent streams.Thank you for this important comment. We revised the text to incorporate more diverse papers on ephemeral and intermittent streams.
COSTIGAN, Katie H. et al. Flow regimes in intermittent rivers and ephemeral streams. In: Intermittent rivers and ephemeral streams. Academic Press, 2017. p. 51-78.
ERIS, Ebru et al. Frequency analysis of low flows in intermittent and non-intermittent rivers from hydrological basins in Turkey. Water Supply, v. 19, n. 1, p. 30-39, 2019.
SARREMEJANE, Romain; MESSAGER, Mathis Loïc; DATRY, Thibault. Drought in intermittent river and ephemeral stream networks. Ecohydrology, v. 15, n. 5, p. e2390, 2022.
SEFTON, Catherine EM et al. Visualising and quantifying the variability of hydrological state in intermittent rivers. Fundamental and Applied Limnology, v. 193, n. 1, p. 21-38, 2019.
• The paper should also acknowledge interannual variability in flow connectivity, which is critical for interpreting results beyond a single year.We have added a paragraph to section 4.6 Work limitations acknowledging interannual variability and briefly discussing how stream intermittency may vary between years.
“We based our analysis on data from a single hydrological year. While this approach allowed for the capture of short-term hydroclimatic variability, it limits the model’s capacity to generalize across years with different rainfall patterns. Future studies including multi-year data and alternative temporal windows could help address this limitation.”
Minor Comments
Lines 109–111: Be consistent in describing rainy season timing. April–July are not “the first months of the year.”
Thank you. We revised the sentence to clarify the timing of the rainy season and avoid imprecise phrasing. It now reads: “We perform UAV surveys monthly during the rainy season (Apr to Jul), and once more in November 2022 (Fig. 4). As noted by Soares et al. (2024), the rainy season typically occurs in the first half of the year in this region.”
Line 134–135: Do you need to direct readers to the Sentinel user guide?
We agree that this reference is not essential and have removed the sentence referring readers to the Sentinel user guide.
Lines 135 & 161: Clarify the role of Google Earth Engine (GEE) for Sentinel and MapBiomas.
We clarified the text to specify that GEE was used as the platform for accessing, processing, and analyzing both Sentinel and MapBiomas data:
Lines 135:
Original: "Our sentinel images are obtained and processed using Google Earth Engine (GEE)."
Revised: "The access, processing and analyzing of sentinel images is performed in Google Earth Engine (GEE). The spectral indices were processed in the platform, and the mean values were obtained per modelling unit."“MapBiomas data are freely available, and we use Google Earth Engine (GEE) to access, process and analyze the data. The LULC classes were processed in the platform, and the frequency of each class were obtained per modelling unit.”
Line 140: Specify what was done in R.
The sentence was revised to specify that part of image processing was conducted using R: “If there are missing portions, these gaps are filled with the image of the previous or following day. In those cases, the overlaying of different raster (from subsequent days) is performed in R Statistical Software version 4.1.”
Line 165: Where in Figure 1 is this shown?
We added a reference in the caption of Figure 1 to indicate where the relevant feature appears in the figure: “Figure 1. Umbuzeiro River in the Benguê catchment with the monitored river reaches used in UAV surveys: upstream, middle, and downstream. Precipitation is measured in the Aiuaba Experimental Basin (AEB) and shown the monthly data. Stream flow is measured in Aroeira section with no streamflow observed during the year of 2022.”
Line 172: Please provide information on data access (can it be downloaded and where?).
We appreciate the comment. The original raw dataset is not publicly available. However, a processed and formatted version of the same dataset, including the relevant variables used in our study, is available through the following peer-reviewed publication:
Fullhart, A., Goodrich, D. C., Meles, M. B., Oliveira, P. T. S., das Neves Almeida, C., de Araújo, J. C., & Burns, S. (2023). Atlas of precipitation extremes for South America and Africa based on depth-duration-frequency relationships in a stochastic weather generator dataset. International Soil and Water Conservation Research, 11(4), 726-742. DOI: https://doi.org/10.1016/j.iswcr.2023.01.004.We have added this reference to the manuscript and clarified the data availability in the methods section.
Line 203: Suggested rewording: “containing the type of variable…”
Thank you for the suggestion. We revised the sentence to improve clarity and formality. The updated version now reads: “A summary of all candidate predictors used in our modeling approach is presented in Table 2, containing the type of variable, its spatial distribution, and frequency.”
Line 207: Cite Table 2 after “candidate predictors.”
Table 2 is now cited at the appropriate location as suggested.
Line 219: Reword to “The specific predictors employed in each model are summarized in Table 2.”
We have revised the sentence accordingly.Line 360: Section 4.6 Add acknowledgment of interannual variability and how the intermittency might vary
We have added a paragraph acknowledging interannual variability and briefly discussing how stream intermittency may vary between years.
Line 419: Avoid anthropomorphizing landscapes; specify which aspects of river metabolism are relevant.
Thank you for this helpful comment. We revised the sentence to remove anthropomorphic language and to specify which aspects of river metabolism are addressed. The revised version reads: “From an ecological perspective, mapping the temporal dynamics of wet and dry conditions at the river scale is crucial for assessing species migration and resilience, characterizing habitat availability, and analyzing key components of river metabolism, such as gross primary production and ecosystem respiration.”
Figures
- Fig. 1: Add discharge/precipitation panel to establish arid context, I know precipitation data is presented in Figure 9
Thank you for the suggestion. To better contextualize the hydrological regime of the study area, we have added a precipitation panel also to Figure 1, summarizing monthly rainfall averages. This provides a clearer overview of the region’s seasonal aridity, complementing the more detailed data presented later in Figure 9. No discharge occurred during 2022 in the study area.
Updated figure included in the PDF
Figure 1. Umbuzeiro River in the Benguê catchment with the monitored river reaches used in UAV surveys: upstream, middle, and downstream. Precipitation is measured in the Aiuaba Experimental Basin (AEB) and it is shown the monthly data. Stream flow is measured in Aroeira section with no streamflow observed during the year of 2022.- Fig. 2: Show/explain how exactly the UAV surveys are used
Thank you for this observation. We have clarified the role of UAV surveys in both the figure and the main text. In the updated figure caption, we now specify that UAV imagery is used to produce high-resolution (1 meter) water occurrence (Wet/Dry etc) classifications, which serve as ground-truth data for training the random forest models.
Updated figure included in the PDF
Figure 2. Flowchart display of the steps taken during the mapping and modelling of water occurrence. UAV surveys were used to generate high-resolution classification of water occurrence (1.0 meter reaches), which served as ground-truth data for training the classification models. These classifications, alongside predictor variables, were used to train and validate random forest models of water occurrence.- Fig. 3: Quantify misfit between flow accumulation (DEM) and visual mapping. Adjust blue line color for visibility.
Thank you for this comment. We have changed the color of the flow line to improve visibility and included a quantitative estimate of the misalignment between DEM-derived flow paths and visually mapped river reaches (e.g., average lateral offset in meters): “The average horizontal misfit between the flow accumulation path and the manually mapped stream was approximately 60 meters.”
Updated figure included in the PDF
Figure 3. Difference between Umbuzeiro River mappings: the previously available mapping based on flow accumulation (from 30 m resolution DEM) versus visual mapping of the Umbuzeiro River course with satellite imagery. The average horizontal misfit between the flow accumulation path and the manually mapped stream was approximately 60 meters. Image from UAV flight showing the monitored river reach. In highlight the modelling units (diameter of 100 m): river areas whose data is evaluated in water occurrence modelling.- Fig. 4: Add discharge/precipitation to support text (lines 109–110).
We have updated Figure 4 to include a precipitation panel aligned with UAV survey dates, providing clearer context for interpreting seasonal wetness dynamics.
Updated figure included in the PDF
Figure 4. Monthly precipitation and dates of imagery available in 2022, indicating the number of satellite images of each sensor and UAV flights.- Fig. 5: Missing panel label (a). Clarify “1.0 m reaches,” as 1 m seems implausibly small.
Thank you for pointing this out. We have added the missing panel label (a) and clarified in the figure caption and main text that “1.0 m reaches” refers to river segments of approximately 1 meter length along the mapped river sections. These fine-scale segments were individually classified as Wet, Transition, Dry, Not Determined using UAV imagery collected over three approximately 4-km-long river sections. This high spatial resolution allows detailed mapping of intermittency patterns. We also reviewed the explanation of this classification process in the whole document. The updated figure caption now reads:
Figure 5. Examples of UAV imagery showing water occurrence classes along the Umbuzeiro River. UAV surveys were used to generate high-resolution classification of water occurrence (1.0 meter reaches). We show in (a) different wet patches and in (b) different examples of damming structures.
- Fig. 7: Clarify why parameters with high Mean Decrease Accuracy were selected.
Thank you for raising this point. We have clarified in the caption and text that variables with higher Mean Decrease Accuracy (MDA) values were selected because they contribute more significantly to model performance:
“Variables with higher Mean Decrease Accuracy (MDA) values were selected because they have a greater impact on the predictive performance of the model. MDA reflects how much the accuracy of the model decreases when a given variable is excluded; thus, a higher MDA indicates that the variable provides important information for distinguishing between classes. By selecting variables with the highest MDA, we focused on those that most strongly influence the model’s ability to predict spatial patterns of intermittency. This approach helps identify predictors most relevant to distinguishing our water occurrence classes.”
Now Figure 8. Random forest model evaluation and feature selection results for three predictor sets: (model a) Sentinel-2 indices, (model b) PlanetScope indices, and (model c) hydrological variables. In (a) recursive feature elimination curves showing accuracy of the models as features are removed. In (b) out-of-bag (OOB) errors comparing models using all candidate predictors versus only the selected ones. BC denotes the benchmark classifier. In (c) mean decrease accuracy for the top 10 most important predictors in their respective model. Highlighted predictors were selected through recursive feature elimination based on their higher Mean Decrease Accuracy.
- Fig. 8: Please clarify why Table 2 lists 25 predictors, but Fig. 8 refers to 40.
We have clarified in the text that some predictor groups in Table 2 contribute multiple individual variables. Specifically, land cover predictors are derived from two classifications (8 and 7 classes), and the class frequency of each is included as a separate predictor, resulting in 40 total variables displayed in now Figure 8.
- Fig. 9: Explain Model (c) behavior, see Major comments.
Please refer to our detailed response under the major comments section. We now discuss in the manuscript how Model (c)’s behavior may reflect limitations in its predictors or seasonal anomalies not captured by the available data. We also contrast its performance with other models in relation to rainfall timing and spatial heterogeneity.
- Fig. 10: Clarify how “High” and “Low” were calculated. Can values be normalized and expressed as percentages?
Thank you for this insightful comment. To improve interpretability, these values were normalized and expressed as percentages of the total days monitored. We clarified in the text and figure caption that this percentage corresponds to the frequency with which each river reach was classified as “Wet” or “Transition” over the study period.
Updated figure included in the PDF
Now Figure 11. Observations with water in the riverbed (WIR), that is when the reach presents either "Wet" or "Transition" classes. The number of observations is normalized and expressed as percentages. Model (a) uses Sentinel predictors; (b) uses Planetscope indices; and (c) uses hydrological data.Tables
- Table 2: Organize predictors by model (a–c), consistent with Figs. 7, 9, and 10. Replace “constant” as that is not a Frequency
Thank you for this comment. We have replaced the term “constant” in the frequency column with “static” to better reflect that these variables do not change during the study period but still serve as important model inputs. Additionally, we agree that consistency across tables and figures is important. However, in this case, we chose to organize Table 2 by predictor type (e.g., vegetation indices, hydrology, landscape), as this structure better reflects the conceptual grouping used in the modeling process. We feel that grouping by variable type improves clarity, especially given that many predictors are used by more than one model. For consistency with Figures 7, 9, and 10 (now 8, 10 and 11), we have ensured that the “Used by model(s)” column clearly indicates the model(s) associated with each predictor.
- Table 3: Define balanced accuracy conceptually, I know the equation is given, and present Train/Test values in a clearer layout.
Thank you for this valuable suggestion. We have reformatted Table 3 to separate training and testing values into distinct sections, making the results easier to interpret. In addition, we included a conceptual definition of balanced accuracy in the text to complement the equation already provided in the methods section. This definition highlights its usefulness in evaluating model performance under class imbalance. We added this excerpt to the text:
“Balanced accuracy accounts for class imbalance by averaging sensitivity (true positive rate) and specificity (true negative rate), providing a more reliable assessment when classes are unevenly distributed. Values are reported separately for training and testing datasets.”
Table 3. Evaluation metrics for the three models considering train and test data. Model (a) with Sentinel predictors; (b) with Planetscope indices; and (c) with hydrological data.
Updated table included in the PDFData & codes
Please include the data and codes to comply with ESurf's FAIR policies
Thank you for this reminder. In the revised manuscript we will include input data and code used in this study as supplementary material.
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
707 | 66 | 21 | 794 | 12 | 32 |
- HTML: 707
- PDF: 66
- XML: 21
- Total: 794
- BibTeX: 12
- EndNote: 32
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
General Information
The study investigates spatial and temporal dynamics of river intermittency in the Umbuzeiro River (Brazilian semiarid region) using a combination of UAV surveys, remote sensing, and Random Forest (RF) modeling. The authors assess water presence classes: “Wet”, “Transition”, “Dry”, and “Not Determined”, across different times and river segments. They evaluate three RF model variants based on different dynamic predictors: Sentinel MNDWI, Planetscope NDVI, and 30-day accumulated precipitation.
This is a well-structured and technically rigorous study that addresses a critical gap in intermittent river monitoring in semi-arid environments. It provides an important proof of concept for combining UAV-based mapping and machine learning for ecohydrological research. The results are promising and provide a methodological blueprint for future upscaling efforts, though there is room for refinement in automation, validation, and generalization.
Despite the interest in the topic, I believe some changes in the paper are needed before it can be considered ready for publication, listed in the specific comments.
Specific Comments