the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Assessing the value of high-resolution rainfall and streamflow data for hydrological modeling: An analysis based on 63 catchments in southeast China
Abstract. The temporal resolution of forcing and calibration data substantially influences the performance of hydrological models. This impact varies among regions according to the climatic and landscape characteristics of the watersheds. In this study, we evaluated the benefits of using high-resolution rainfall and streamflow data in hydrological modeling across 63 small-to-medium-scale catchments in Southeastern China. We applied rainfall and streamflow data at various resolutions ranging from 1 to 24 hours to drive and calibrate a well-established hydrological model. Our findings reveal that: (1) Utilizing sub-daily rainfall data significantly enhances the accuracy of daily streamflow forecasts, with notable improvements observed when models transition from daily to sub-daily resolutions. (2) Forcing and calibrating the model by rainfall and streamflow data with sub-daily resolution data markedly improve hourly streamflow forecasts compared to daily data, but the enhancements become negligible when the resolution exceeds 6 hours. (3) The advantages of sub-daily resolution data are more pronounced in catchments characterized by smaller drainage areas, significant diurnal streamflow variability, and a greater number of rain gauges. These findings provide basis for a more efficient rainfall and streamflow data acquisition.
- Preprint
(1627 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on egusphere-2024-1438', Anonymous Referee #1, 09 Aug 2024
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2024/egusphere-2024-1438/egusphere-2024-1438-RC1-supplement.pdf
-
AC1: 'Reply on RC1', Maihemuti Tudaji, 27 Aug 2024
Thank you for your thorough review of our manuscript. We sincerely appreciate your positive evaluation and the constructive suggestions you provided, which will certainly help us improve the quality of our work.
We have carefully considered all your comments and are currently working on the revisions. I would like to share our explanations and plan for revisions in response to your comments.
Comment 1: In addition to the rainfall and runoff data already mentioned in the paper, other data used for modeling should also be detailed.
Response: Thank you for your reminder. Indeed, we also used DEM, potential evapotranspiration, Temperature, Normalized Difference Vegetation Index (NDVI) and Leaf Area Index (LAI) data necessary to drive the THREW model in this study. The DEM in this study was from the MERIT Digital Elevation Model (DEM) with a spatial resolution of 90m. Temperature and potential evapotranspiration data were sourced from ERA5-land. NDVI and LAI were obtained from NOAA Climate Data Record (CDR) datasets. We will include a description of these data in the revised manuscript.
Comment 2: It would be beneficial to reference Figure 2 at the beginning of Section 2.3 when introducing the experimental design. This will help readers refer to the flowchart while reading the corresponding text, improving their understanding of the experimental setup.
Response: Thanks for your suggestion. We will promptly refer to Figure 2 in the revised manuscript as suggested by you.
Comment 3: In the hourly tests, the model's original output was at the hourly scale, while the calibration was conducted at different time scales, and then the evaluation metrics were calculated back at the hourly scale. Why? I believe it is necessary to provide further clarification on the rationale behind this design choice.
Response: As the primary approach of this study, the experimental design indeed needs to be clearly and thoroughly explained. We apologize for not making it clear earlier. I would like to first explain the rationale behind this design:
1) The purposes of the daily and hourly tests are to investigate the effects of input data and measured streamflow with different resolutions on daily streamflow and hourly streamflow simulations, respectively. Therefore, the final evaluation metrics are the assessments of the simulated daily and hourly streamflow results.
2) To eliminate potential impacts of the model’s computational time step, the time step was standardized to 1 hour across all cases in both tests. This led to the model's original output being at the hourly scale in all cases. In the hourly test, the observed streamflow was also treated as input data for model to its calibration, and its resolution was the subject of investigation. Therefore, to explore the effects of observed streamflow data at different resolutions, the model’s original hourly simulated streamflow was aggregated into different time steps, which were then used for model calibration.
In the revised manuscript, we will present the experimental design in a clearer manner.
Comment 4: In the results shown in Figure 3, the REP values are negative for most watersheds in both the daily and hourly tests. Does this mean there is a systematic underestimation of peak values in the model?
Response: Accurately simulating peak flows is a challenge for all models. This is especially true for small and medium-sized basins like those in this study, where streamflow changes relatively quickly and flood durations are shorter, making it more difficult to accurately model peak flows. Results in this study also indicate that the THREW model performs better in basins with larger areas, higher precipitation, and smaller diurnal runoff variations.
In this study, peak flows were underestimated in most basins, which might be due to the use of the single-objective KGE metric during model calibration. The KGE metric reflects the model’s accuracy over the entire time series. The model is a simplification of natural runoff processes and operates with fixed parameters throughout its execution. The accuracy of the model’s simulation of peak flows and other processes (as evaluated by REP and KGE) are sometimes conflicting objectives. Therefore, to achieve a higher KGE, the optimization algorithm tends to select parameters that improve the overall simulation accuracy, which leads to the underestimation of peak flows in most small basins.
However, we used the same model, parameter range, and parameter optimization algorithm throughout the study, and the model performed well overall. This study focuses on the performance trends of the model under different data resolution conditions. All experiments strictly adhered to the principle of controlling variables. Even if there was a systematic underestimation of peak values, the trend remains consistent. Therefore, the conclusions of this study are not affected by the systematic underestimation.
Comment 5: The paper currently concludes that higher resolution data does not necessarily improve prediction accuracy. It would strengthen the paper to include further analysis or discussion from a hydrological mechanism perspective to explore the underlying reasons for this finding.
Response: Thanks for your suggestion. Other reviewers have made the same recommendations, and we recognize the need for a deeper analysis. Therefore, combined with suggestions from other reviewers, we have selected four representative catchments based on a comprehensive consideration of basin attributes such as spatial scale, rainfall, and streamflow variability, which were identified as the most important factors impacting the model performance and its improvement. We will plot the time series of observed and simulated streamflow under different cases for these catchments, and conduct a deeper analysis to explore and explain the variations from a mechanistic perspective. We will present the results in the revised manuscript.
Citation: https://doi.org/10.5194/egusphere-2024-1438-AC1
-
AC1: 'Reply on RC1', Maihemuti Tudaji, 27 Aug 2024
-
RC2: 'Comment on egusphere-2024-1438', Anonymous Referee #2, 12 Aug 2024
Tudaji et al. studied the effects of temporal resolutions of rainfall and streamflow on the performance of hydrologic model as rainfall is an important input of hydrologic model and the observed streamflow is important in model calibration. The study was conducted at 63 catchments in southeast China and the performances of the model based on different time steps were rigorously evaluated using different metrics. The manuscript is generally well organized and written.
My biggest concern is this work could be better instead of just a labor work. We are wondering the differences between the simulated streamflow using higher and lower resolutions of rainfall and streamflow, e.g., the timing and the magnitude etc. At least authors should plot the streamflow time series at several representative gauges to be transparent about the results. Sometimes, the metrics cannot fully tell the true performance or even give wrong indications.
Then based on the change of the streamflow, we need to know the mechanisms inducing such variations. An in-depth analysis based on physical processes should be conducted.
Then I think decision makers may really wanted to know, once a watershed is given, how we choose the best timestep to conduct hydrologic simulations. Some additional discussions about this are encouraged.
I also concern the model used and the study area, which determine the generality of the conclusions, though authors acknowledge such limitations in the manuscript.
Line23: runoff is not input.
I recommend returning this manuscript to authors for major revision.
Citation: https://doi.org/10.5194/egusphere-2024-1438-RC2 -
AC2: 'Reply on RC2', Maihemuti Tudaji, 27 Aug 2024
Thank you for your thorough review of our manuscript. We sincerely appreciate your constructive suggestions, which will certainly help us improve the quality of our work. We have carefully considered each of your comments and are currently working on the revisions. I would like to share our explanations and plan for revisions in response to your comments.
Comment 1: My biggest concern is this work could be better instead of just a labor work. We are wondering the differences between the simulated streamflow using higher and lower resolutions of rainfall and streamflow, e.g., the timing and the magnitude etc. At least authors should plot the streamflow time series at several representative gauges to be transparent about the results. Sometimes, the metrics cannot fully tell the true performance or even give wrong indications.
Response: Thank you for your constructive suggestion. We recognized the necessity of presenting representative data and conducting a more in-depth analysis. Therefore, we have selected four representative catchments based on a comprehensive consideration of basin attributes such as spatial scale, rainfall, and streamflow variability, which were identified as the most important factors impacting the model performance and its improvement. We will plot the time series of observed and simulated streamflow under different cases for these catchments, and conduct a deeper analysis to explore and explain the variations from a mechanistic perspective. We will present these results in the revised manuscript.
Comment 2: I also concern the model used and the study area, which determine the generality of the conclusions, though authors acknowledge such limitations in the manuscript.
Response: The generality of the conclusions to other regions and models is also a concern for us and other researchers. As we mentioned in the Discussion and Limitations sections, other authors (e.g., Ficchì et al., 2016; Reynolds et al, 2017) have reached similar conclusions in related studies, and they also mentioned concerns about generality and the need for further research. Currently, we are working to verify the generality of this conclusion in other regions and model in a case study in northern China. This is our plan for the next research. We believe that as more related studies emerge, we will gain a clearer and more comprehensive understanding. We will also address this issue in the discussion section of revised manuscript.
Comment 3: Line23: runoff is not input.
Response: We apologize for the confusion caused by our wording. Our intention was to emphasize the importance of rainfall and observed streamflow data to the model. Since observed streamflow is essential for model calibration during the model construction phase, it is, in some cases, also used as input data. We plan to revise the original sentence in the manuscript as follows:
"The effectiveness of these models heavily depends on the quality and resolution of the data, especially the rainfall used for forcing and measured streamflow for calibration."
References:
Ficchi, A., Perrin, C., and Andreassian, V.: Impact of temporal resolution of inputs on hydrological model performance: An analysis based on 2400 flood events, Journal of Hydrology, 538, 454-470, 10.1016/j.jhydrol.2016.04.016, 2016.
Reynolds, J. E., Halldin, S., Xu, C. Y., Seibert, J., and Kauffeldt, A.: Sub-daily runoff predictions using parameters calibrated on the basis of data with a daily temporal resolution, Journal of Hydrology, 550, 399-411, 10.1016/j.jhydrol.2017.05.012, 2017.
Citation: https://doi.org/10.5194/egusphere-2024-1438-AC2
-
AC2: 'Reply on RC2', Maihemuti Tudaji, 27 Aug 2024
-
RC3: 'Comment on egusphere-2024-1438', Anonymous Referee #3, 19 Aug 2024
This work represents a significant step forward in addressing the challenges associated with using high-resolution datasets in hydrological modeling. Specifically, the authors correctly identify and address the still unclear and sometimes contradictory effects of high-resolution data on hydrological model performance. They adopt various metrics to disentangle this complex problem by applying a hydrological model to a set of 63 basins in Southeast China. Particularly noteworthy is the use of metrics such as GOUE to characterize basin characteristics.
However, the primary limitation of this work is the lack of physical interpretation of the outcomes. The hydrological model is treated more as a "black box," failing to provide insights into the hydrological processes that might explain the presented results. I therefore recommend this paper for major revision. In particular, I suggest enhancing the description of the hydrological model, which could assist both the authors and readers in interpreting the results in the discussion section.
Other Comments:
• The methodology section is somewhat difficult to follow. From what I understand, the minimum time step used is one hour, meaning that processes are integrated over time using this interval. This implies that the maximum resolution of the input data is hourly. Once this time step is established, it is possible to produce output at this resolution or at coarser resolutions (e.g., 2 hours, 3 hours, up to 24 hours), but not finer. The discriminating factor is therefore the integration time, after which you may choose to aggregate the output (e.g., streamflow) at a daily time step to compute efficiency metrics. In this context, statements like “all input data except for rainfall were resampled to the hourly resolution” could be misleading.
• Please provide the equation for the GOUE. Additionally, a table outlining the influencing factors would enhance the readability of the paper.
• Could the authors discuss other potential factors that might explain the differences in performance between the different basins? For instance, consider a scenario where a particular process is not well-modeled. This process could be highly significant in one basin but negligible in another, leading to varying results. Could this be the case for some of the basins under consideration?
• The table captions need to be expanded. Specifically, the meaning of bold text and asterisks should be clearly explained in the captions.
• The second question regarding the time step resolution is not fully addressed. According to the methodology section, the highest resolution used is hourly. With input data of an hourly resolution, one could potentially model processes at this time step.
• Line 25-28: References are required to support these statements.Citation: https://doi.org/10.5194/egusphere-2024-1438-RC3 -
AC3: 'Reply on RC3', Maihemuti Tudaji, 27 Aug 2024
Thank you for your detailed review of our manuscript. We greatly value your constructive feedback, which will undoubtedly enhance the quality of our work. We have carefully considered your comments and are in the process of making revisions. Here, I would like to share our responses and outline our planned revisions.
Comment 1: The primary limitation of this work is the lack of physical interpretation of the outcomes. The hydrological model is treated more as a "black box," failing to provide insights into the hydrological processes that might explain the presented results. I therefore recommend this paper for major revision. In particular, I suggest enhancing the description of the hydrological model, which could assist both the authors and readers in interpreting the results in the discussion section.
Response: Thank you for your suggestion. We recognize that the descriptions of hydrological models in this paper are not sufficiently detailed. As you suggested, we will provide a detailed description of the hydrological model in the revised manuscript.
Comment 2: The methodology section is somewhat difficult to follow. From what I understand, the minimum time step used is one hour, meaning that processes are integrated over time using this interval. This implies that the maximum resolution of the input data is hourly. Once this time step is established, it is possible to produce output at this resolution or at coarser resolutions (e.g., 2 hours, 3 hours, up to 24 hours), but not finer. The discriminating factor is therefore the integration time, after which you may choose to aggregate the output (e.g., streamflow) at a daily time step to compute efficiency metrics. In this context, statements like “all input data except for rainfall were resampled to the hourly resolution” could be misleading.
Response: Your understanding is completely correct. We sincerely apologize for the confusion caused by our wording. We acknowledge that the use of "except for" in our original statement was incorrect; the proper term should have been "besides", and the full sentence should be "All input data besides rainfall were resampled to the hourly resolution".
The purpose of this study is to investigate the value of different data resolutions (including rainfall and streamflow) for hydrological modeling. The original resolution of the rainfall data is slightly finer than 1 hour. We first resampled the rainfall data to other temporal scales between 1 and 24 hours to generate rainfall data at different resolutions. The original resolution of the other driving data is 24 hours. To eliminate the influence of computation time step, we fixed the model's computation time step at 1 hour. The model was run with a 1-hour computation time step, so if the input data resolution was coarser than 1 hour, it needed to be resampled to a 1-hour resolution before the calculations, and then generate results at a 1-hour resolution. Subsequently, to explore the effect of streamflow data at different resolutions, the 1-hour resolution simulation results were aggregated to other temporal scales, and the model was calibrated based on observed streamflow at the corresponding scales.
Comment 3: Please provide the equation for the GOUE. Additionally, a table outlining the influencing factors would enhance the readability of the paper.
Response: Thank you for your suggestion. The equation for the GOUE is shown as follows:
where n is the length of the time series of the streamflow, Qh is the actual hourly streamflow, Qh with a macron is the mean of the Qh, QDh is the hourly streamflow based on daily streamflow assuming a uniform intraday streamflow. We will add the GOUE’s equation and description to the revised manuscript. Besides, a brief introduction and abbreviations of the influencing factors have been included in Table 1, and we will mention this clearly in Section 2.1 of the revised manuscript.
Comment 4: Could the authors discuss other potential factors that might explain the differences in performance between the different basins? For instance, consider a scenario where a particular process is not well-modeled. This process could be highly significant in one basin but negligible in another, leading to varying results. Could this be the case for some of the basins under consideration?
Response: Thank you for your suggestion. As you mentioned, the accuracy of the model's simulation of a particular hydrological process and the importance of that process within the catchment could indeed be one of the potential factors. However, since the meteorological conditions across our study catchments are quite similar, there might not be a significant difference in the importance of a particular hydrological process across the different catchments. Therefore, this factor may have little impact on the model's varying performance across the catchments.
Comment 5: The table captions need to be expanded. Specifically, the meaning of bold text and asterisks should be clearly explained in the captions.
Response: Thank you for your suggestion. We will follow your suggestion and clearly indicate the meaning of the bold text and asterisks in the captions in the revised manuscript.
Comment 6: The second question regarding the time step resolution is not fully addressed. According to the methodology section, the highest resolution used is hourly. With input data of an hourly resolution, one could potentially model processes at this time step.
Response: Indeed, if high-resolution data (such as 1-hour resolution) is already available, it can typically provide reliable hourly simulations. However, the purpose of our second question and the design of our second experiment is to determine whether, in scenarios where data availability is limited, relatively coarse resolution data can still offer the same level of reliability in hourly simulations. If so, what is the coarsest resolution that maintains this reliability? We are searching for a resolution threshold: if the data resolution is coarser than this threshold, the reliability of the results decreases; if it is finer, there is no significant improvement in reliability.
We apologize for not clearly explaining the focus of our study. We plan to revise the second question to: "What is the coarsest resolution of rainfall and streamflow data required to provide reliable hourly streamflow simulations?" This change, replacing "necessary" with "coarsest," should help reduce confusion and enhance readability.
Comment 7: Line 25-28: References are required to support these statements.
Response: Thank you for your reminding. We will include relevant references to support this statement in the content mentioned above and provide detailed citations in the reference section. The following will be the revised content:
To address this limitation, data is often artificially disaggregated from raw time series using mass curves (Blöschl and Sivapalan, 1995) or complex stochastic generators (Creutin and Obled, 1980). However, models based on coarsely resolved or artificially refined data can introduce biases, particularly when forecasting at finer temporal scales, as they may not accurately capture the variability and magnitude of hydrological variables (Younis et al., 2008; Huang et al., 2019).
References:
Blöschl G, Sivapalan M. Scale issues in hydrological modelling: a review[J]. Hydrological processes, 1995, 9(3‐4): 251-290.
Creutin J D, Obled C. Modelling spatial and temporal characteristics of rainfall as input to a flood forecasting model[J]. IAHS-AISH Publication (129), 1980: 41-49.
Huang, Y., Bárdossy, A., and Zhang, K.: Sensitivity of hydrological models to temporal and spatial resolutions of rainfall data, Hydrol. Earth Syst. Sci., 23, 2647–2663, https://doi.org/10.5194/hess-23-2647-2019, 2019.
Younis J, Anquetin S, Thielen J. The benefit of high-resolution operational weather forecasts for flash flood warning[J]. Hydrology and Earth System Sciences, 2008, 12(4): 1039-1051.
Citation: https://doi.org/10.5194/egusphere-2024-1438-AC3
-
AC3: 'Reply on RC3', Maihemuti Tudaji, 27 Aug 2024
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
291 | 122 | 27 | 440 | 9 | 10 |
- HTML: 291
- PDF: 122
- XML: 27
- Total: 440
- BibTeX: 9
- EndNote: 10
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1