Analyzing the generalization capabilities of hybrid hydrological models for extrapolation to extreme events

Acuna Espinoza, Eduardo; Loritz, Ralf; Kratzert, Frederik; Klotz, Daniel; Gauch, Martin; Álvarez Chaves, Manuel; Bäuerle, Nicole; Ehret, Uwe

doi:10.5194/egusphere-2024-2147

Preprints

https://doi.org/10.5194/egusphere-2024-2147

Preprints

30 Jul 2024

| 30 Jul 2024

Analyzing the generalization capabilities of hybrid hydrological models for extrapolation to extreme events

Eduardo Acuna Espinoza, Ralf Loritz, Frederik Kratzert, Daniel Klotz, Martin Gauch, Manuel Álvarez Chaves, Nicole Bäuerle, and Uwe Ehret

Abstract. Data-driven techniques have shown the potential to outperform process-based models for rainfall-runoff simulation. Recently, hybrid models, which combine data-driven methods with process-based approaches, have been proposed to leverage the strengths of both methodologies, aiming to enhance simulation accuracy while maintaining certain interpretability. Expanding the set of test cases to evaluate hybrid models under different conditions, we test their generalization capabilities for extreme hydrological events, comparing their performance against Long Short-Term Memory (LSTM) networks and process based models. Our results indicate that hybrid models show similar performance as LSTM networks for most cases. However, hybrid models reported slightly lower errors in the most extreme cases, and were able to produce higher peak discharges.

Received: 10 Jul 2024 – Discussion started: 30 Jul 2024

Competing interests: At least one of the (co-)authors is a member of the editorial board of Hydrology and Earth System Sciences.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 2807 KB)

Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
Preprint (2807 KB)

Download & links

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Journal article(s) based on this preprint

11 Mar 2025

Analyzing the generalization capabilities of a hybrid hydrological model for extrapolation to extreme events

Eduardo Acuña Espinoza, Ralf Loritz, Frederik Kratzert, Daniel Klotz, Martin Gauch, Manuel Álvarez Chaves, and Uwe Ehret

Hydrol. Earth Syst. Sci., 29, 1277–1294, https://doi.org/10.5194/hess-29-1277-2025,https://doi.org/10.5194/hess-29-1277-2025, 2025

Short summary

Eduardo Acuna Espinoza, Ralf Loritz, Frederik Kratzert, Daniel Klotz, Martin Gauch, Manuel Álvarez Chaves, Nicole Bäuerle, and Uwe Ehret

Interactive discussion

Status: closed

CC1:
'Comment on egusphere-2024-2147', Chaopeng Shen, 22 Aug 2024

Dear authors,
Thanks for the contribution.
We have run similar experiments on our end, which show that our version of single hybrid model, dHBV, outperformed LSTM in nearly all return-period categories. These results are documented here: https://t.co/BnWtEy6NEk. The conclusions seemed to be moderately different from Espinoza24.
To understand where discrepancies lie, we performed extensive due diligence by running multiple experiments with the same setups as the authors to understand the observed differences. We appreciate the authors for making their code available, enabling this exploration. It seems the neuralhydrology vs. hydroDL implementations of differentiable HBV are the main cause of discrepancies, which have the potential the change the conclusions. We would like to share the findings with the authors. While Espinoza24 conducted an experiment to verify that NH-hybrid could reproduce earlier results from Feng et al. (2022), it is important to note that this does not imply that other experiments would yield the same outcome. The claimed equivalence is not established here.
Please see the attached PDF for details.

Citation: https://doi.org/10.5194/egusphere-2024-2147-CC1
- AC1:
  'Reply on CC1', Eduardo Acuna, 09 Sep 2024
  
  Please find attached our response to the comment on egusphere-2024-2147 by Chaopeng Shen
  
  Citation: https://doi.org/10.5194/egusphere-2024-2147-AC1
  - CC3: 'Reply on AC1', Chaopeng Shen, 19 Sep 2024
    
    Please see the attachment. We ran the model dHBV1.0 with more random seeds. Every single time, it has a lower error than LSTM or the "hybrid" in the authors' plot. Hence, the authors' assumption about statistical noise is not correct, and the difference in appearance is still due to the difference between dHBV1.0 and the "hybrid" that the authors trained.
    
    Citation: https://doi.org/10.5194/egusphere-2024-2147-CC3
    
    AC3: 'Reply on CC3', Eduardo Acuna, 04 Oct 2024
    
    Please find attached our response
    
    Citation: https://doi.org/10.5194/egusphere-2024-2147-AC3
CC2:
'Comment on egusphere-2024-2147', John Ding, 30 Aug 2024

AR2 second-order autoregressive process of the streamflow
Besides the LSTM, HBV and a hybrid of the two, the authors may wish to revisit an autoregressive baseline model called AR(2) or AR2. This, an acceleration-based metric, is expressed by:
Qar2[t+1]=2Qobs[t]-Qobs[t-1],
see Azmi et al. (2021, SC1, Eq. 1).
The subject was previously discussed between me and Uwe Ehret, the current closing author, on a storm event scale in a different but related context (ibid., AC1, Table 1).
To summarize my take of our discussion, below are two main points:
1) a third-order AR model, AR-3 (Model-07, therein) when rounding off the time lag coefficients, is identical to AR2, and
2) it outperforms an ANN model (Model-08) by an NSE value of 0.99 to 0.12.
For the 531 CAMELS-US basins (Lines 125-130, and Figure 2), can we infer from point 2 above that an AR2 will be a better performing model? Let’s consider this a hypothesis for falsification in another open discussion forum.
In theory, an AR2 projection hydrograph over/under shoots the observed peak/trough flows - just visualize a USDA-SCS triangular unit hydrograph having an upslope and a downslope projection. This is in contrast to the authors' finding that 'all [three of their] models underestimated extreme flow scenarios,' (Line 243).
References
Azmi, E., Ehret, U., Weijs, S. V., Ruddell, B. L., and Perdigão, R. A. P.: Technical note: “Bit by bit”: a practical and general approach for evaluating model computational complexity vs. model performance, Hydrol. Earth Syst. Sci., 25, 1103–1115, https://doi.org/10.5194/hess-25-1103-2021, 2021.

Citation: https://doi.org/10.5194/egusphere-2024-2147-CC2
- AC2: 'Reply on CC2', Eduardo Acuna, 09 Sep 2024
  
  Please find attached our response to the comment on egusphere-2024-2147 by John Ding
  
  Citation: https://doi.org/10.5194/egusphere-2024-2147-AC2
RC1:
'Comment on egusphere-2024-2147', Basil Kraft, 05 Sep 2024

Dear Authors,

Thank you for sharing this manuscript. It was a pleasure to review your study on comparing a hybrid hydrological model with both a neural network and a conceptual model. Such investigations are crucial for understanding the strengths and limitations of hybrid approaches.

My full review is attached.

Best regards,

Basil Kraft

Citation: https://doi.org/10.5194/egusphere-2024-2147-RC1
- AC4: 'Reply on RC1', Eduardo Acuna, 04 Oct 2024
  
  We want to thank the referee for the detailed evaluation of our paper. In the attached document we answer the questions, comments and suggestions given.
  
  Citation: https://doi.org/10.5194/egusphere-2024-2147-AC4
RC2:
'Comment on egusphere-2024-2147', Shijie Jiang, 06 Oct 2024

The manuscript "Analyzing the generalization capabilities of hybrid hydrological models for extrapolation to extreme events" compares the generalization capabilities of hybrid models, LSTM networks, and process-based models for rainfall-runoff simulations, with a particular focus on extreme events. The study examines whether hybrid models provide a meaningful advantage over standalone data-driven or process-based models. The results suggest that hybrid models show marginal improvements in predicting extreme peak flows, but overall perform similarly to LSTM networks. The authors argue that given the comparable performance, the choice of model depends on user needs. Overall, the study does a great job of providing a balanced perspective on the hybrid models. The paper is valuable in stimulating further discussion in the field.
Major comments

1) One of the central claims for hybrid models is that they combine the predictive power of data-driven approaches with the interpretability of process-based models. However, the manuscript focuses more on marginal differences in predictive performance than on the added interpretability that might justify hybrid models. I suggest including a discussion of the trade-off between accuracy and interpretability. For example, does the hybrid model help to better understand the causes of extreme flows, such as snowmelt, soil moisture dynamics, or precipitation anomalies? Could the explicit encoding of hydrologic concepts in the hybrid model be more valuable for decision making, even if the predictive gains are minimal?
2) While the paper touches on model errors during extreme events, it does not provide an analysis of where and why each model is better or worse, e.g., under which geophysical, climatic, or soil conditions. This could be helpful to better understand the strengths and limitations of each model type and provide a useful guide to when hybrid / LSTM models are most beneficial.
3) A related comment is that while the authors conclude that the choice of model depends on user needs, the manuscript does not provide clear guidance on how to make this choice. For example, in data-poor environments where high-quality or long-term observational data may not be available, should hybrid models be preferred because they incorporate process-based knowledge that could compensate for sparse data? Is it possible to make a comparison that assumes limited data? I think it would be helpful for practitioners working in regions with poor monitoring infrastructure.
Specific comments:

L12, the term “out-of-sample conditions” is somewhat ambiguous. Please specify what type of generalization is meant (temporal or spatial domains).

L16, the phrase "notion of interpretability" could be clearer. What does "notion" mean in this context? It sounds vague. If interpretability is considered to be a key reason for adopting hybrid models over purely data-driven ones, it should be more clearly defined and quantified. Does interpretability mean the ability to interpret the parameters, processes, or outputs in a hydrologically meaningful way? Or are you suggesting that it's a "so-called" interpretability?

L30, what specific structural deficiencies are you referring to here?

L35, the focus on "higher predictive accuracy" may overlook the fact that accuracy alone may not be the best criterion for assessing model suitability. Authors should clarify that other criteria (such as robustness, model transparency, applicability) besides accuracy may be equally important in model evaluation.

L100, the explanation of the hybrid model’s parameterization is complex and may not be easily understood by just reading this paper. At least a clearer explanation of the buckets and parameters is needed.

L127 without discussing the potential limitations of the HBV model, this claim seems overly simplistic. It is useful to explain here why the HBV model underperformed, even though it has been studied in previous studies.

L150, again, this conclusion of equivalence is overly simplistic and could lead to believing that there are no meaningful differences between the models. Are there certain types of basins or hydrological conditions (e.g., arid basins) where one model clearly outperforms the other?

L167, it's hard to read from the figure about the "slightly lower errors".

L215, this observation is important but lacks sufficient follow-up. If the dynamic parameterization reaches its limits during extreme events, it indicates a potential flaw in the model design, but the text does not discuss how this issue could be addressed or what its implications are. Could the predefined intervals be adjusted or extended to better handle extreme events?

L220, I am very confused here. How does the snowmelt effect indicate the potential bias in the input data? If the snowmelt flux is high, it's not surprising to see a discrepancy between precipitation and runoff. This statement also raises the question of a structural flaw in the HBV model, but it is not elaborated. I'm left wondering what specific deficiencies in the snow module are responsible for the poor performance and how these deficiencies could be addressed in future work. For example, is the snowmelt process not adequately modeled due to insufficient temperature data, or is the parameterization of the snow module too simplistic?

L225, it's vague and doesn't provide enough insight into what types of hybrid architectures might yield different results. In my opinion, the hybrid model used in this paper considers model with a conceptual model as the backbone and neural networks for parameter learning. It would be more actionable to point out some other types of hybrid models, e.g., component replacement or more conceptual frameworks (e.g., https://hess.copernicus.org/articles/26/1579/2022/) that might address some of the limitations identified in the study.

L230, I'm afraid this recommendation is too general and simplistic...

L241, is it possible to use more precise numbers or statistical analysis to support the claim of “slight” outperformance. If the differences are marginal, do you think they might still matter in practical scenarios?

L245, the mention of "possible bias in the input data" is speculative without further analysis. And if that's the case, does it imply that LSTM is insensitive to the bias?

L249, the statement about dynamic parameterization is not sufficiently elaborated. It doesn't provide enough detail about how this adaptation happens or why it is particularly useful for extreme events. Also, the comparison with LSTM gating is interesting, but lacks further discussion.

Citation: https://doi.org/10.5194/egusphere-2024-2147-RC2
- AC5: 'Reply on RC2', Eduardo Acuna, 15 Oct 2024
  
  We want to thank the referee for the detailed evaluation of our paper. In the attached document we answer the questions, comments and suggestions given.
  
  Citation: https://doi.org/10.5194/egusphere-2024-2147-AC5

Interactive discussion

Status: closed

CC1:
'Comment on egusphere-2024-2147', Chaopeng Shen, 22 Aug 2024

Dear authors,
Thanks for the contribution.
We have run similar experiments on our end, which show that our version of single hybrid model, dHBV, outperformed LSTM in nearly all return-period categories. These results are documented here: https://t.co/BnWtEy6NEk. The conclusions seemed to be moderately different from Espinoza24.
To understand where discrepancies lie, we performed extensive due diligence by running multiple experiments with the same setups as the authors to understand the observed differences. We appreciate the authors for making their code available, enabling this exploration. It seems the neuralhydrology vs. hydroDL implementations of differentiable HBV are the main cause of discrepancies, which have the potential the change the conclusions. We would like to share the findings with the authors. While Espinoza24 conducted an experiment to verify that NH-hybrid could reproduce earlier results from Feng et al. (2022), it is important to note that this does not imply that other experiments would yield the same outcome. The claimed equivalence is not established here.
Please see the attached PDF for details.

Citation: https://doi.org/10.5194/egusphere-2024-2147-CC1
- AC1:
  'Reply on CC1', Eduardo Acuna, 09 Sep 2024
  
  Please find attached our response to the comment on egusphere-2024-2147 by Chaopeng Shen
  
  Citation: https://doi.org/10.5194/egusphere-2024-2147-AC1
  - CC3: 'Reply on AC1', Chaopeng Shen, 19 Sep 2024
    
    Please see the attachment. We ran the model dHBV1.0 with more random seeds. Every single time, it has a lower error than LSTM or the "hybrid" in the authors' plot. Hence, the authors' assumption about statistical noise is not correct, and the difference in appearance is still due to the difference between dHBV1.0 and the "hybrid" that the authors trained.
    
    Citation: https://doi.org/10.5194/egusphere-2024-2147-CC3
    
    AC3: 'Reply on CC3', Eduardo Acuna, 04 Oct 2024
    
    Please find attached our response
    
    Citation: https://doi.org/10.5194/egusphere-2024-2147-AC3
CC2:
'Comment on egusphere-2024-2147', John Ding, 30 Aug 2024

AR2 second-order autoregressive process of the streamflow
Besides the LSTM, HBV and a hybrid of the two, the authors may wish to revisit an autoregressive baseline model called AR(2) or AR2. This, an acceleration-based metric, is expressed by:
Qar2[t+1]=2Qobs[t]-Qobs[t-1],
see Azmi et al. (2021, SC1, Eq. 1).
The subject was previously discussed between me and Uwe Ehret, the current closing author, on a storm event scale in a different but related context (ibid., AC1, Table 1).
To summarize my take of our discussion, below are two main points:
1) a third-order AR model, AR-3 (Model-07, therein) when rounding off the time lag coefficients, is identical to AR2, and
2) it outperforms an ANN model (Model-08) by an NSE value of 0.99 to 0.12.
For the 531 CAMELS-US basins (Lines 125-130, and Figure 2), can we infer from point 2 above that an AR2 will be a better performing model? Let’s consider this a hypothesis for falsification in another open discussion forum.
In theory, an AR2 projection hydrograph over/under shoots the observed peak/trough flows - just visualize a USDA-SCS triangular unit hydrograph having an upslope and a downslope projection. This is in contrast to the authors' finding that 'all [three of their] models underestimated extreme flow scenarios,' (Line 243).
References
Azmi, E., Ehret, U., Weijs, S. V., Ruddell, B. L., and Perdigão, R. A. P.: Technical note: “Bit by bit”: a practical and general approach for evaluating model computational complexity vs. model performance, Hydrol. Earth Syst. Sci., 25, 1103–1115, https://doi.org/10.5194/hess-25-1103-2021, 2021.

Citation: https://doi.org/10.5194/egusphere-2024-2147-CC2
- AC2: 'Reply on CC2', Eduardo Acuna, 09 Sep 2024
  
  Please find attached our response to the comment on egusphere-2024-2147 by John Ding
  
  Citation: https://doi.org/10.5194/egusphere-2024-2147-AC2
RC1:
'Comment on egusphere-2024-2147', Basil Kraft, 05 Sep 2024

Dear Authors,

Thank you for sharing this manuscript. It was a pleasure to review your study on comparing a hybrid hydrological model with both a neural network and a conceptual model. Such investigations are crucial for understanding the strengths and limitations of hybrid approaches.

My full review is attached.

Best regards,

Basil Kraft

Citation: https://doi.org/10.5194/egusphere-2024-2147-RC1
- AC4: 'Reply on RC1', Eduardo Acuna, 04 Oct 2024
  
  We want to thank the referee for the detailed evaluation of our paper. In the attached document we answer the questions, comments and suggestions given.
  
  Citation: https://doi.org/10.5194/egusphere-2024-2147-AC4
RC2:
'Comment on egusphere-2024-2147', Shijie Jiang, 06 Oct 2024

The manuscript "Analyzing the generalization capabilities of hybrid hydrological models for extrapolation to extreme events" compares the generalization capabilities of hybrid models, LSTM networks, and process-based models for rainfall-runoff simulations, with a particular focus on extreme events. The study examines whether hybrid models provide a meaningful advantage over standalone data-driven or process-based models. The results suggest that hybrid models show marginal improvements in predicting extreme peak flows, but overall perform similarly to LSTM networks. The authors argue that given the comparable performance, the choice of model depends on user needs. Overall, the study does a great job of providing a balanced perspective on the hybrid models. The paper is valuable in stimulating further discussion in the field.
Major comments

1) One of the central claims for hybrid models is that they combine the predictive power of data-driven approaches with the interpretability of process-based models. However, the manuscript focuses more on marginal differences in predictive performance than on the added interpretability that might justify hybrid models. I suggest including a discussion of the trade-off between accuracy and interpretability. For example, does the hybrid model help to better understand the causes of extreme flows, such as snowmelt, soil moisture dynamics, or precipitation anomalies? Could the explicit encoding of hydrologic concepts in the hybrid model be more valuable for decision making, even if the predictive gains are minimal?
2) While the paper touches on model errors during extreme events, it does not provide an analysis of where and why each model is better or worse, e.g., under which geophysical, climatic, or soil conditions. This could be helpful to better understand the strengths and limitations of each model type and provide a useful guide to when hybrid / LSTM models are most beneficial.
3) A related comment is that while the authors conclude that the choice of model depends on user needs, the manuscript does not provide clear guidance on how to make this choice. For example, in data-poor environments where high-quality or long-term observational data may not be available, should hybrid models be preferred because they incorporate process-based knowledge that could compensate for sparse data? Is it possible to make a comparison that assumes limited data? I think it would be helpful for practitioners working in regions with poor monitoring infrastructure.
Specific comments:

L12, the term “out-of-sample conditions” is somewhat ambiguous. Please specify what type of generalization is meant (temporal or spatial domains).

L16, the phrase "notion of interpretability" could be clearer. What does "notion" mean in this context? It sounds vague. If interpretability is considered to be a key reason for adopting hybrid models over purely data-driven ones, it should be more clearly defined and quantified. Does interpretability mean the ability to interpret the parameters, processes, or outputs in a hydrologically meaningful way? Or are you suggesting that it's a "so-called" interpretability?

L30, what specific structural deficiencies are you referring to here?

L35, the focus on "higher predictive accuracy" may overlook the fact that accuracy alone may not be the best criterion for assessing model suitability. Authors should clarify that other criteria (such as robustness, model transparency, applicability) besides accuracy may be equally important in model evaluation.

L100, the explanation of the hybrid model’s parameterization is complex and may not be easily understood by just reading this paper. At least a clearer explanation of the buckets and parameters is needed.

L127 without discussing the potential limitations of the HBV model, this claim seems overly simplistic. It is useful to explain here why the HBV model underperformed, even though it has been studied in previous studies.

L150, again, this conclusion of equivalence is overly simplistic and could lead to believing that there are no meaningful differences between the models. Are there certain types of basins or hydrological conditions (e.g., arid basins) where one model clearly outperforms the other?

L167, it's hard to read from the figure about the "slightly lower errors".

L215, this observation is important but lacks sufficient follow-up. If the dynamic parameterization reaches its limits during extreme events, it indicates a potential flaw in the model design, but the text does not discuss how this issue could be addressed or what its implications are. Could the predefined intervals be adjusted or extended to better handle extreme events?

L220, I am very confused here. How does the snowmelt effect indicate the potential bias in the input data? If the snowmelt flux is high, it's not surprising to see a discrepancy between precipitation and runoff. This statement also raises the question of a structural flaw in the HBV model, but it is not elaborated. I'm left wondering what specific deficiencies in the snow module are responsible for the poor performance and how these deficiencies could be addressed in future work. For example, is the snowmelt process not adequately modeled due to insufficient temperature data, or is the parameterization of the snow module too simplistic?

L225, it's vague and doesn't provide enough insight into what types of hybrid architectures might yield different results. In my opinion, the hybrid model used in this paper considers model with a conceptual model as the backbone and neural networks for parameter learning. It would be more actionable to point out some other types of hybrid models, e.g., component replacement or more conceptual frameworks (e.g., https://hess.copernicus.org/articles/26/1579/2022/) that might address some of the limitations identified in the study.

L230, I'm afraid this recommendation is too general and simplistic...

L241, is it possible to use more precise numbers or statistical analysis to support the claim of “slight” outperformance. If the differences are marginal, do you think they might still matter in practical scenarios?

L245, the mention of "possible bias in the input data" is speculative without further analysis. And if that's the case, does it imply that LSTM is insensitive to the bias?

L249, the statement about dynamic parameterization is not sufficiently elaborated. It doesn't provide enough detail about how this adaptation happens or why it is particularly useful for extreme events. Also, the comparison with LSTM gating is interesting, but lacks further discussion.

Citation: https://doi.org/10.5194/egusphere-2024-2147-RC2
- AC5: 'Reply on RC2', Eduardo Acuna, 15 Oct 2024
  
  We want to thank the referee for the detailed evaluation of our paper. In the attached document we answer the questions, comments and suggestions given.
  
  Citation: https://doi.org/10.5194/egusphere-2024-2147-AC5

Peer review completion

AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload

ED: Reconsider after major revisions (further review by editor and referees) (16 Oct 2024) by Manuela Irene Brunner

AR by Eduardo Acuna on behalf of the Authors (20 Nov 2024) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (22 Nov 2024) by Manuela Irene Brunner

RR by Basil Kraft (09 Dec 2024)

RR by Shijie Jiang (25 Dec 2024)

ED: Publish subject to minor revisions (review by editor) (03 Jan 2025) by Manuela Irene Brunner

AR by Eduardo Acuna on behalf of the Authors (12 Jan 2025) Author's response Author's tracked changes Manuscript

ED: Publish as is (18 Jan 2025) by Manuela Irene Brunner

AR by Eduardo Acuna on behalf of the Authors (20 Jan 2025) Manuscript

Journal article(s) based on this preprint

11 Mar 2025

Analyzing the generalization capabilities of a hybrid hydrological model for extrapolation to extreme events

Eduardo Acuña Espinoza, Ralf Loritz, Frederik Kratzert, Daniel Klotz, Martin Gauch, Manuel Álvarez Chaves, and Uwe Ehret

Hydrol. Earth Syst. Sci., 29, 1277–1294, https://doi.org/10.5194/hess-29-1277-2025,https://doi.org/10.5194/hess-29-1277-2025, 2025

Short summary

Eduardo Acuna Espinoza, Ralf Loritz, Frederik Kratzert, Daniel Klotz, Martin Gauch, Manuel Álvarez Chaves, Nicole Bäuerle, and Uwe Ehret

Viewed

Total article views: 5,138 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
2,902	1,184	1,052	5,138	83	141

HTML: 2,902
PDF: 1,184
XML: 1,052
Total: 5,138
BibTeX: 83
EndNote: 141

Views and downloads (calculated since 30 Jul 2024)

Month	HTML	PDF	XML	Total
Jul 2024	118	38	6	162
Aug 2024	472	206	14	692
Sep 2024	282	44	86	412
Oct 2024	254	78	8	340
Nov 2024	64	30	250	344
Dec 2024	40	28	296	364
Jan 2025	54	44	324	422
Feb 2025	90	46	14	150
Mar 2025	62	50	0	112
Apr 2025	50	20	0	70
May 2025	54	16	2	72
Jun 2025	66	46	2	114
Jul 2025	58	34	2	94
Aug 2025	140	40	2	182
Sep 2025	352	104	4	460
Oct 2025	82	70	2	154
Nov 2025	72	44	4	120
Dec 2025	100	74	0	174
Jan 2026	68	26	12	106
Feb 2026	114	48	10	172
Mar 2026	198	50	8	256
Apr 2026	50	30	5	85
May 2026	31	5	0	36
Jun 2026	8	3	0	11
Jul 2026	23	10	1	34

Cumulative views and downloads (calculated since 30 Jul 2024)

Month	HTML	PDF	XML	Total
Jul 2024	118	38	6	162
Aug 2024	472	206	14	692
Sep 2024	282	44	86	412
Oct 2024	254	78	8	340
Nov 2024	64	30	250	344
Dec 2024	40	28	296	364
Jan 2025	54	44	324	422
Feb 2025	90	46	14	150
Mar 2025	62	50	0	112
Apr 2025	50	20	0	70
May 2025	54	16	2	72
Jun 2025	66	46	2	114
Jul 2025	58	34	2	94
Aug 2025	140	40	2	182
Sep 2025	352	104	4	460
Oct 2025	82	70	2	154
Nov 2025	72	44	4	120
Dec 2025	100	74	0	174
Jan 2026	68	26	12	106
Feb 2026	114	48	10	172
Mar 2026	198	50	8	256
Apr 2026	50	30	5	85
May 2026	31	5	0	36
Jun 2026	8	3	0	11
Jul 2026	23	10	1	34

Viewed (geographical distribution)

Total article views: 5,130 (including HTML, PDF, and XML) Thereof 5,130 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 31 Jul 2026

Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Preprint (2807 KB)
Metadata XML

Short summary

Data-driven techniques have shown the potential to outperform process-based models in rainfall-runoff simulations. Hybrid models, combining both approaches, aim to enhance accuracy and maintain interpretability. Expanding the set of test cases to evaluate hybrid models under different conditions we test their generalization capabilities for extreme hydrological events.

Analyzing the generalization capabilities of hybrid hydrological models for extrapolation to extreme events

Journal article(s) based on this preprint

Interactive discussion

Interactive discussion

Peer review completion

Suggestions for revision or reasons for rejection

Suggestions for revision or reasons for rejection

Journal article(s) based on this preprint

Viewed

Viewed (geographical distribution)


Total:	0
HTML:	0
PDF:	0
XML:	0