Selecting a conceptual hydrological model using Bayes' factors computed with Replica Exchange Hamiltonian Monte Carlo and Thermodynamic Integration

Mingo, Damian N.; Nijzink, Remko; Ley, Christophe; Hale, Jack S.

doi:https://doi.org/10.5194/egusphere-2023-2865

Preprints

08 Jan 2024

Abstract. We develop a method for computing Bayes’ factors of conceptual rainfall-runoff models based on thermodynamic integration, gradient-based replica-exchange Markov Chain Monte Carlo algorithms and modern differentiable programming languages. We apply our approach to the problem of choosing from a set of conceptual bucket-type models with increasing dynamical complexity calibrated against both synthetically generated and real runoff data from Magela Creek, Australia. We show that using the proposed methodology the Bayes factor can be used to select a parsimonious model and can be computed robustly in a few hours on modern computing hardware. We introduce formal posterior predictive checks for the selected model. The prior calibrated posterior predictive p-value, which also tests for prior data conflict, is used for the posterior predictive checks. Prior data conflict is when the prior favours parameter values that are less likely given the data.
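
As a rough illustration of the thermodynamic integration identity behind the abstract, log p(y) = ∫₀¹ E_β[log p(y | θ)] dβ, the sketch below estimates a log marginal likelihood for a toy Gaussian model and compares it with the closed-form value. Everything in it is an assumption made for illustration and is not taken from the preprint: the toy model (y_i ~ N(θ, σ²) with prior θ ~ N(0, τ²)), the function names, the power-law temperature schedule, and the use of exact draws from each power posterior in place of the replica-exchange Hamiltonian Monte Carlo sampler, which a real hydrological model would need.

```python
import numpy as np


def log_likelihood(theta, y, sigma):
    """Gaussian log-likelihood log p(y | theta), one value per entry of theta."""
    resid = y[None, :] - theta[:, None]
    return (-0.5 * y.size * np.log(2.0 * np.pi * sigma**2)
            - 0.5 * np.sum(resid**2, axis=1) / sigma**2)


def power_posterior_draws(beta, y, sigma, tau, n_draws, rng):
    """Exact draws from the power posterior p(y | theta)^beta * p(theta).

    For this conjugate toy model the power posterior is Gaussian, so no
    MCMC is needed (assumption: stands in for the sampler used in practice).
    """
    precision = 1.0 / tau**2 + beta * y.size / sigma**2
    mean = beta * y.sum() / sigma**2 / precision
    return rng.normal(mean, np.sqrt(1.0 / precision), size=n_draws)


def log_evidence_ti(y, sigma, tau, n_temps=40, n_draws=20000, seed=0):
    """Thermodynamic integration estimate of log p(y)."""
    rng = np.random.default_rng(seed)
    # Hypothetical schedule: temperatures clustered near beta = 0, where
    # the integrand changes fastest.
    betas = np.linspace(0.0, 1.0, n_temps + 1) ** 5
    expected_ll = np.array([
        log_likelihood(
            power_posterior_draws(b, y, sigma, tau, n_draws, rng), y, sigma
        ).mean()
        for b in betas
    ])
    # Trapezoidal rule over the temperature ladder.
    return float(np.sum(0.5 * (expected_ll[1:] + expected_ll[:-1]) * np.diff(betas)))


def log_evidence_exact(y, sigma, tau):
    """Closed-form log p(y): y ~ N(0, sigma^2 I + tau^2 * ones * ones^T)."""
    n = y.size
    log_det = (n - 1) * np.log(sigma**2) + np.log(sigma**2 + n * tau**2)
    quad = (np.sum(y**2) - tau**2 * y.sum()**2 / (sigma**2 + n * tau**2)) / sigma**2
    return float(-0.5 * n * np.log(2.0 * np.pi) - 0.5 * log_det - 0.5 * quad)


# Synthetic data for the toy model (not the Magela Creek data).
y_obs = np.random.default_rng(1).normal(1.0, 1.0, size=20)
print("TI estimate:", log_evidence_ti(y_obs, sigma=1.0, tau=1.0))
print("Exact value:", log_evidence_exact(y_obs, sigma=1.0, tau=1.0))
```

A log Bayes factor between two candidate models would then be the difference of their two log-evidence estimates; in this sketch the candidates could be as simple as two different prior widths `tau`.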

Received: 30 Nov 2023 – Discussion started: 08 Jan 2024

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Status: final response (author comments only)

RC1: 'Comment on egusphere-2023-2865', Anonymous Referee #1, 03 Apr 2024

The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2024/egusphere-2023-2865/egusphere-2023-2865-RC1-supplement.pdf

Citation: https://doi.org/10.5194/egusphere-2023-2865-RC1
- AC1: 'Reply on RC1', Damian Mingo Ndiwago, 08 Apr 2024
  
  We would like to thank the first anonymous reviewer for their thorough and insightful feedback on our manuscript. The reply is attached as a supplement.
  
  Citation: https://doi.org/10.5194/egusphere-2023-2865-AC1
RC2: 'Comment on egusphere-2023-2865', Georgios Boumis, 14 May 2024

I have now finished reviewing the work by Mingo et al. The authors have combined Replica-Exchange Hamiltonian Monte Carlo (HMC) with Thermodynamic Integration in order to do Bayesian inference for the parameters of a conceptual hydrologic model while simultaneously computing the marginal likelihood of the model; the latter facilitates model inter-comparison via the Bayes Factor (BF). In general, the manuscript is well written and is novel in the sense that the proposed algorithm has never been applied before to hydrological modeling. As a result, I am overall positive! However, I think the manuscript would benefit from a more in-depth discussion (possibly toward the end of the article) about the scientific problem that the authors address, its limitations, and some possible alternatives.
In light of the extensive comments (major and editorial) of Reviewer #1, with which I completely agree, I would like to raise some concerns about the usefulness of BF as a hydrologic model inter-comparison metric. Please see my comments below:
1. For the synthetic experiments, Tables 4 and 5 show that both DIC and WAIC could correctly indicate the data-generating model, i.e., M2 and M3, respectively. For the average reader, this might practically mean that we do not need BF as an additional metric to "tell" us which model to choose. Please provide an explanation to show why employing BF matters. If you cannot demonstrate that the BF can capture the true underlying model while the other, simpler metrics cannot, then it is hard to justify your analysis.
2. Although I am not a Hydrologist myself, I have a hard time understanding the usefulness of BF within the context of hydrologic model comparison. Traditional hydrologists calibrate models using algorithms like Shuffled Complex Evolution (SCE) based on optimization of a deterministic metric, e.g., NSE. I do understand that Bayesian inference of hydrologic model parameters, on the other hand, is appealing because it naturally provides a measure of uncertainty, which is always important. But the BF provides no pragmatic information to the modeler as to which model is performing better. For example, one would still have to compute NSE or KGE for all models M2, M3, and M4 for the real-world data (Table 8) to get an idea of what's happening. On the contrary, I would argue that for conceptual hydrologic models, which are neither computationally demanding nor time-intensive, likelihood-free methods like Approximate Bayesian Computation (ABC) might be more suitable for model comparison, as the posterior distributions of parameters for different models are obtained on the basis of a distance metric that is actually useful to the modeler, e.g., NSE, KGE, or even a metric tailored only to river discharge peaks!
Again, I am positive about your article and I believe it should be considered for publication, but please provide a better discussion about the practical use of BF as a hydrologic model comparison metric.

Citation: https://doi.org/10.5194/egusphere-2023-2865-RC2
- AC2: 'Reply on RC2', Damian Mingo Ndiwago, 22 May 2024
  
  We would like to thank the second reviewer for their thoughtful comments. We will address their specific comments in this response and move towards a final response in the coming weeks. We are also more than happy to discuss specific points with the reviewer again. The response is attached.
  
  Citation: https://doi.org/10.5194/egusphere-2023-2865-AC2

Data sets

Magela Creek data (precipitation, discharge, potential evapotranspiration, temperature) D. N. Mingo and Jack S. Hale https://doi.org/10.5281/zenodo.10202093

Model code and software

Selecting a conceptual hydrological model using Bayes' factors Damian N. Mingo and Jack S. Hale https://doi.org/10.5281/zenodo.10202093

Viewed

Total article views: 489 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
351	109	29	489	18	13

Views and downloads (calculated since 08 Jan 2024)

Month	HTML	PDF	XML	Total
Jan 2024	86	18	3	107
Feb 2024	20	8	1	29
Mar 2024	35	11	2	48
Apr 2024	76	32	9	117
May 2024	62	20	5	87
Jun 2024	58	14	4	76
Jul 2024	14	6	5	25

Viewed (geographical distribution)

Total article views: 484 (including HTML, PDF, and XML) Thereof 484 with geography defined and 0 with unknown origin.

Latest update: 26 Jul 2024

Short summary

Hydrologists are often faced with selecting amongst a set of competing models that differ in their number of parameters and in their ability to fit the available data. The Bayes’ factor is a tool that can be used to compare models; however, it is very difficult to compute numerically. In our paper we explore and develop highly efficient algorithms for computing the Bayes’ factor of hydrological systems, which will bring this useful tool for selecting models to everyday hydrological practice.

