Bayesian data selection to quantify the value of data for landslide runout calibration

Kumar, V Mithlesh; Yildiz, Anil; Kowalski, Julia

doi:10.5194/egusphere-2025-4531

Preprints

https://doi.org/10.5194/egusphere-2025-4531

Preprints

02 Oct 2025

| 02 Oct 2025

Status: this preprint is open for discussion and under review for Nonlinear Processes in Geophysics (NPG).

Bayesian data selection to quantify the value of data for landslide runout calibration

V Mithlesh Kumar, Anil Yildiz, and Julia Kowalski

Abstract. The reliability of physics-based landslide runout models depends on the effective calibration of its parameters, which are often conceptual and cannot be physically measured. Bayesian methods offer a robust framework to incorporate uncertainties in both model and observations into the calibration process. Therefore, they are increasingly used to calibrate physics-based landslide runout models. However, the practical application of Bayesian methods to real-world landslide events depends on the availability and quality of observational data, which determines the reliability of the calibration outcomes. Despite this, systematic investigation of the influence of observational data on the Bayesian calibration of landslide runout models has been limited.

We propose quantifying the impact of observational data on calibration outcomes by measuring the information gained during the calibration process using a decision-theoretic measure called Kullback-Leibler (KL) divergence. Building on this, we present a unified Bayesian data selection workflow to identify the most informative dataset for calibrating a given parameter. The workflow runs parallel calibration routines across available observation datasets. It then computes the information gained relative to the observations by calculating the KL divergence between prior and posterior distributions and selects the dataset that yields the highest KL divergence.

We demonstrate our workflow using an elementary landslide runout model, calibrating friction parameters with a diverse set of synthetic observations to evaluate the impact of data selection on parameter calibration. Specifically, we compare and quantify the information gained from calibration routines using observations with varying information content, i.e., velocity vs. position, and observations with different granularity, i.e., aggregated data vs. time series data. The insights from this study will optimize the use of available observations for calibration and guide the design of effective data acquisition strategies.

Received: 15 Sep 2025 – Discussion started: 02 Oct 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

V Mithlesh Kumar, Anil Yildiz, and Julia Kowalski

Status: open (until 27 Nov 2025)

Post a comment Subscribe to comment alert

RC1:
'Comment on egusphere-2025-4531', Reyko Schachtschneider, 04 Nov 2025 reply

The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-4531/egusphere-2025-4531-RC1-supplement.pdf
Reply

Citation: https://doi.org/10.5194/egusphere-2025-4531-RC1
- AC1: 'Reply on RC1', V Mithlesh Kumar, 10 Nov 2025 reply
  
  Thank you for your comments. We are currently addressing all the points you raised. One of your main concerns was the missing labels in several figures, and we appreciate you bringing this to our attention.
  
  The missing labels resulted from a rendering error in the preprint version rather than an issue with the submitted manuscript. The version we submitted on September 15, 2025, included all figure labels. However, on October 6, we noticed that the posted preprint had some labels missing. We contacted the editorial office, who confirmed that such rendering errors can occur and promptly restored the correct version the same day.
  
  The online preprint has displayed all figures correctly with labels since October 6, and no changes were made to the manuscript content.
  
  Thank you again for your careful review and attention to detail.
  
  Reply
  
  Citation: https://doi.org/10.5194/egusphere-2025-4531-AC1

V Mithlesh Kumar, Anil Yildiz, and Julia Kowalski

Data sets

Bayesian data selection to quantify the value of data for landslide runout calibration V Mithlesh Kumar https://doi.org/10.5281/zenodo.17120721

Interactive computing environment

Bayesian data selection to quantify the value of data for landslide runout calibration V Mithlesh Kumar https://doi.org/10.5281/zenodo.17120721

V Mithlesh Kumar, Anil Yildiz, and Julia Kowalski

Viewed

Total article views: 172 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
110	50	12	172	10	9

HTML: 110
PDF: 50
XML: 12
Total: 172
BibTeX: 10
EndNote: 9

Views and downloads (calculated since 02 Oct 2025)

Month	HTML	PDF	XML	Total
Oct 2025	76	27	7	110
Nov 2025	34	23	5	62

Cumulative views and downloads (calculated since 02 Oct 2025)

Month	HTML	PDF	XML	Total
Oct 2025	76	27	7	110
Nov 2025	34	23	5	62

Viewed (geographical distribution)

Total article views: 171 (including HTML, PDF, and XML) Thereof 171 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 23 Nov 2025

Short summary

The reliability of Bayesian calibration depends on the quality and availability of observational data. But are we choosing the right data? We address this question by measuring the information gained during calibration to quantify how data selection influences the Bayesian calibration of physics-based landslide runout models. We find that more data does not always yield better results – observations that capture the dynamics governed by a parameter are more effective for its calibration.


Total:	0
HTML:	0
PDF:	0
XML:	0