Preprints
https://doi.org/10.31223/X5TM02
https://doi.org/10.31223/X5TM02
14 Nov 2022
 | 14 Nov 2022

Comparing detrital age spectra, and other geological distributions, using the Wasserstein distance

Alex Lipp and Pieter Vermeesch

Abstract. Distributional data such as detrital age populations or grain size distributions are common in the geological sciences. As analytical techniques become more sophisticated, increasingly large amounts of distributional data are being gathered. These advances require quantitative and objective methods, such as multidimensional scaling (MDS), to analyse large numbers of samples. Crucial to such methods is choosing a sensible measure of dissimilarity between samples. At present, the Kolmogorov-Smirnov (KS) statistic is the most widely used of these dissimilarity measures. However, the KS statistic has some limitations. It is very sensitive to differences between the modes of two distributions, and relatively insensitive to differences between their tails. Here we introduce the Wasserstein-2 distance (W2) as an alternative to address this issue. Whereas the KS-distance is defined as the maximum vertical distance between two empirical cumulative distribution functions, the W2-distance is a function of the horizontal distances (i.e., age differences) between individual observations. Using a combination of synthetic examples and a published zircon U-Pb dataset, we show that the W2 distance produces similar MDS results to the KS-distance in most cases, but significantly different results in some cases. Where the results differ, the W2 results are geologically more sensible. For the case study, we find that the MDS map that is produced using W2 can be readily interpreted in terms of the shape and average age of the age spectra. The W2-distance has been added to the R package IsoplotR, for immediate use in detrital geochronology and other applications. The W2 distance can be generalised to multiple dimensions, which opens opportunities beyond distributional data.

Journal article(s) based on this preprint

17 May 2023
Short communication: The Wasserstein distance as a dissimilarity metric for comparing detrital age spectra and other geological distributions
Alex Lipp and Pieter Vermeesch
Geochronology, 5, 263–270, https://doi.org/10.5194/gchron-5-263-2023,https://doi.org/10.5194/gchron-5-263-2023, 2023
Short summary

Alex Lipp and Pieter Vermeesch

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
ED: Publish subject to revisions (further review by editor and referees) (10 Feb 2023) by Michael Dietze
AR by Alex Lipp on behalf of the Authors (13 Mar 2023)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (14 Mar 2023) by Michael Dietze
RR by Joel Saylor (20 Mar 2023)
RR by Anonymous Referee #1 (27 Mar 2023)
ED: Publish subject to revisions (further review by editor and referees) (28 Mar 2023) by Michael Dietze
AR by Alex Lipp on behalf of the Authors (03 Apr 2023)  Author's response   Author's tracked changes   Manuscript 
ED: Publish as is (17 Apr 2023) by Michael Dietze
ED: Publish as is (18 Apr 2023) by Klaus Mezger (Editor)
AR by Alex Lipp on behalf of the Authors (25 Apr 2023)  Manuscript 

Journal article(s) based on this preprint

17 May 2023
Short communication: The Wasserstein distance as a dissimilarity metric for comparing detrital age spectra and other geological distributions
Alex Lipp and Pieter Vermeesch
Geochronology, 5, 263–270, https://doi.org/10.5194/gchron-5-263-2023,https://doi.org/10.5194/gchron-5-263-2023, 2023
Short summary

Alex Lipp and Pieter Vermeesch

Alex Lipp and Pieter Vermeesch

Viewed

Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.

Total article views: 287 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
287 0 0 287 0 0
  • HTML: 287
  • PDF: 0
  • XML: 0
  • Total: 287
  • BibTeX: 0
  • EndNote: 0
Views and downloads (calculated since 14 Nov 2022)
Cumulative views and downloads (calculated since 14 Nov 2022)

Viewed (geographical distribution)

Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.

Total article views: 281 (including HTML, PDF, and XML) Thereof 281 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 

Cited

Latest update: 07 Oct 2023
Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Short summary
The Wasserstein distance is shown to be an appropriate dissimilarity metric for comparing distributional data such as detrital mineral ages. Using synthetic and real data we compare the Wasserstein distance to the commonly used Kolmogorov-Smirnov distance. The results are, in general, similar, but where they differ the Wasserstein distance is found to have more geologically sensible results. Code required to calculate the Wasserstein distance between distributions is provided in python and R.