the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Technical Note: Cluster Analysis of Inverse Thermochronology Models
Abstract. Thermochronological inverse modeling may produce none-unique solution, that can group different thermal histories. Objective identification of such groups, also referred as "path families", is challenging and greatly benefits from dimension-reducing exploratory data analysis tools. This article proposes a statistical algorithm to overcome these challenges. We show that Hausdorff and Frechet distances are viable dissimilarity measures for ordered point sets, such as time-temperature paths. Clustering the pairwise dissimilarities between modeled thermal histories reveals distinct groups of thermal histories for a given sample or set of samples. As demonstrated by clustering a natural example, automated path-clustering allows for an objective and reproducible interpretation and maybe particularly useful for samples with poor prior knowledge of the time-temperature history. To allow adoption of the method by the thermochronology community, the methods introduced in this article are freely available through the package software thermoclustr, written in the programming language R.
- Preprint
(7097 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
- RC1: 'Comment on egusphere-2026-1279', Kalin McDannell, 21 Apr 2026
-
RC2: 'Comment on egusphere-2026-1279', Anonymous Referee #2, 28 Apr 2026
“Technical Note: Cluster Analysis of Inverse Thermochronology Models” by Tobias Stephan and co-authors is an important contribution that could be published with some minor revisions. It is also a clear and logical step to apply clustering algorithms in place of manual selection of paths. In particular, there are lots of grammatical errors that should be easy to fix. Additional references are required and a longer discussion about the potential issues would be helpful.
There are lots of typos in the document. For example, the first sentence of the abstract has errors. “Abstract. Thermochronological inverse modeling may produce none-unique solution, that can group different thermal histories.” First Thermochronological inverse modelling is not limited to thermal history modelling. There are now lots of tools used to estimate different parameters from thermochron data. These include tools (e.g., Pecube, GLIDE) designed to extract exhumation rates, landscape change parameters or other things from thermochron data. It should be non-unique not “none-unique”. It should be solutions not solution. The second clause doesn’t make sense. There are lots of errors like these throughout the manuscript that need to be fixed.
My second concern, is that the example in Figure 6 shows that there is essentially no difference between the paths where there is resolution. In the time interval from 0 to 98 Ma, which is the time between the mean AFT age and today, and in the temperature interval from 100 degrees to 0, which is the temperature sensitivity of the data, there is no difference between the clusters. In other words, the differences between the clusters is all where there is no resolution any way, and the clusters all overlap showing that no distinct clusters have been identified. The data constrain cooling from 100 Ma and 100C to the present and the only differences between the clusters is due to the requirement of the clustering algorithms to cluster the data. If the parameter space was explored more fully, potentially there would be no distinct clusters unless the number of clusters was defined. In this way, I think the algorithm gives the impression that there are distinct clusters when really there should not be. Please add a box showing where the models are well resolved.
The aim to pull apart a continuous and smoothly varying distribution of paths, into a predefined number of clusters is worrying because an apparent increase in model resolution is achieved. These issues should be carefully discussed. For this reason, the manual selection of path families should probably be preferred over automatic detection of clusters that may not be very useful. This is reinforced by the discussion in Section 4. If you know that the sample had to be at 230C at 350 MA, why not find a path family that goes through this constraint. Or just add a constraint box?
It is also important to cite other work that looks at clustering of paths. A very relevant paper to cite here would be Willett (Willett, S.D., 1997. Inverse modeling of annealing of fission tracks in apatite; 1, A controlled random search method. American Journal of Science, 297(10)) who use a model similar to HEFTY to find thermal histories consistent with AFT data. He then used clustering analysis to determine how well resolved the timing of reheating is. In Fox and Shuster (Fox, M. and Shuster, D.L., 2014. The influence of burial heating on the (U–Th)/He system in apatite: Grand Canyon case study. Earth and Planetary Science Letters, 397, pp.174-183.), the authors pulled apart a distribution of likely thermal histories to reveal how different parts of a thermal history are correlated. This was further developed in Fox and Carter (Fox, M. and Carter, A., 2020. Heated topics in thermochronology and paths towards resolution. Geosciences, 10(9), p.375.) where they did this for a result from QTQT to determine marginal conditional posterior probabilities.
Citation: https://doi.org/10.5194/egusphere-2026-1279-RC2
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 468 | 226 | 34 | 728 | 36 | 43 |
- HTML: 468
- PDF: 226
- XML: 34
- Total: 728
- BibTeX: 36
- EndNote: 43
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
Stephan et al. present an automated statistical framework for clustering time-temperature paths (so-called "path families") from thermochronological inverse modeling, implemented in the R package thermoclustr. While the technical execution is competent and the software may have practical utility, the paper rests on a premise that warrants closer scrutiny. The authors treat distinct path clusters as geologically meaningful thermal history scenarios, but the 'dispersed' ensemble of acceptable t-T paths (produced by the HeFTy software) exists precisely because the thermochronological data cannot distinguish between them—they are all valid solutions within data uncertainty. Clustering a t-T ensemble does not necessarily reveal meaningful geological signal; it simply partitions solutions that the data are fundamentally unable to distinguish due to poor resolution. The non-uniqueness of inverse thermal history models is a statement about the resolving power of the data, and subdividing the acceptable solution space into groups does not change that fundamental limitation. A secondary but related concern is that path geometry is often already heavily structured by user-defined t-T constraints or "exploration boxes", meaning that recovered clusters for many inversions (especially where data sensitivity is low or absent) largely reflect the topology of modeling choices rather than nuanced thermal history signal.
The natural example presented in Section 4 inadvertently highlights some of these limitations. Three clusters are recovered from the HeFTy inversion of data for sample 112-73, a Paleoproterozoic granodiorite from the southwestern Northwest Territories overlain by Devonian strata of the Western Canada Sedimentary Basin (truncated model shown in Figure 6). All three clusters represent acceptable fits to the same thermally reset apatite (U-Th)/He and fission-track data, yet they show different burial and exhumation histories distinguished by heating rates, peak temperatures, and cooling onset. Rather than the clustering algorithm identifying the geologically meaningful solution, the authors sequentially reject Clusters 1 and 2 by appealing to regional stratigraphic constraints from Morrow et al. (1993) and t-T constraints from nearby Mississippi Valley-type mineralization, ultimately endorsing Cluster 3 as the most plausible thermal history. This is the interpretive workflow that geoscientists already apply when evaluating inverse t-T modeling results. The clustering step alone added little discriminating power and the plausible thermal history was identified by geological reasoning, not by the algorithm. Unfortunately, without a clearer discussion of when clustering yields genuinely interpretable results versus when it simply reflects data or modeling limitations, the method risks being applied uncritically in contexts where the data do not support the level of thermal history discrimination that clustering implies.