Advancing Last Glacial Maximum paleoclimate reconstructions in Europe using pollen data: a multi-method (mega)biomization approach
Abstract. Pollen records are one of the most spatially and temporally resolved proxies for reconstructing past vegetation dynamics, environmental changes and climate variability. Over the past decade, a large variety of methods based on different ecological or mathematical concepts has been used to reconstruct paleoclimatic conditions from pollen assemblages. However, the accuracy of these climate reconstructions strongly depends on the choice of the modern calibration dataset, the taxonomic resolution, and/or the modelling assumptions. The lack of a univocal response still limits the application of pollen-based climate reconstructions to assess key climate changes over multiple time periods especially during the Last Glacial Maximum (LGM, ~23–19 kyr BP). Here, we present a multi-method approach, including the Modern Analogue Technique (MAT), the Weighted Averaging Partial Least Squares regression (WA-PLS) and the probability density function-based Climate REconstruction SofTware (CREST), to reconstruct European climates during the LGM. The quality and performance of our climate reconstructions show strong heterogeneity when based on large calibration datasets encompassing wide climatic and vegetation gradients, making local sampling for climate reconstructions difficult. Instead of sampling the global calibration dataset, we test the effect of the latest biomization and megabiomization methods (local calibrations based on megabiome procedures) on climate reconstructions by introducing a new biome-based approach. Unlike previous studies, we use the weighted mean of climate variables from all megabiome scores rather than only considering the dominant (i.e., highest score) megabiome. This significantly reduces some of the statistical noise of climate reconstructions, drastically minimizing threshold and non-linear effects associated with megabiome classification changes. With these methodological advancements and our multi-method comparison, we evaluate the uncertainties (RMSEP) of the paleoclimate reconstructions for the LGM in Europe. Across climate reconstruction methods (MAT, WA-PLS and CREST methods), European LGM annual temperatures from the biomization method were on average 6.7±2.2 °C (mean SD) colder than today, consistent with megabiomization results (7.4±2.3 °C colder). Winter temperature (mean temperature of the coldest month, MTCO) results exhibit substantial spatial variability across Europe. Local calibration techniques significantly reduce uncertainties in LGM MTCO reconstructions, but they remain highly sensitive to the choice of calibration datasets.
Competing interests: At least one of the (co-)authors is a member of the editorial board of Climate of the Past.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
## Overview
This ms reconstructs LGM climate from pollen using a new method that begins with a biomisation step, followed by reconstructions based on modern samples from that each biome which are weighted according to the biome score to give the final reconstruction.
I can understand the motivation for the proposed method.
Reconstructions based on continental scale calibration sets can give poor reconstructions, perhaps because of large secondary climate gradients, perhaps because taxa with wide climatic tolerances create bad analogues.
One option would be to restrict the transfer function to base reconstructions only on calibration set samples from the same biome as the fossil sample. However uncertainty in assigning the fossil sample to a biome would lead to biases and instability in reconstructions from adjacent samples assigned to different biomes because of small differences in pollen assemblage composition.
The proposed method is analogous to model averaging with AIC weights.
However, the AIC weights are the probability that each model is the best of a set of candidate models.
Biome scores (even after transformation to have a unit sum) are not the probability of each biome occurring, so the weighted sum of reconstructions based on each biome may be a biased estimate of the climatic conditions. (If the authors believe biome affinity scores can be transformed into probabilities, they need to fully justify this).
It might be worth trying the biomisation method devised by Cruz-Silva et al (2022; https://doi.org/10.1111/jbi.14448) who report that their method gives approximate probabilities of a pollen sample belonging to each biome.
## Cross-validation
The methods, should be reorganised to start with the modern and then fossil pollen data, then the biomisation, followed by the transfer function methods. In the results, the cross-validation preformance should be shown before the LGM reconstructions.
The cross-validation scheme should mimic the analysis used to reconstruct climate from the fossil data.
As far as I can tell from the description and partial code (see below), this has not been done.
The past climate estimated as the weighted mean of the reconstructions based on each biome, whereas the cross-validation is based only on one biome.
This likely biases the model performance.
Once consequence of this is that the uncertainties reported for the reconstructions are likely to be incorrect.
## Modern biomes
I'm surprised to see that the biomisation scheme finds temperate forest on Svalbard. There are no trees on Svalbard; even _Betula nana_ is scarce.
The sample is Lomonosovfonna, this is a glacier sample, not a lake as the data report (the previous sample Vuoskkujávri is a lake not a glacier, and its elevation is wrong - please check if the data have further issues). Little of the pollen at Lomonosovfonna is from local sources (see the original paper), most has been transported from northern Scandinavia or further. Such extreme samples should probably be excluded to avoid them biasing the results. It should also be excluded because it, like a few other samples, has a very low pollen count (27 grains), and such low count sums cannot be expected to give a reliable reconstruction.
I'm also surprised that most of the Scandinavian + Western Siberian modern megabiomes are marked as temperate forest, with a few tundra locations.
This contradicts Dallmeyer et al 2019 which limits temperate forest to about 60˚N in Scandinavia + Siberia with boreal forest covering most of the area. The ms should, at a minimum, explain why it extends temperate forest into the spruce and pine dominated northern forests. It's possible that there is an error somewhere in the analysis, the authors should make sure it does not also affect the LGM reconstructions
To try to better understand what the authors have done, I looked at the provided R code.
The code is not easy to read, having few comments and not being well formatted (the R package `styler` can ameliorate this latter problem instantly), and does not run as it is cannot find the files it is trying to import.
This is not simply a working directory problem; the files are missing.
The problems seems to be that the provided code is only a fragment of the total code used.
Some other code has been used to pre-process the provided raw data and split it into a multitude of files.
This code should also be provided, as well the code use to make the maps, figures, tables etc.
The code should be possible for the reader to run once and get all the results.
The current code appears to need running repeatedly with different lines commented out. I would recommend making functions to encapsulate the logic so it can be run repeatedly.
The biomisation code used is sufficient, but makes no check that pollen types are assigned to plant functional types. If pollen types are not assigned, the biomisation procedure will be biased.
## Citations and bibliography
The citations and bibliography need more attention.
I checked two authors and found errors with both.
The ms cites:
- Birks et al., 2004 - perhaps Birks and Seppä, 2004
- Birks et al., 1005 - obviously wrong year
- Birks and Seppä, 2010 - wrong year
- Ter Braak and Looman, 1986 - not in bibliography (I also don't think this paper, written several years before the development of WAPLS, should be used to describe WAPLS).
The number of errors suggests that the authors are not using citation management software: zotero, jabref, endnote - take your pick, all are better than writing references by hand.
## Figures
Figures 1 & 2 have STEP in yellow and DESE in beige.
Whereas figure 3 has STEP in beige, SAVA in yellow and no DESE.
Obviously, there should be a consistent colour scheme and choice of megabiomes.
I find the yellow and beige difficult to distinguish in figure 3 because of the low contrast.
Figure 3 just shows the dominant biomes. Would it be possible to make, at least for some lakes a plot of biome affinity against time, especially for the LGM.
Figure 4 seems to show that few of the samples from Lac du Bouchet (please don't switch from Lac to Lake) have highest biome affinity to temperate forest during the LGM. I'm not sure which of the site on fig 3 is Lac du Bouchet, but many of the sites show a lot of temperate forest, even near the glacial limits. I'm missing the information I need to understand this surprising result, and really don't want to have to run the partial code in its current state.
Appendix 1C I would recommend a continuous colour scale for this plot. I doubt the current colour scale is particularly accessible for people with colour deficient vision.
Appendix 4 Why are some points outlined in black and others not?
I don't see a red contour.
## Mostly Minor points
I'm sure the two parts of table 2 can be merged.
241. All the methods used here could be described as statistical.
524 "WGS84 projection" the projection used to plot the data is not especially relevant (and figure 2 is not a WGS84 projection).
544 "The lack of data in Eastern Europe results in an underrepresentation of cold modern climate conditions." I'm no longer sure what is meant by Eastern Europe. Its a term with a diverging geographic and political meanings. What I think of as Eastern Europe, isn't particularly cold compared to Finland. Perhaps Western Siberia instead.
603 I have failed to parse this long sentence.
560 "biomization also helps reduce the impact of the non-analogue effect" This statement needs supporting evidence.
736 "increase" - from the context, I was expecting "decease", but see above comments on the cross-validation scheme.
Table 3 would be more useful in an appendix. At the moment it is very large, and will be difficult to extract data from it.
865 I suspect the seasonal anomalies have little skill. This doubt could be assuaged by showing that this method has skill in a robust cross-validation analysis. It would probably help to explain how a seasonal anomaly should be interpreted - positive anomalies indicate increased seasonality wrt modern?
952 Any discussion of the apparently good performance of MAT without mentioning that MAT is extremely susceptible to spatial autocorrelation and its performance statistics can be very misleading is flawed.
1077 "performs well for pollen taxa with low taxonomic resolution". This statement needs supporting evidence.
The authorship contribution statement is inadequate, describing only the writing and editing stages. Please see the CREDIT taxonomy for other roles which can be recorded.