Preprints
https://doi.org/10.5194/egusphere-2025-526
https://doi.org/10.5194/egusphere-2025-526
28 Feb 2025
 | 28 Feb 2025
Status: this preprint is open for discussion and under review for Biogeosciences (BG).

Utilizing Probability Estimates from Machine Learning and Pollen to Understand the Depositional Influences on Branched GDGT in Wetlands, Peatlands, and Lakes

Amy Cromartie, Cindy De Jonge, Guillemette Ménot, Mary Robles, Lucas Dugerdil, Odile Peyron, Marta Rodrigo-Gámiz, Jon Camuera, Maria Jose Ramos-Roman, Gonzalo Jiménez-Moreno, Claude Colombié, Lilit Sahakyan, and Sébastien Joannin

Abstract. Branched glycerol dialkyl glycerol tetraethers (brGDGTs) serve as critical molecular biomarkers for the quantitative reconstruction of past environments, ambient temperature and pH across various archives. Despite their success, numerous issues persist that limit their application. The distribution of brGDGTs varies significantly based on provenance, resulting in biases in environmental reconstructions that rely on fractional abundances and derived indices, such as the MBT’5ME. This issue is especially significant in shallow lakes, wetlands, and peatlands within semi-arid and arid regions, where ecosystems are sensitive to diverse environmental and climatic factors. Recent advancements, such as machine learning techniques, have been developed to identify changes in sources; however, these techniques are insufficient for detecting mixed source environments. The probability estimates derived from five machine learning algorithms are employed here to detect provenance changes in brGDGT downcore records and to identify periods of mixed provenance. A new global modern database (n=2301) was compiled to train, validate, test, and apply these algorithms to two sedimentary records. Our findings are corroborated by pollen and non-pollen palynomorphs obtained from the identical records. These microfossil proxies are utilized to discuss changes in provenance, hydrology, and ecology that influence the distribution of brGDGTs. Probability estimates derived from Random Forest with a sigmoid calibration are most effective in detecting changes in brGDGT distribution. Minor changes in the relative contributions of brGDGTs provenance can significantly influence the distribution of brGDGTs, especially regarding the MBT'5ME index. This study introduces a novel brGDGT wetland index aimed at monitoring potential biases arising from wetland development.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Share
Download
Short summary
BrGDGT are a molecular biomarker utilized for paleotemperature reconstructions. One issue,...
Share