Preprints
https://doi.org/10.5194/egusphere-2025-5363
https://doi.org/10.5194/egusphere-2025-5363
18 Nov 2025
 | 18 Nov 2025
Status: this preprint is open for discussion and under review for Biogeosciences (BG).

PeatDepth-ML: A Global Map of Peat Depth Predicted using Machine Learning

Jade Skye, Joe R. Melton, Colin Goldblatt, Angela Gallego-Sala, Michelle Garneau, and Scott Winton

Abstract. Peatlands are major carbon stores that are sensitive to climate change and increasingly affected by human activity. Accurate assessment of carbon stocks and modelling of peatland responses to future climate scenarios requires robust information on peat depth. We developed PeatDepth-ML, a machine learning framework that predicts global peat depths using a comprehensive database of peat depth measurements for training and validation. Building on an existing framework for mapping peatland extent, we incorporated new environmental datasets relevant to peat formation, revised cross-validation procedures, and introduced a custom scoring metric to improve predictions of deep peat deposits. To evaluate model sensitivity to sampling bias inherent in the training data, we applied a bootstrapping approach. Model performance, assessed using a blocked leave-one-out approach, yielded a root mean square error of 70.1 ± 0.9 cm and a mean bias error of 2.1 ± 0.7 cm, performing as well as or better than previously published models. The global map produced by PeatDepth-ML predicts a median peat depth of 134 cm (IQR: 87–187) over areas with more than 30 cm of peat. Like other regression-based models, PeatDepth-ML tended to predict toward mean training depths. An area of applicability analysis suggests the model has good applicability globally with the exception of some coastal and several mountainous regions like the Andes and the highlands of Borneo and New Guinea. Predictor selection was highly sensitive to training data subsets that arose from the bootstrapping approach, occasionally resulting in regional variations in accuracy. The bootstrapping approach and our area of applicability analysis thus clearly demonstrates the prime importance of quality training data in data-driven approaches like PeatDepth-ML. Using our predicted peat depth map, together with peatland extent and literature-derived estimates of bulk density and organic carbon content, we estimate global peat carbon stocks at 327–373 Pg C, consistent with previous global estimates.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Jade Skye, Joe R. Melton, Colin Goldblatt, Angela Gallego-Sala, Michelle Garneau, and Scott Winton

Status: open (until 30 Dec 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Jade Skye, Joe R. Melton, Colin Goldblatt, Angela Gallego-Sala, Michelle Garneau, and Scott Winton

Data sets

Peat-DBase version 0.9 Jade Skye https://doi.org/10.5281/zenodo.15530645

Model code and software

PeatDepth-ML: Using Machine Learning to Predict a Global Map of Peat Depth Jade Skye et al. https://doi.org/10.5281/zenodo.15530817

Jade Skye, Joe R. Melton, Colin Goldblatt, Angela Gallego-Sala, Michelle Garneau, and Scott Winton
Metrics will be available soon.
Latest update: 18 Nov 2025
Download
Short summary
We developed PeatDepth-ML, a machine learning model predicting peat depth worldwide to help estimate carbon stocks in these climate-critical ecosystems. Our model predicts median depths of 134 cm in peatlands. Using bootstrapping, we rigorously assessed how sampling bias affects predictions. This revealed predictor selection and regional accuracy can vary greatly with different data subsets, demonstrating model reliability fundamentally depends on training data quality and geographic coverage.
Share