PeatDepth-ML: A Global Map of Peat Depth Predicted using Machine Learning

Skye, Jade; Melton, Joe R.; Goldblatt, Colin; Gallego-Sala, Angela; Garneau, Michelle; Winton, Scott

doi:10.5194/egusphere-2025-5363

Preprints

https://doi.org/10.5194/egusphere-2025-5363

Preprints

18 Nov 2025

| 18 Nov 2025

Status: this preprint is open for discussion and under review for Biogeosciences (BG).

PeatDepth-ML: A Global Map of Peat Depth Predicted using Machine Learning

Jade Skye, Joe R. Melton, Colin Goldblatt, Angela Gallego-Sala, Michelle Garneau, and Scott Winton

Abstract. Peatlands are major carbon stores that are sensitive to climate change and increasingly affected by human activity. Accurate assessment of carbon stocks and modelling of peatland responses to future climate scenarios requires robust information on peat depth. We developed PeatDepth-ML, a machine learning framework that predicts global peat depths using a comprehensive database of peat depth measurements for training and validation. Building on an existing framework for mapping peatland extent, we incorporated new environmental datasets relevant to peat formation, revised cross-validation procedures, and introduced a custom scoring metric to improve predictions of deep peat deposits. To evaluate model sensitivity to sampling bias inherent in the training data, we applied a bootstrapping approach. Model performance, assessed using a blocked leave-one-out approach, yielded a root mean square error of 70.1 ± 0.9 cm and a mean bias error of 2.1 ± 0.7 cm, performing as well as or better than previously published models. The global map produced by PeatDepth-ML predicts a median peat depth of 134 cm (IQR: 87–187) over areas with more than 30 cm of peat. Like other regression-based models, PeatDepth-ML tended to predict toward mean training depths. An area of applicability analysis suggests the model has good applicability globally with the exception of some coastal and several mountainous regions like the Andes and the highlands of Borneo and New Guinea. Predictor selection was highly sensitive to training data subsets that arose from the bootstrapping approach, occasionally resulting in regional variations in accuracy. The bootstrapping approach and our area of applicability analysis thus clearly demonstrates the prime importance of quality training data in data-driven approaches like PeatDepth-ML. Using our predicted peat depth map, together with peatland extent and literature-derived estimates of bulk density and organic carbon content, we estimate global peat carbon stocks at 327–373 Pg C, consistent with previous global estimates.

Received: 29 Oct 2025 – Discussion started: 18 Nov 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Jade Skye, Joe R. Melton, Colin Goldblatt, Angela Gallego-Sala, Michelle Garneau, and Scott Winton

Status: open (until 24 Jan 2026)

Post a comment Subscribe to comment alert

RC1: 'Comment on egusphere-2025-5363', Anonymous Referee #1, 17 Dec 2025 reply

The authors present PeatDepth-ML, a machine-learning framework for predicting global peat depth using a large compilation of peat depth measurements and environmental covariates. They extend existing peatland mapping approaches by incorporating additional predictors, revised spatial cross-validation, a custom metric targeting deep peat, and a bootstrapping strategy to assess sensitivity to sampling bias. Model performance is evaluated with blocked leave-one-out validation, and the resulting global peat depth map is used to estimate global peat carbon stocks, which are found to be consistent with previous studies.
I think the work is relevant for the journal and generally well-executed, though I think some revisions are in order prior to publication. I will give detailed list of comments in the following. Thank you for your work.
Detailed comments:
Lines 49 and 66: "machine learning" --> use abbreviation "ML".
Line 92, Figure A1: I think Figure A1 is quite important, presenting the peat data distributions. Why not include it in main text instead of in appendix?
Line 97: "However, grid cells with zero peat depth consistently dominate..." --> explicitly state the percentage of zero peat depth as it is the substantial majority of the data. I think it is good to state as the data is quite, though naturally, imbalanced.
Line 185: "machine learning" --> "ML"
Line 189: What were the hyperparameters which were optimized? I did not see them listed.
Line 192: "cross validation" --> "cross-validation"
Line 205: "don't" --> "do not"
Line 209: Add reference for LightGBM, maybe also fully open up the term. Lets not assume reader knows all the abbreviations by default.
Line 247: Did you mention somewhere how many predictors you had in total available for the ML runs? I would be curious to know this.
Figure 8 and A1: I am not used to horizontal histograms or distributions being presented. Was there a particular reason for this? If not, why not use standard orientation in visualization (vertical bars), which, to my experience, is more common.
Figure A1 caption: extra whitespace before ".", "...desert data ."
Line 357: Open up the abbreviations, although well-known, the RMSE, MBE, NME. They are mentioned also in appendix more specifically, but good the clarify the abbreviations, once introduced.
Line 362: Could you please elaborate on the null models a bit. Do you mean baseline models? Also on same line, notice extra period ". ."
Line 370: "BLOOCV" Did you define this abbreviation, even though clear to myself. But still, define it earlier in the text when you mention cross-validation.
Figure 9: The legend is little bit unclear for me. What is "bootstrap results", what results? Maybe rephrase more clearly, if possible.

Reply

Citation: https://doi.org/10.5194/egusphere-2025-5363-RC1

Jade Skye, Joe R. Melton, Colin Goldblatt, Angela Gallego-Sala, Michelle Garneau, and Scott Winton

Data sets

Peat-DBase version 0.9 Jade Skye https://doi.org/10.5281/zenodo.15530645

Model code and software

PeatDepth-ML: Using Machine Learning to Predict a Global Map of Peat Depth Jade Skye et al. https://doi.org/10.5281/zenodo.15530817

Jade Skye, Joe R. Melton, Colin Goldblatt, Angela Gallego-Sala, Michelle Garneau, and Scott Winton

Viewed

Total article views: 361 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
223	114	24	361	13	16

HTML: 223
PDF: 114
XML: 24
Total: 361
BibTeX: 13
EndNote: 16

Views and downloads (calculated since 18 Nov 2025)

Month	HTML	PDF	XML	Total
Nov 2025	166	33	12	211
Dec 2025	57	81	12	150

Cumulative views and downloads (calculated since 18 Nov 2025)

Month	HTML	PDF	XML	Total
Nov 2025	166	33	12	211
Dec 2025	57	81	12	150

Viewed (geographical distribution)

Total article views: 356 (including HTML, PDF, and XML) Thereof 356 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 27 Dec 2025

Short summary

We developed PeatDepth-ML, a machine learning model predicting peat depth worldwide to help estimate carbon stocks in these climate-critical ecosystems. Our model predicts median depths of 134 cm in peatlands. Using bootstrapping, we rigorously assessed how sampling bias affects predictions. This revealed predictor selection and regional accuracy can vary greatly with different data subsets, demonstrating model reliability fundamentally depends on training data quality and geographic coverage.


Total:	0
HTML:	0
PDF:	0
XML:	0