A gradient-boosted tree framework to model the ice thickness of the World's glaciers (IceBoost v1)

Maffezzoli, Niccolò; Rignot, Eric; Barbante, Carlo; Petersen, Troels; Vascon, Sebastiano

doi:https://doi.org/10.5194/egusphere-2024-2455

Preprints

https://doi.org/10.5194/egusphere-2024-2455

Preprints

23 Sep 2024

| 23 Sep 2024

A gradient-boosted tree framework to model the ice thickness of the World's glaciers (IceBoost v1)

Niccolò Maffezzoli, Eric Rignot, Carlo Barbante, Troels Petersen, and Sebastiano Vascon

Abstract. Knowledge of glacier ice volumes is crucial for constraining future sea level potential, evaluating freshwater resources, and assessing impacts on societies, from regional to global. Motivated by the disparity in existing ice volume estimates, we present IceBoost, a global Machine Learning framework to model individual glacier ice thickness distributions. IceBoost is an ensemble of two gradient-boosted trees trained with 3.7 million globally-available ice thickness measurements and an array of 34 numerical features. The model error is similar to existing models outside polar regions and up to 30–40 % lower at high latitudes. Providing supervision by exposing the model to available glacier thickness measurements reduces the error by up to a factor 2 to 3. A feature ranking analysis reveals that geodetic information are the most informative variables, while ice velocity can improve the model performance by 6 % at high latitudes. A major feature of IceBoost is a capability to generalize outside the training domain, i.e. producing meaningful ice thickness distributions in all regions of the World, including in the ice sheet peripheries.

Received: 03 Aug 2024 – Discussion started: 23 Sep 2024

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Download & links

Preprint (PDF, 10435 KB)

Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
Preprint (10435 KB)

Download & links

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Journal article(s) based on this preprint

09 May 2025

A gradient-boosted tree framework to model the ice thickness of the world's glaciers (IceBoost v1.1)

Niccolò Maffezzoli, Eric Rignot, Carlo Barbante, Troels Petersen, and Sebastiano Vascon

Geosci. Model Dev., 18, 2545–2568, https://doi.org/10.5194/gmd-18-2545-2025,https://doi.org/10.5194/gmd-18-2545-2025, 2025

Short summary

Niccolò Maffezzoli, Eric Rignot, Carlo Barbante, Troels Petersen, and Sebastiano Vascon

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2024-2455', Anonymous Referee #1, 02 Dec 2024

See attachment.

Citation: https://doi.org/10.5194/egusphere-2024-2455-RC1
- AC2: 'Reply on RC1-RC2-Editor', Niccolò Maffezzoli, 17 Jan 2025
  
  We would like to thank the two Reviewers and the Editor for all their constructive comments and for their time to review the manuscript.
  Attached is our combined replies to Reviewer no. 1, Reviewer no.2, and to the Editor's comments.
  
  Citation: https://doi.org/10.5194/egusphere-2024-2455-AC2
RC2:
'Comment on egusphere-2024-2455', Anonymous Referee #2, 09 Dec 2024
The paper presents a machine learning framework to model individual glacier thicknesses based on global glacier inventory data, supplemented with other data. The paper is motivated by differences in estimated ice volume between current physics-based models. The results obtained in the paper are promising and appears to improve the ice thickness estimates compared to the existing models. Estimating the World’s total glacier volume is relevant to projections of ice mass loss and sea level rise, and making use of machine learning to address this task is interesting and timely.
However, while the results are promising, the presentation needs to be improved. The method is not sufficiently explained but it is clear that it builds on knowledge from previous work and physics-based knowledge. The results needs to be systematically assessed and limitations discussed in more detail. Including these comments and arguments will increase confidence in the results and improve the impact. These points are explained below.
I have several concerns with the paper:
The selection of features used to train the IceBoost are not substantiated and explained. The datasets used to produce the features are from well know data repositories or data products, and clearly referenced, but it is not explained how the features in Table 1 were chosen. For example, how are the three aspect features defined, and what is the basis for the selected smoothed slopes? Were other smoothings and other variables investigated, but not included in the analysis in the end?

Furthermore, the main text does not define or describe the features, but the reader is referred to Appendix A and B. This makes the presentation of the model to appear rather superficial and makes it impossible to read Table 1, as well as understanding the discussions later. I suggest that some of Appendix A and B is included in the main text, so the presentation of the model appears clear and self-contained.

This brings me to the next point, which is that the physical basis for the selected features is not described or commented on. I acknowledge that the machine learning method is not based on physical relationships, but the selection of the features are building on knowledge of physics-based relationships with ice thickness and volume. I miss some comments about the relevance of these features. Some may be obvious, like slope, distance to margin, etc., but others are not obvious, and other obvious parameters are not included, like e.g. precipitation. In this context, I also miss some references to previous works of what are the key parameters, in addition to the references to the textbook and the review of scaling relationships by Bahr et al. For example work by Oerlemans (e.g. summarized very briefly in Oerlemans (2005): Extracting a climate signal from 169 glacier records. Science 308, 675-677, https://www.science.org/doi/10.1126/science.1107046 or more elaborated in Oerlamans’s online booklet: https://www.cambridge.org/core/journals/journal-of-glaciology/article/j-oerlemans-2011-minimal-glacier-models-second-edition-utrecht-utrecht-university-igitur-utrecht-publishing-archiving-services-90pp-isbn-9879067010221-paperback-available-free-from-joerlemansuunl-and-as-free-download-from-httpigiturarchivelibraryuuni/A49EB2866D3E6E38040AECFD7B3E24AA .

It is later discussed in section 3 that some features are not important for the results, while other features are. In this discussion, it would be helpful to include an assessment of how this analysis compares to the previous work, i.e. whether the same features come out as important, or other parameters are more important in the work here. It is for example expected that slope is important. It is not a surprise that ice velocity is less important, but it is indeed very interesting to have the importance of including velocity quantified. This should be clarified and assessed in relation to previous knowledge. I also miss some discussion of why velocity is only important in high-latitude regions, not elsewhere.

The assessment of the model results, including comparison with other results and discussion of any limitations/difficulties appears sporadic. Perhaps because it is distributed over several sections (2.4, table 2, section 4). The quantitative assessment is summarized in Table 2, but only referred to in the methods section, and the application section is very qualitatively focused on the examples. The discussion in section 4 is very interesting, but could perhaps be summarized into a more general discussion of limitations/advantages of the method. It would be very helpful/interesting to see examples where the model do not perform well. Overall, I recommend that the assessment of the results and discussion of limitations should be more systematically organized to become more convincing.

Another problem in this context is that the figure captions are generally deficient, in particular figures 3, 4, 5, and it took me very long time to understand what they actually show (see below).

Specific points:
Section 2.4, lines 146-149: Explain why some regions are similar, so evaluations can be indicative in these regions.

Appendix B2: I don’t think it is clear why missing glacier aspect data are substituted with zero. Please explain.

Figure captions: all figure captions need to be reviewed and checked for completeness and clarity. Add letters a, b, etc

Figure 3 and 4: it was not clear that the two models in lower row show the same results as in the upper row, but now with the observations added. Please clarify that observations are added in all three figure in lower row. This will help underline how much better the IceBoost is in these examples.

Figure 5: It is not clear what big and small circles mean. Only by zooming in, it is possible to distinguish. It is not explained what the background image is.

I suggest to merge section 4 and 4.1, as section 4.1 adds to the impression that the assessment in the main paper is lacking.
Citation: https://doi.org/10.5194/egusphere-2024-2455-RC2
- AC1: 'Reply on RC1-RC2-Editor', Niccolò Maffezzoli, 17 Jan 2025
  
  We would like to thank the two Reviewers and the Editor for all their constructive comments and for their time to review the manuscript.
  Attached is our combined replies to Reviewer no. 1, Reviewer no.2, and to the Editor's comments.
  
  Citation: https://doi.org/10.5194/egusphere-2024-2455-AC1
- AC2: 'Reply on RC1-RC2-Editor', Niccolò Maffezzoli, 17 Jan 2025
  
  We would like to thank the two Reviewers and the Editor for all their constructive comments and for their time to review the manuscript.
  Attached is our combined replies to Reviewer no. 1, Reviewer no.2, and to the Editor's comments.
  
  Citation: https://doi.org/10.5194/egusphere-2024-2455-AC2

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2024-2455', Anonymous Referee #1, 02 Dec 2024

See attachment.

Citation: https://doi.org/10.5194/egusphere-2024-2455-RC1
- AC2: 'Reply on RC1-RC2-Editor', Niccolò Maffezzoli, 17 Jan 2025
  
  We would like to thank the two Reviewers and the Editor for all their constructive comments and for their time to review the manuscript.
  Attached is our combined replies to Reviewer no. 1, Reviewer no.2, and to the Editor's comments.
  
  Citation: https://doi.org/10.5194/egusphere-2024-2455-AC2
RC2:
'Comment on egusphere-2024-2455', Anonymous Referee #2, 09 Dec 2024
The paper presents a machine learning framework to model individual glacier thicknesses based on global glacier inventory data, supplemented with other data. The paper is motivated by differences in estimated ice volume between current physics-based models. The results obtained in the paper are promising and appears to improve the ice thickness estimates compared to the existing models. Estimating the World’s total glacier volume is relevant to projections of ice mass loss and sea level rise, and making use of machine learning to address this task is interesting and timely.
However, while the results are promising, the presentation needs to be improved. The method is not sufficiently explained but it is clear that it builds on knowledge from previous work and physics-based knowledge. The results needs to be systematically assessed and limitations discussed in more detail. Including these comments and arguments will increase confidence in the results and improve the impact. These points are explained below.
I have several concerns with the paper:
The selection of features used to train the IceBoost are not substantiated and explained. The datasets used to produce the features are from well know data repositories or data products, and clearly referenced, but it is not explained how the features in Table 1 were chosen. For example, how are the three aspect features defined, and what is the basis for the selected smoothed slopes? Were other smoothings and other variables investigated, but not included in the analysis in the end?

Furthermore, the main text does not define or describe the features, but the reader is referred to Appendix A and B. This makes the presentation of the model to appear rather superficial and makes it impossible to read Table 1, as well as understanding the discussions later. I suggest that some of Appendix A and B is included in the main text, so the presentation of the model appears clear and self-contained.

This brings me to the next point, which is that the physical basis for the selected features is not described or commented on. I acknowledge that the machine learning method is not based on physical relationships, but the selection of the features are building on knowledge of physics-based relationships with ice thickness and volume. I miss some comments about the relevance of these features. Some may be obvious, like slope, distance to margin, etc., but others are not obvious, and other obvious parameters are not included, like e.g. precipitation. In this context, I also miss some references to previous works of what are the key parameters, in addition to the references to the textbook and the review of scaling relationships by Bahr et al. For example work by Oerlemans (e.g. summarized very briefly in Oerlemans (2005): Extracting a climate signal from 169 glacier records. Science 308, 675-677, https://www.science.org/doi/10.1126/science.1107046 or more elaborated in Oerlamans’s online booklet: https://www.cambridge.org/core/journals/journal-of-glaciology/article/j-oerlemans-2011-minimal-glacier-models-second-edition-utrecht-utrecht-university-igitur-utrecht-publishing-archiving-services-90pp-isbn-9879067010221-paperback-available-free-from-joerlemansuunl-and-as-free-download-from-httpigiturarchivelibraryuuni/A49EB2866D3E6E38040AECFD7B3E24AA .

It is later discussed in section 3 that some features are not important for the results, while other features are. In this discussion, it would be helpful to include an assessment of how this analysis compares to the previous work, i.e. whether the same features come out as important, or other parameters are more important in the work here. It is for example expected that slope is important. It is not a surprise that ice velocity is less important, but it is indeed very interesting to have the importance of including velocity quantified. This should be clarified and assessed in relation to previous knowledge. I also miss some discussion of why velocity is only important in high-latitude regions, not elsewhere.

The assessment of the model results, including comparison with other results and discussion of any limitations/difficulties appears sporadic. Perhaps because it is distributed over several sections (2.4, table 2, section 4). The quantitative assessment is summarized in Table 2, but only referred to in the methods section, and the application section is very qualitatively focused on the examples. The discussion in section 4 is very interesting, but could perhaps be summarized into a more general discussion of limitations/advantages of the method. It would be very helpful/interesting to see examples where the model do not perform well. Overall, I recommend that the assessment of the results and discussion of limitations should be more systematically organized to become more convincing.

Another problem in this context is that the figure captions are generally deficient, in particular figures 3, 4, 5, and it took me very long time to understand what they actually show (see below).

Specific points:
Section 2.4, lines 146-149: Explain why some regions are similar, so evaluations can be indicative in these regions.

Appendix B2: I don’t think it is clear why missing glacier aspect data are substituted with zero. Please explain.

Figure captions: all figure captions need to be reviewed and checked for completeness and clarity. Add letters a, b, etc

Figure 3 and 4: it was not clear that the two models in lower row show the same results as in the upper row, but now with the observations added. Please clarify that observations are added in all three figure in lower row. This will help underline how much better the IceBoost is in these examples.

Figure 5: It is not clear what big and small circles mean. Only by zooming in, it is possible to distinguish. It is not explained what the background image is.

I suggest to merge section 4 and 4.1, as section 4.1 adds to the impression that the assessment in the main paper is lacking.
Citation: https://doi.org/10.5194/egusphere-2024-2455-RC2
- AC1: 'Reply on RC1-RC2-Editor', Niccolò Maffezzoli, 17 Jan 2025
  
  We would like to thank the two Reviewers and the Editor for all their constructive comments and for their time to review the manuscript.
  Attached is our combined replies to Reviewer no. 1, Reviewer no.2, and to the Editor's comments.
  
  Citation: https://doi.org/10.5194/egusphere-2024-2455-AC1
- AC2: 'Reply on RC1-RC2-Editor', Niccolò Maffezzoli, 17 Jan 2025
  
  We would like to thank the two Reviewers and the Editor for all their constructive comments and for their time to review the manuscript.
  Attached is our combined replies to Reviewer no. 1, Reviewer no.2, and to the Editor's comments.
  
  Citation: https://doi.org/10.5194/egusphere-2024-2455-AC2

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload

AR by Niccolò Maffezzoli on behalf of the Authors (19 Jan 2025) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (21 Jan 2025) by Ludovic Räss

RR by Anonymous Referee #2 (16 Feb 2025)

ED: Publish subject to minor revisions (review by editor) (17 Feb 2025) by Ludovic Räss

AR by Niccolò Maffezzoli on behalf of the Authors (25 Feb 2025) Author's response Author's tracked changes Manuscript

ED: Publish as is (27 Feb 2025) by Ludovic Räss

AR by Niccolò Maffezzoli on behalf of the Authors (02 Mar 2025) Manuscript

Journal article(s) based on this preprint

09 May 2025

A gradient-boosted tree framework to model the ice thickness of the world's glaciers (IceBoost v1.1)

Niccolò Maffezzoli, Eric Rignot, Carlo Barbante, Troels Petersen, and Sebastiano Vascon

Geosci. Model Dev., 18, 2545–2568, https://doi.org/10.5194/gmd-18-2545-2025,https://doi.org/10.5194/gmd-18-2545-2025, 2025

Short summary

Niccolò Maffezzoli, Eric Rignot, Carlo Barbante, Troels Petersen, and Sebastiano Vascon

Data sets

IceBoost - a Gradient Boosted Tree global framework for glacier ice thickness retrieval Niccolò Maffezzoli et al. https://zenodo.org/records/13145836

Model code and software

IceBoost GitHub repository Niccolò Maffezzoli https://github.com/nmaffe/iceboost

Niccolò Maffezzoli, Eric Rignot, Carlo Barbante, Troels Petersen, and Sebastiano Vascon

Viewed

Total article views: 602 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
432	152	18	602	19	24

HTML: 432
PDF: 152
XML: 18
Total: 602
BibTeX: 19
EndNote: 24

Views and downloads (calculated since 23 Sep 2024)

Month	HTML	PDF	XML	Total
Sep 2024	91	32	5	128
Oct 2024	43	16	1	60
Nov 2024	63	7	4	74
Dec 2024	106	39	2	147
Jan 2025	61	18	2	81
Feb 2025	20	12	3	35
Mar 2025	20	13	0	33
Apr 2025	16	14	1	31
May 2025	12	1	0	13

Cumulative views and downloads (calculated since 23 Sep 2024)

Month	HTML	PDF	XML	Total
Sep 2024	91	32	5	128
Oct 2024	43	16	1	60
Nov 2024	63	7	4	74
Dec 2024	106	39	2	147
Jan 2025	61	18	2	81
Feb 2025	20	12	3	35
Mar 2025	20	13	0	33
Apr 2025	16	14	1	31
May 2025	12	1	0	13

Viewed (geographical distribution)

Total article views: 581 (including HTML, PDF, and XML) Thereof 581 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 09 May 2025

Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Preprint (10435 KB)
Metadata XML

Short summary

In this work we introduces IceBoost, a machine learning framework to model the ice thickness distribution of all the World's glaciers, with greater accuracy than state-of-the-art methods. The model is trained on 3.7 million measurements globally available and provides skillful estimates across all regions. This advancement will help in better assessing future sea level changes, freshwater resources, with significance for both the scientific community and society at large.


Total:	0
HTML:	0
PDF:	0
XML:	0

A gradient-boosted tree framework to model the ice thickness of the World's glaciers (IceBoost v1)

Journal article(s) based on this preprint

Interactive discussion

Interactive discussion

Peer review completion

Suggestions for revision or reasons for rejection

Journal article(s) based on this preprint

Data sets

Model code and software

Viewed

Viewed (geographical distribution)