A novel machine learning retrieval for the detection of ice crystal icing conditions based on geostationary satellite imagery

Aricò, Matteo; Piontek, Dennis; Bugliaro, Luca; Mayer, Johanna; Müller, Richard; Kalinka, Frank; Butter, Max

doi:10.5194/egusphere-2025-2985

Preprints

https://doi.org/10.5194/egusphere-2025-2985

Preprints

19 Aug 2025

| 19 Aug 2025

A novel machine learning retrieval for the detection of ice crystal icing conditions based on geostationary satellite imagery

Matteo Aricò, Dennis Piontek, Luca Bugliaro, Johanna Mayer, Richard Müller, Frank Kalinka, and Max Butter

Abstract. High ice water content (HIWC) conditions are a concern for aviation as the ingestion of ice particles in the jet engines can induce ice crystal icing (ICI), which results in performance loss and damage. To constantly monitor these conditions, retrievals for the detection of ICI were recently developed based on geostationary satellite imagery, but their calibration is limited to targeted flight campaigns or scattered samplings from ICI events databases. In this work, we close this gap, using exclusively remote sensing data to develop and assess a new retrieval for potential ICI conditions.

Cloud IWC measurements are provided from the synergy of radar and lidar (DARDAR) on board the polar-orbiting satellites CloudSat and CALIPSO. HIWC conditions (IWC ≥ 0.5 g m⁻³) at typical cruise altitudes are used as the proxy for areas with potential ICI formation. The HIWC conditions predictors are taken from a combination of observations and retrievals of the geostationary satellite Meteosat Second Generation (MSG). A random forest is trained and tested based on the collocated dataset of active and passive measurements during the summer months of 2013 and 2015, covering the European domain. The input predictors are the brightness temperature difference between the MSG channels at 6.2 and 10.8 µm wavelengths, the visible channel at 0.6 µm wavelength, the cloud optical thickness at 0.6 µm wavelength, and four convection metrics related to the distance to the closest convective cell, area extent of the convective cells, and convection density in the pixel surroundings. Over Europe, 83 % of HIWC conditions measured in the DARDAR dataset are correctly detected. The associated false alarm rate is 51 %. The retrieval is further tested with the ICI events database reported by Lufthansa. Four out of seven events are correctly detected. In conclusion, the retrieval achieves performances comparable to previously developed retrievals. An operational application would enable aircraft rerouting around areas with high ICI probability.

Received: 07 Jul 2025 – Discussion started: 19 Aug 2025

Competing interests: Author MB is employed by Deutsche Lufthansa AG. All other authors declare that they have no conflict of interest.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Matteo Aricò, Dennis Piontek, Luca Bugliaro, Johanna Mayer, Richard Müller, Frank Kalinka, and Max Butter

Status: closed

RC1:
'Comment on egusphere-2025-2985', Anonymous Referee #1, 10 Sep 2025

Summary
This paper describes a method for assessing the likelihood of ICI derived from geostationary satellite observations and derived products, coupled with cloud water content estimates derived from the CloudSat/CALIPSO-based DARDAR product. ICI and HIWC often occurs within deep convection, though it has also been observed within mid-latitude frontal cloud bands, and represents a significant hazard to aviation. Machine learning identified the most important metrics for diagnosing ICI that are combined to estimate a (daytime only) ICI/HIWC likelihood for the MSG SEVIRI imager, that has been validated with a subset of DARDAR not used in model training. Performance is fairly comparable to existing methods, though with slightly weaker validation stats. The product is then validated with DARDAR and compared with a small sample of European ICI events encountered by in-service Lufthansa aircraft.
The authors have a clear understanding of the ICI/HIWC hazard, existing satellite-based methods from the literature focused on diagnosing ICI/HIWC, machine learning best practices, and the most appropriate geostationary parameters for diagnosing this hazard. The paper is clear and well written. My concerns with the paper begin with the exclusive focus over Europe. Convection over Europe is relatively infrequent compared with Africa or the tropical Atlantic and typically weaker (with warmer tops) than these regions. Figure 3 shows several African/Atlantic Lufthansa ICI events that could have been studied, which if included would increase confidence in the method’s global applicability. DARDAR likely viewed many intense storms over Africa as well that would serve as excellent training for the model. Another concern is the fact that there is no attempt to try to develop/validate a model that can operate at night. Aside from tau and visible reflectance, all other parameters are available at night. I would like to see a night-time product demonstration. Third, there was an international HIWC/HAIC field campaign based in Cayenne, French Guyana in 2015 that was within the SEVIRI field of view. IWC data was collected at 5 sec intervals from 2 aircraft which would be an extremely robust dataset for model validation. The authors should explore this data as it has been a number of years since collection and the data should be freely available by now. I have a number of other more minor comments/concerns listed below
Given that the paper and its writing are of high quality, but due to the significant concerns mentioned above, I say the paper is acceptable but with major revisions to address these concerns.
Specific Minor Comments/Concerns
Sections 2.1.1 and 2.1.2, the differences between CIPS and APICS, and the ramifications of these differences on the analysis are not very clear. I see both produce optical depth but you use the optical depth from one model for water cloud and the other for ice cloud. Additionally it is not explained why you are not using the cloud product data operationally generated by EUMETSAT which would make your method more easy to apply by others in the community.
Section 4.1, I don’t think you need to use paper space to define very commonly used validation metrics. You could simply cite the Wilks meteorological statistics book and move one Wilks, D. S., 2006: Statistical Methods in the Atmospheric Sciences. 2nd ed. International Geophysics Series, Vol. 100, Academic Press, 648 pp.
Validation stats in general, it would be interesting to see the validation applied to > 1.0 g m^-3 data in addition to > 0.5 as the higher value is likely to be more consequential for aircraft.
Figure 7 and many other mapped data figures (i.e. Figure 10), the mapped product is very hard to see details of. For Fig 7, I recommend you make the map much larger and place below the curtain plot data. For Fig 10, consider enlarging the graphics as I cannot see details when printed out on paper.
Figure A.3, there is an extremely odd look to the HIWC product with a discontinuity at 49.3 N latitude. What is the reason for this? Figure A.7 has an odd diagonal discontinuity too.
All Figures in Appendix, what is the purpose of plotting the wind information on the maps? It seems like an unnecessary detail that adds clutter to the map.

Citation: https://doi.org/10.5194/egusphere-2025-2985-RC1
- AC1: 'Reply on RC1', Matteo Arico, 21 Oct 2025
  
  We thank the reviewer for his constructive criticism and the interesting hints that have helped us improving the manuscript. We have addressed all the concerns in the attached PDF.
  
  Citation: https://doi.org/10.5194/egusphere-2025-2985-AC1
RC2:
'Comment on egusphere-2025-2985', Anonymous Referee #2, 11 Sep 2025
General Comments
In this paper, Aricò et al. present a method for detecting high ice water content (HIWC) conditions associated with aviation ice crystal icing (ICI) events using geostationary satellite data. They apply a machine learning approach to derive a set of input variables from Meteosat Second Generation (MSG) SEVIRI derived products. Training of a random forest is accomplished using CloudSat radar and lidar (DARDAR) products. Finally, the authors obtained a database comprised of in-service ICI events which they have used to asses performance of their detection method.
The paper is clearly written and well-organized. It begins with a comprehensive description of the ICI threat to aviation and a review of previously published HIWC detection methods. It provides thorough descriptions of the various data sets used. Explanations of the random forest approach, why it was chosen, and how it was applied to this problem are detailed and clear. Results presented are meaningful, and conclusions are well-justified. Significantly, the authors have demonstrated the need for a dedicated HIWC detection product by showing displacement between their HIWC Mask and detected convective cells. Overall, this is a very good paper that needs only minor revision.
Specific points:
If I understand correctly, the method is limited to cruise altitudes, but it’s not clear why. ICI engine events have occurred during ascents and descents, so the hazard is not limited to cruise altitudes.

While I understand why the method is limited to daytime, given its reliance on products derived from visible channels, I think it would be appropriate to at least discuss how you might develop a corresponding nighttime method.

It’s difficult to see the symbols in certain figures (e.g., Figs 1, 7, 9, 10). The images and symbols could be enlarged and/or the symbol color could have better contrast with the background image.

Verification against in-service ICI events is very important, but the Lufthansa database apparently only includes air data system (ADS) events (e.g., TAT anomalies), not engine events. I assume the authors did not have access to the latter. Some discussion of the relationship between ADS events and engine events would bolster the significance of your results.

In the box and whisker plots shown in Fig. 6, some of the variables on the vertical axis only have outlier points, i.e., no box and whiskers. Could you explain how this should be interpreted?
Citation: https://doi.org/10.5194/egusphere-2025-2985-RC2
- AC2: 'Reply on RC2', Matteo Arico, 21 Oct 2025
  
  We thank the reviewer for the points brought up. Those constructive criticisms have helped us to improve the manuscript. We have addressed the concerns in the PDF attached.
  
  Citation: https://doi.org/10.5194/egusphere-2025-2985-AC2

Status: closed

RC1:
'Comment on egusphere-2025-2985', Anonymous Referee #1, 10 Sep 2025

Summary
This paper describes a method for assessing the likelihood of ICI derived from geostationary satellite observations and derived products, coupled with cloud water content estimates derived from the CloudSat/CALIPSO-based DARDAR product. ICI and HIWC often occurs within deep convection, though it has also been observed within mid-latitude frontal cloud bands, and represents a significant hazard to aviation. Machine learning identified the most important metrics for diagnosing ICI that are combined to estimate a (daytime only) ICI/HIWC likelihood for the MSG SEVIRI imager, that has been validated with a subset of DARDAR not used in model training. Performance is fairly comparable to existing methods, though with slightly weaker validation stats. The product is then validated with DARDAR and compared with a small sample of European ICI events encountered by in-service Lufthansa aircraft.
The authors have a clear understanding of the ICI/HIWC hazard, existing satellite-based methods from the literature focused on diagnosing ICI/HIWC, machine learning best practices, and the most appropriate geostationary parameters for diagnosing this hazard. The paper is clear and well written. My concerns with the paper begin with the exclusive focus over Europe. Convection over Europe is relatively infrequent compared with Africa or the tropical Atlantic and typically weaker (with warmer tops) than these regions. Figure 3 shows several African/Atlantic Lufthansa ICI events that could have been studied, which if included would increase confidence in the method’s global applicability. DARDAR likely viewed many intense storms over Africa as well that would serve as excellent training for the model. Another concern is the fact that there is no attempt to try to develop/validate a model that can operate at night. Aside from tau and visible reflectance, all other parameters are available at night. I would like to see a night-time product demonstration. Third, there was an international HIWC/HAIC field campaign based in Cayenne, French Guyana in 2015 that was within the SEVIRI field of view. IWC data was collected at 5 sec intervals from 2 aircraft which would be an extremely robust dataset for model validation. The authors should explore this data as it has been a number of years since collection and the data should be freely available by now. I have a number of other more minor comments/concerns listed below
Given that the paper and its writing are of high quality, but due to the significant concerns mentioned above, I say the paper is acceptable but with major revisions to address these concerns.
Specific Minor Comments/Concerns
Sections 2.1.1 and 2.1.2, the differences between CIPS and APICS, and the ramifications of these differences on the analysis are not very clear. I see both produce optical depth but you use the optical depth from one model for water cloud and the other for ice cloud. Additionally it is not explained why you are not using the cloud product data operationally generated by EUMETSAT which would make your method more easy to apply by others in the community.
Section 4.1, I don’t think you need to use paper space to define very commonly used validation metrics. You could simply cite the Wilks meteorological statistics book and move one Wilks, D. S., 2006: Statistical Methods in the Atmospheric Sciences. 2nd ed. International Geophysics Series, Vol. 100, Academic Press, 648 pp.
Validation stats in general, it would be interesting to see the validation applied to > 1.0 g m^-3 data in addition to > 0.5 as the higher value is likely to be more consequential for aircraft.
Figure 7 and many other mapped data figures (i.e. Figure 10), the mapped product is very hard to see details of. For Fig 7, I recommend you make the map much larger and place below the curtain plot data. For Fig 10, consider enlarging the graphics as I cannot see details when printed out on paper.
Figure A.3, there is an extremely odd look to the HIWC product with a discontinuity at 49.3 N latitude. What is the reason for this? Figure A.7 has an odd diagonal discontinuity too.
All Figures in Appendix, what is the purpose of plotting the wind information on the maps? It seems like an unnecessary detail that adds clutter to the map.

Citation: https://doi.org/10.5194/egusphere-2025-2985-RC1
- AC1: 'Reply on RC1', Matteo Arico, 21 Oct 2025
  
  We thank the reviewer for his constructive criticism and the interesting hints that have helped us improving the manuscript. We have addressed all the concerns in the attached PDF.
  
  Citation: https://doi.org/10.5194/egusphere-2025-2985-AC1
RC2:
'Comment on egusphere-2025-2985', Anonymous Referee #2, 11 Sep 2025
General Comments
In this paper, Aricò et al. present a method for detecting high ice water content (HIWC) conditions associated with aviation ice crystal icing (ICI) events using geostationary satellite data. They apply a machine learning approach to derive a set of input variables from Meteosat Second Generation (MSG) SEVIRI derived products. Training of a random forest is accomplished using CloudSat radar and lidar (DARDAR) products. Finally, the authors obtained a database comprised of in-service ICI events which they have used to asses performance of their detection method.
The paper is clearly written and well-organized. It begins with a comprehensive description of the ICI threat to aviation and a review of previously published HIWC detection methods. It provides thorough descriptions of the various data sets used. Explanations of the random forest approach, why it was chosen, and how it was applied to this problem are detailed and clear. Results presented are meaningful, and conclusions are well-justified. Significantly, the authors have demonstrated the need for a dedicated HIWC detection product by showing displacement between their HIWC Mask and detected convective cells. Overall, this is a very good paper that needs only minor revision.
Specific points:
If I understand correctly, the method is limited to cruise altitudes, but it’s not clear why. ICI engine events have occurred during ascents and descents, so the hazard is not limited to cruise altitudes.

While I understand why the method is limited to daytime, given its reliance on products derived from visible channels, I think it would be appropriate to at least discuss how you might develop a corresponding nighttime method.

It’s difficult to see the symbols in certain figures (e.g., Figs 1, 7, 9, 10). The images and symbols could be enlarged and/or the symbol color could have better contrast with the background image.

Verification against in-service ICI events is very important, but the Lufthansa database apparently only includes air data system (ADS) events (e.g., TAT anomalies), not engine events. I assume the authors did not have access to the latter. Some discussion of the relationship between ADS events and engine events would bolster the significance of your results.

In the box and whisker plots shown in Fig. 6, some of the variables on the vertical axis only have outlier points, i.e., no box and whiskers. Could you explain how this should be interpreted?
Citation: https://doi.org/10.5194/egusphere-2025-2985-RC2
- AC2: 'Reply on RC2', Matteo Arico, 21 Oct 2025
  
  We thank the reviewer for the points brought up. Those constructive criticisms have helped us to improve the manuscript. We have addressed the concerns in the PDF attached.
  
  Citation: https://doi.org/10.5194/egusphere-2025-2985-AC2

Matteo Aricò, Dennis Piontek, Luca Bugliaro, Johanna Mayer, Richard Müller, Frank Kalinka, and Max Butter

Viewed

Total article views: 1,704 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
1,633	52	19	1,704	33	39

HTML: 1,633
PDF: 52
XML: 19
Total: 1,704
BibTeX: 33
EndNote: 39

Views and downloads (calculated since 19 Aug 2025)

Month	HTML	PDF	XML	Total
Aug 2025	537	18	2	557
Sep 2025	1,038	12	9	1,059
Oct 2025	51	14	8	73
Nov 2025	7	8	0	15

Cumulative views and downloads (calculated since 19 Aug 2025)

Month	HTML	PDF	XML	Total
Aug 2025	537	18	2	557
Sep 2025	1,038	12	9	1,059
Oct 2025	51	14	8	73
Nov 2025	7	8	0	15

Viewed (geographical distribution)

Total article views: 1,663 (including HTML, PDF, and XML) Thereof 1,663 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 10 Nov 2025

Short summary

The goal is to assess the feasibility of an ice crystal icing detection algorithm based exclusively on remote sensing data. Active measurements are used to train and validate a newly developed random forest algorithm that is applied to passive satellite imagery to estimate the ice crystal icing conditions probability. 83 % of ice crystal icing conditions are correctly detected, showing potential for an operational implementation to mitigate its negative effects on the fleet.


Total:	0
HTML:	0
PDF:	0
XML:	0