Preprints
https://doi.org/10.5194/egusphere-2025-1501
https://doi.org/10.5194/egusphere-2025-1501
07 May 2025
 | 07 May 2025
Status: this preprint is open for discussion and under review for Atmospheric Measurement Techniques (AMT).

Classifying Thermodynamic Cloud Phase Using Machine Learning Models

Lexie Goldberger, Maxwell Levin, Carlandra Harris, Andrew Geiss, Matthew D. Shupe, and Damao Zhang

Abstract. Vertically resolved thermodynamic cloud phase classifications are essential for studies of atmospheric cloud and precipitation processes. The Department of Energy (DOE) Atmospheric Radiation Measurement (ARM) THERMOCLDPHASE Value-Added Product (VAP) uses a multi-sensor approach to classify thermodynamic cloud phase by combining lidar backscatter and depolarization, radar reflectivity, Doppler velocity, spectral width, microwave radiometer-derived liquid water path, and radiosonde temperature measurements. The measured voxels are classified as ice, snow, mixed-phase, liquid (cloud water), drizzle, rain, and liq_driz (liquid+drizzle). We use this product as the ground truth to train three machine learning (ML) models to predict the thermodynamic cloud phase from multi-sensor remote sensing measurements taken at the ARM North Slope of Alaska (NSA) observatory: a random forest (RF), a multilayer perceptron (MLP), and a convolutional neural network (CNN) with a U-Net architecture. Evaluations against the outputs of the THERMOCLDPHASE VAP with one year of data show that the CNN outperforms the other two models, achieving the highest test accuracy, F1-score, and mean Intersection over Union (IOU). Analysis of ML confidence scores shows ice, rain, and snow have higher confidence scores, followed by liquid, while mixed, drizzle, and liq_driz have lower scores. Feature importance analysis reveals that the mean Doppler velocity and vertically resolved temperature are the most influential datastreams for ML thermodynamic cloud phase predictions. The ML models’ generalization capacity is further evaluated by applying them at another Arctic ARM site in Norway using data taken during the ARM Cold-Air Outbreaks in the Marine Boundary Layer Experiment (COMBLE) field campaign. Finally, we evaluate the ML models’ response to simulated instrument outages and signal degradation.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Share
Lexie Goldberger, Maxwell Levin, Carlandra Harris, Andrew Geiss, Matthew D. Shupe, and Damao Zhang

Status: open (until 12 Jun 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Lexie Goldberger, Maxwell Levin, Carlandra Harris, Andrew Geiss, Matthew D. Shupe, and Damao Zhang
Lexie Goldberger, Maxwell Levin, Carlandra Harris, Andrew Geiss, Matthew D. Shupe, and Damao Zhang

Viewed

Total article views: 98 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
84 9 5 98 8 3 3
  • HTML: 84
  • PDF: 9
  • XML: 5
  • Total: 98
  • Supplement: 8
  • BibTeX: 3
  • EndNote: 3
Views and downloads (calculated since 07 May 2025)
Cumulative views and downloads (calculated since 07 May 2025)

Viewed (geographical distribution)

Total article views: 129 (including HTML, PDF, and XML) Thereof 129 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 15 May 2025
Download
Short summary
This study leverages machine learning models to classify cloud thermodynamic phases using multi-sensor remote sensing data collected at the Department of Energy Atmospheric Radiation Measurement North Slope of Alaska observatory. We evaluate model performance, feature importance, application of the model to another observatory, and quantify how the models respond to instrument outages.
Share