Preprints
https://doi.org/10.5194/egusphere-2025-3210
https://doi.org/10.5194/egusphere-2025-3210
20 Aug 2025
 | 20 Aug 2025
Status: this preprint is open for discussion and under review for Atmospheric Measurement Techniques (AMT).

Unsupervised Classification of Absorbing Aerosols with the SP2 via a Variational Autoencoder (VAE)

Aaryan Doshi and Kara Lamb

Abstract. The Single Particle Soot Photometer (SP2) detects refractory aerosol particle mass on a single-particle basis via laser-induced incandescence (L-II). While the SP2 has traditionally been used to quantify black carbon aerosol mass in the atmosphere, the instrument is increasingly being used to detect and quantify other types of absorbing aerosols, such as mineral dust or anthropogenically-sourced iron oxide aerosols. Quantifying the mass loadings and emission sources of absorbing aerosols in the atmosphere is important for understanding their role in the climate cycle. Supervised machine learning algorithms have shown potential to classify different types of aerosols from L-II signals, but these methods are sensitive to instrument configuration and require training datasets generated from laboratory samples, which do not generalize well to ambient atmospheric aerosols. Here we explore the effectiveness of an unsupervised deep learning method, a variational autoencoder (VAE), applied directly to L-II signals from the SP2 in order to classify different types of absorbing aerosols. The VAE compresses L-II signals into a bottleneck latent representation and reconstructs an output as similar as possible to the input signal, thereby reducing dimensionality. We apply this approach to a dataset comprised of laboratory samples of materials that show detectable incandescence in the SP2, including fullerene soot (as a proxy for black carbon), coated fullerene soot, coal fly ash, mineral dust, volcanic ash, hematite, and magnetite. We explore optimal latent representations of L-II signals to maximize separability of different aerosol classes by varying the size of the latent representation, and find that a latent representation of 3 allows us to capture the majority of the information in the L-II signals relevant for identifying different types of absorbing aerosols. We demonstrate that unsupervised machine learning is a promising method for identifying distinct populations of aerosols detected by the SP2.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Aaryan Doshi and Kara Lamb

Status: open (until 25 Sep 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2025-3210', Anonymous Referee #1, 09 Sep 2025 reply
Aaryan Doshi and Kara Lamb

Data sets

Laser-Induced Incandescent Signals for Laboratory Samples of Absorbing Aerosols Detected by the Single Particle Soot Photometer Kara Lamb https://doi.org/10.5281/zenodo.15800436

Interactive computing environment

SP2-Aerosol-Classification Aaryan Doshi and Kara Lamb https://github.com/adoshi25/SP2-Aerosol-Classification

Aaryan Doshi and Kara Lamb

Viewed

Total article views: 989 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
977 9 3 989 15 25
  • HTML: 977
  • PDF: 9
  • XML: 3
  • Total: 989
  • BibTeX: 15
  • EndNote: 25
Views and downloads (calculated since 20 Aug 2025)
Cumulative views and downloads (calculated since 20 Aug 2025)

Viewed (geographical distribution)

Total article views: 986 (including HTML, PDF, and XML) Thereof 986 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 13 Sep 2025
Download
Short summary
Aerosols that absorb sunlight play key role in Earth's climate. To improve detection of absorbing aerosols measured by the SP2, we explore unsupervised machine learning. Unlike earlier methods that require labeled training data from laboratory measurements , our approach learns patterns directly from SP2 signals. This makes it more applicable to atmospheric observations. We show this method can reveal distinct aerosol populations and improve aerosol classification.
Share