Preprints
https://doi.org/10.5194/egusphere-2025-3616
https://doi.org/10.5194/egusphere-2025-3616
22 Aug 2025
 | 22 Aug 2025
Status: this preprint is open for discussion and under review for Atmospheric Measurement Techniques (AMT).

Leveraging Machine Learning to Enhance Aerosol Classification using Single-Particle Mass Spectrometry

Jose A. Perez Chavez, Maria A. Zawadowicz, Christopher Boxe, and Joseph Wilkins

Abstract. Advancing automated classification of atmospheric aerosols from Single-Particle Mass Spectrometry (SPMS) data remains challenging due to overlapping chemical signatures and limited labeled data. Semi-supervised learning approaches offer potential solutions by leveraging unlabeled data to enhance classification accuracy. Four models were compared: a supervised Support Vector Machine (SVM), a self-training SVM, a stacked autoencoder classifier, and a stacked autoencoder trained with a temporal ensembling mean teacher framework. All models achieved robust performance with overall accuracies of 90.0–91.1 %, representing improvements over previous work on the same dataset (87 %) and competitive performance with current methods. Notably, the models effectively classified aerosols with limited representation in the dataset – soot (0.77 % of spectra, F1-scores: 0.93–0.97) and hazelnut pollen (0.98 % of spectra, F1-scores: 0.97–1.00) – highlighting their ability to capture distinct chemical signatures even with fewer than 200 training samples per class. While challenges persist in classifying certain species, particularly feldspars due to overlapping spectral features and class imbalances, this study demonstrates the significant potential of semi-supervised learning and advanced machine learning architectures in improving aerosol classification, with implications for atmospheric and climate research.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Jose A. Perez Chavez, Maria A. Zawadowicz, Christopher Boxe, and Joseph Wilkins

Status: open (until 13 Oct 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Jose A. Perez Chavez, Maria A. Zawadowicz, Christopher Boxe, and Joseph Wilkins
Jose A. Perez Chavez, Maria A. Zawadowicz, Christopher Boxe, and Joseph Wilkins

Viewed

Total article views: 907 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
885 18 4 907 17 20
  • HTML: 885
  • PDF: 18
  • XML: 4
  • Total: 907
  • BibTeX: 17
  • EndNote: 20
Views and downloads (calculated since 22 Aug 2025)
Cumulative views and downloads (calculated since 22 Aug 2025)

Viewed (geographical distribution)

Total article views: 905 (including HTML, PDF, and XML) Thereof 905 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 11 Sep 2025
Download
Short summary
In this study, we leverage the power of machine learning to develop classifiers using a comprehensive dataset of SPMS spectra. These classifiers enable automatic differentiation of aerosol particles based on their chemistry and size, facilitating more accurate and efficient aerosol classification. Our results show increased accuracy when including unlabeled data in a semi-supervised framework.
Share