Preprints
https://doi.org/10.5194/egusphere-2023-784
https://doi.org/10.5194/egusphere-2023-784
09 May 2023
 | 09 May 2023

Machine learning approaches for automatic classification of single-particle mass spectrometry data

Guanzhong Wang, Heinrich Ruser, Julian Schade, Johannes Passig, Thomas Adam, Günther Dollinger, and Ralf Zimmermann

Abstract. The chemical composition of aerosol particles is a key parameter for human health and climate effects. Single-particle mass spectrometry (SPMS) has evolved to a mature technology with unique chemical coverage and the capability to analyze the distribution of aerosol components in the particle ensemble in real-time. With the fully automated characterization of the chemical profile of the aerosol particles, selective real-time monitoring of air quality could be performed e.g. for urgent risk assessments due to particularly harmful pollutants. For aerosol particle classification, mostly unsupervised clustering algorithms (ART-2a, K-means and their derivatives) are used, which require manual post-processing. In this work, we focus on supervised algorithms to tackle the problem of automatic classification of large amounts of aerosol particle data. Supervised learning requires data with labels to train a predictive model. Therefore, we created a labeled benchmark dataset containing ~24,000 particles with eight different coarse categories that were highly abundant at a measurement in summer in Central Europe: Elemental Carbon (EC), Organic Carbon and Elemental Carbon (OC-EC), Potassium-rich (K-rich), Calcium-rich (Ca-rich), Iron-rich (Fe-rich), Vanadium-rich (V-rich), Magnesium-rich (Mg-rich) and Sodium-rich (Na-rich). Using the chemical features of particles the performance of the following classical supervised algorithms was tested: K-nearest neighbors, support vector machine, decision tree, random forest and multi-layer perceptron. This work shows that despite the entrenched position of unsupervised clustering algorithms in the field, the use of supervised algorithms has the potential to replace the manual step of clustering algorithms in many applications, where real-time data analysis is essential. For the classification of the eight classes, the prediction accuracy of several supervised algorithms exceeded 97 %. The trained model was used to classify ~49,000 particles from a blind dataset in 0.2 seconds, taking into account also a class of “unclassified” particles. The predictions are highly consistent with the results obtained in a previous study using ART-2a.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Journal article(s) based on this preprint

16 Jan 2024
Machine learning approaches for automatic classification of single-particle mass spectrometry data
Guanzhong Wang, Heinrich Ruser, Julian Schade, Johannes Passig, Thomas Adam, Günther Dollinger, and Ralf Zimmermann
Atmos. Meas. Tech., 17, 299–313, https://doi.org/10.5194/amt-17-299-2024,https://doi.org/10.5194/amt-17-299-2024, 2024
Short summary
Guanzhong Wang, Heinrich Ruser, Julian Schade, Johannes Passig, Thomas Adam, Günther Dollinger, and Ralf Zimmermann

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2023-784', Anonymous Referee #1, 18 Jul 2023
    • AC1: 'Reply on RC1', Guanzhong Wang, 21 Aug 2023
  • RC2: 'Comment on egusphere-2023-784', Anonymous Referee #2, 12 Sep 2023
    • AC2: 'Reply on RC2', Guanzhong Wang, 09 Oct 2023
    • AC3: 'Reply on RC2', Guanzhong Wang, 12 Oct 2023

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2023-784', Anonymous Referee #1, 18 Jul 2023
    • AC1: 'Reply on RC1', Guanzhong Wang, 21 Aug 2023
  • RC2: 'Comment on egusphere-2023-784', Anonymous Referee #2, 12 Sep 2023
    • AC2: 'Reply on RC2', Guanzhong Wang, 09 Oct 2023
    • AC3: 'Reply on RC2', Guanzhong Wang, 12 Oct 2023

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
AR by Guanzhong Wang on behalf of the Authors (13 Oct 2023)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (18 Oct 2023) by Hendrik Fuchs
RR by Anonymous Referee #2 (30 Oct 2023)
RR by Anonymous Referee #1 (02 Nov 2023)
ED: Publish as is (02 Nov 2023) by Hendrik Fuchs
AR by Guanzhong Wang on behalf of the Authors (06 Nov 2023)  Manuscript 

Journal article(s) based on this preprint

16 Jan 2024
Machine learning approaches for automatic classification of single-particle mass spectrometry data
Guanzhong Wang, Heinrich Ruser, Julian Schade, Johannes Passig, Thomas Adam, Günther Dollinger, and Ralf Zimmermann
Atmos. Meas. Tech., 17, 299–313, https://doi.org/10.5194/amt-17-299-2024,https://doi.org/10.5194/amt-17-299-2024, 2024
Short summary
Guanzhong Wang, Heinrich Ruser, Julian Schade, Johannes Passig, Thomas Adam, Günther Dollinger, and Ralf Zimmermann
Guanzhong Wang, Heinrich Ruser, Julian Schade, Johannes Passig, Thomas Adam, Günther Dollinger, and Ralf Zimmermann

Viewed

Total article views: 562 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
347 187 28 562 15 15
  • HTML: 347
  • PDF: 187
  • XML: 28
  • Total: 562
  • BibTeX: 15
  • EndNote: 15
Views and downloads (calculated since 09 May 2023)
Cumulative views and downloads (calculated since 09 May 2023)

Viewed (geographical distribution)

Total article views: 547 (including HTML, PDF, and XML) Thereof 547 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 

Cited

Latest update: 30 Aug 2024
Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Short summary
This research aims to develop a novel warning system for the real-time monitoring of pollutants in the atmosphere. The system is capable of sampling and investigating airborne aerosol particles on-site, utilizing artificial intelligence to learn their chemical signatures and to classify them in real-time. We applied single-particle mass spectrometry for analyzing of chemical composition of aerosol particles and suggested several supervised algorithms for highly reliable automatic classification.