the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Machine learning approaches for automatic classification of single-particle mass spectrometry data
Abstract. The chemical composition of aerosol particles is a key parameter for human health and climate effects. Single-particle mass spectrometry (SPMS) has evolved to a mature technology with unique chemical coverage and the capability to analyze the distribution of aerosol components in the particle ensemble in real-time. With the fully automated characterization of the chemical profile of the aerosol particles, selective real-time monitoring of air quality could be performed e.g. for urgent risk assessments due to particularly harmful pollutants. For aerosol particle classification, mostly unsupervised clustering algorithms (ART-2a, K-means and their derivatives) are used, which require manual post-processing. In this work, we focus on supervised algorithms to tackle the problem of automatic classification of large amounts of aerosol particle data. Supervised learning requires data with labels to train a predictive model. Therefore, we created a labeled benchmark dataset containing ~24,000 particles with eight different coarse categories that were highly abundant at a measurement in summer in Central Europe: Elemental Carbon (EC), Organic Carbon and Elemental Carbon (OC-EC), Potassium-rich (K-rich), Calcium-rich (Ca-rich), Iron-rich (Fe-rich), Vanadium-rich (V-rich), Magnesium-rich (Mg-rich) and Sodium-rich (Na-rich). Using the chemical features of particles the performance of the following classical supervised algorithms was tested: K-nearest neighbors, support vector machine, decision tree, random forest and multi-layer perceptron. This work shows that despite the entrenched position of unsupervised clustering algorithms in the field, the use of supervised algorithms has the potential to replace the manual step of clustering algorithms in many applications, where real-time data analysis is essential. For the classification of the eight classes, the prediction accuracy of several supervised algorithms exceeded 97 %. The trained model was used to classify ~49,000 particles from a blind dataset in 0.2 seconds, taking into account also a class of “unclassified” particles. The predictions are highly consistent with the results obtained in a previous study using ART-2a.
-
Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
-
Preprint
(1394 KB)
-
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(1394 KB) - Metadata XML
- BibTeX
- EndNote
- Final revised paper
Journal article(s) based on this preprint
Interactive discussion
Status: closed
-
RC1: 'Comment on egusphere-2023-784', Anonymous Referee #1, 18 Jul 2023
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2023/egusphere-2023-784/egusphere-2023-784-RC1-supplement.pdf
-
AC1: 'Reply on RC1', Guanzhong Wang, 21 Aug 2023
Dear Reviewer,
We would like to express our gratitude for taking the time to review our manuscript. Your valuable insights and comments have been very helpful in enhancing the quality of our work, and we appreciate your thoughtful evaluation. We understand that timely communication is desired in the review process, and we apologize for our delayed answers .
We have carefully considered each of your suggestions and concerns, and we are pleased to provide a point-by-point response to address them, please see the supplement.
Once again, we extend our thanks for your time, effort, and expertise in reviewing our manuscript. We believe that your feedback has been pivotal in shaping the final version of our work.
Best regards
Guanzhong Wang
-
AC1: 'Reply on RC1', Guanzhong Wang, 21 Aug 2023
-
RC2: 'Comment on egusphere-2023-784', Anonymous Referee #2, 12 Sep 2023
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2023/egusphere-2023-784/egusphere-2023-784-RC2-supplement.pdf
-
AC2: 'Reply on RC2', Guanzhong Wang, 09 Oct 2023
Dear Reviewer,
we would like to express our gratitude for taking the time to review our manuscript. Your valuable insights and comments have been very helpful in enhancing the quality of our work, and we appreciate your thoughtful evaluation. We understand that timely communication is desired in the review process, and we apologize for our delayed answers.
We have carefully considered each of your suggestions and concerns, and we are pleased to provide a point-by-point response to address them, please see the supplement.
Once again, we extend our thanks for your time, effort, and expertise in reviewing our manuscript. We believe that your feedback has been pivotal in shaping the final version of our work.
Best regards, on behalf of the authors
Guanzhong Wang
Citation: https://doi.org/10.5194/egusphere-2023-784-AC2 -
AC3: 'Reply on RC2', Guanzhong Wang, 12 Oct 2023
Dear Reviewer,
We have uploaded a pdf file of our reply, but now we find that we are unable to download or view the file on the web page. It shows 404 error.
We contacted the editorial support team and they asked us to re-upload our reply again.
Thank you in advance.
Best regards
Guanzhong Wang
-
AC2: 'Reply on RC2', Guanzhong Wang, 09 Oct 2023
Interactive discussion
Status: closed
-
RC1: 'Comment on egusphere-2023-784', Anonymous Referee #1, 18 Jul 2023
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2023/egusphere-2023-784/egusphere-2023-784-RC1-supplement.pdf
-
AC1: 'Reply on RC1', Guanzhong Wang, 21 Aug 2023
Dear Reviewer,
We would like to express our gratitude for taking the time to review our manuscript. Your valuable insights and comments have been very helpful in enhancing the quality of our work, and we appreciate your thoughtful evaluation. We understand that timely communication is desired in the review process, and we apologize for our delayed answers .
We have carefully considered each of your suggestions and concerns, and we are pleased to provide a point-by-point response to address them, please see the supplement.
Once again, we extend our thanks for your time, effort, and expertise in reviewing our manuscript. We believe that your feedback has been pivotal in shaping the final version of our work.
Best regards
Guanzhong Wang
-
AC1: 'Reply on RC1', Guanzhong Wang, 21 Aug 2023
-
RC2: 'Comment on egusphere-2023-784', Anonymous Referee #2, 12 Sep 2023
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2023/egusphere-2023-784/egusphere-2023-784-RC2-supplement.pdf
-
AC2: 'Reply on RC2', Guanzhong Wang, 09 Oct 2023
Dear Reviewer,
we would like to express our gratitude for taking the time to review our manuscript. Your valuable insights and comments have been very helpful in enhancing the quality of our work, and we appreciate your thoughtful evaluation. We understand that timely communication is desired in the review process, and we apologize for our delayed answers.
We have carefully considered each of your suggestions and concerns, and we are pleased to provide a point-by-point response to address them, please see the supplement.
Once again, we extend our thanks for your time, effort, and expertise in reviewing our manuscript. We believe that your feedback has been pivotal in shaping the final version of our work.
Best regards, on behalf of the authors
Guanzhong Wang
Citation: https://doi.org/10.5194/egusphere-2023-784-AC2 -
AC3: 'Reply on RC2', Guanzhong Wang, 12 Oct 2023
Dear Reviewer,
We have uploaded a pdf file of our reply, but now we find that we are unable to download or view the file on the web page. It shows 404 error.
We contacted the editorial support team and they asked us to re-upload our reply again.
Thank you in advance.
Best regards
Guanzhong Wang
-
AC2: 'Reply on RC2', Guanzhong Wang, 09 Oct 2023
Peer review completion
Journal article(s) based on this preprint
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
347 | 187 | 28 | 562 | 15 | 15 |
- HTML: 347
- PDF: 187
- XML: 28
- Total: 562
- BibTeX: 15
- EndNote: 15
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
Cited
2 citations as recorded by crossref.
Guanzhong Wang
Julian Schade
Johannes Passig
Thomas Adam
Günther Dollinger
Ralf Zimmermann
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(1394 KB) - Metadata XML