Preprints
https://doi.org/10.5194/egusphere-2025-6259
https://doi.org/10.5194/egusphere-2025-6259
19 Jan 2026
 | 19 Jan 2026
Status: this preprint is open for discussion and under review for Atmospheric Measurement Techniques (AMT).

Flow cytometry and machine learning enable identification of allergenic urban tree pollen

Sarah Tardif, Maria Raquel Kanieski, Gauthier Lapa, Grégoire Bonnamour, Rita Sousa-Silva, Isabelle Laforest-Lapointe, and Alain Paquette

Abstract. Exposure to allergenic pollen is a major public health concern, as it is a key trigger for respiratory allergies, including seasonal allergic rhinitis, which affects approximately 20 % of the global population. Monitoring airborne pollen is essential for prevention and clinical management, yet traditional identification methods, such as light microscopy, are time-consuming and often limited to genus- or family-level resolution. Here, we present a high-throughput approach combining flow cytometry with machine learning to identify pollen from urban environments. We collected a reference database of pollen from 97 species across 34 genera, representing the dominant allergenic trees and other common airborne taxa in Montreal, Canada. Using flow cytometry, we measured particle size, granularity, and fluorescence intensity across multiple excitation and emission channels, and applied a Random Forest classifier to distinguish pollen taxa. At the species level, the model achieved a mean F1-score of 0.76, while genus-level classification reached 0.90, with misclassifications largely occurring among closely related species. Granularity and fluorescence parameters from the violet and blue lasers were the most distinctive features. Our results demonstrate that flow cytometry combined with machine learning provides an efficient, scalable alternative to microscopy, with potential for large-scale urban pollen monitoring.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Sarah Tardif, Maria Raquel Kanieski, Gauthier Lapa, Grégoire Bonnamour, Rita Sousa-Silva, Isabelle Laforest-Lapointe, and Alain Paquette

Status: open (until 24 Feb 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Sarah Tardif, Maria Raquel Kanieski, Gauthier Lapa, Grégoire Bonnamour, Rita Sousa-Silva, Isabelle Laforest-Lapointe, and Alain Paquette

Data sets

Pollen Flow Cytometry Datasets and Classification Models Sarah Tardif https://doi.org/10.6084/m9.figshare.30870641

Model code and software

Pollen-classification-model Sarah Tardif https://github.com/SarahTardif/Pollen-classification-model

Sarah Tardif, Maria Raquel Kanieski, Gauthier Lapa, Grégoire Bonnamour, Rita Sousa-Silva, Isabelle Laforest-Lapointe, and Alain Paquette
Metrics will be available soon.
Latest update: 19 Jan 2026
Download
Short summary
We developed a high-throughput method combining flow cytometry and machine learning to identify urban pollen. Using a reference database of 97 species across 34 genera, with values of particle size, granularity, and multi-channel fluorescence for each pollen grains, our method enables rapid species- and genus-level pollen identification. It provides an efficient alternative to microscopy, with potential for large-scale urban pollen monitoring and allergy management.
Share