the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Real-Time Pollen Dynamics and Automated Detection: Novel insights from Wrocław (Poland) 2024–2025
Abstract. Accurate monitoring and forecasting of airborne pollen are essential for public health and allergy management. This study evaluated a neural network model for real-time pollen monitoring using locally collected data in Wrocław, Poland (2024–2025) with the Swisens Poleno Jupiter detector. The retrained model, based on local data, outperformed the reference model trained on Swiss datasets and validated against a Hirst-type pollen trap. The coefficient of determination (R²) remained high (~0.8) especially for Alnus, Betula and Quercus, while the root mean square error (RMSE) was lower, particularly at low and medium concentrations, showing improved sensitivity, real-time detection and better representation of seasonal and diurnal dynamics. Hourly analyses revealed distinct taxon-specific diurnal patterns in pollen release. Temperature and relative humidity were the main drivers of variability, while wind speed influenced all taxa except Pinus. Hourly pollen concentrations were positively correlated with planetary boundary layer height, especially for Betula and Alnus, highlighting the role of atmospheric mixing in pollen dispersion. Wind direction, particularly from southern and southeastern sectors, modulated local transport, reflecting land cover effects. Correlations with meteorological variables varied by month and flowering stage. Validation against Hirst-type data confirmed that the locally retrained model accurately captures taxon-specific pollen dynamics, demonstrating its effectiveness for real-time allergen monitoring and improving the reliability of allergy risk assessments.
- Preprint
(1309 KB) - Metadata XML
-
Supplement
(206 KB) - BibTeX
- EndNote
Status: final response (author comments only)
- RC1: 'Comment on egusphere-2025-5874', Benoît Crouzy, 20 Apr 2026
-
RC2: 'Comment on egusphere-2025-5874', Ellen-Wien Augustijn, 23 Apr 2026
Many official pollen monitoring stations are converting to automated pollen detection and results on retraining models based on local data are very relevant in this context. This paper fits into this research gap. In addition, the paper also looks at diurnal variability of pollen concentrations.
General comments:
In line 56 of the manuscript you mention that many monitoring station are currently converting to automatic pollen samplers. It is true, yet these types of samplers remain very expensive. This creates a situation of a few locations in each country with very accurate pollen monitoring, but also large gaps in space with many locations that have no data et al. To create a monitoring network with a good spatial coverage, low-cost sensors should be developed and tested.Comment on line 76: I agree that automatic pollen detection has not achieved its full potential and maturity, but I also wonder when further improvements will still make considerable additional contributions (and to what purpose). You collected high-quality reference data for retraining and validating the pollen detection model. For every country/sensor, this will have to be done again.
The relevance of the diurnal concentration pattern analysis remained unclear to me until I received the results section. You seem to believe that information on the high-risk times of day will benefit the allergy sufferers, and that it enables you to assess the influence of meteorological parameters. I understand the link with the meteorological parameters, yet I doubt if warning allergy sufferers of high-risk moments during the day for will be implementable. Referring to my first point, the lack of a good spatial network will probably have a larger impact.
Interesting that the old model overestimated the duration of the season for Quercus and Betula. It is also interesting that the old model shows much shorter periods for 2025 compared to 2024. Is there any explanation for this?
Citation: https://doi.org/10.5194/egusphere-2025-5874-RC2
Viewed
| HTML | XML | Total | Supplement | BibTeX | EndNote | |
|---|---|---|---|---|---|---|
| 371 | 118 | 31 | 520 | 45 | 19 | 24 |
- HTML: 371
- PDF: 118
- XML: 31
- Total: 520
- Supplement: 45
- BibTeX: 19
- EndNote: 24
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
The manuscript describes the retraining of an automatic pollen identification algorithm for use with an airflow cytometer in a new environment. This allows in turn the authors to present an interesting case study of real-time pollen dynamics in correlation with weather parameters at the site in question.
While the study is of interest to the community, two critical points need to be addressed before it can be considered for publication:
1) No code or training dataset is made directly available, which significantly hinders the reproducibility of the study and the reuse of the model for operational or research purposes. This is particularly regrettable given that dedicated portals would allow this with minimal effort. For example, the Sylva/AutoPollen data portal or the Zenodo repository could be used for the training datasets, and GitHub could be used for the code. The issue is exacerbated by the lack of information on model training and architecture.
2) The benchmark model (“old model”) shows reasonable correlations with reference measurements, but there are massive scaling issues (Figure 3). This could be related to issues when applying scaling factors used for converting raw events to concentrations, as different versions of the Swisens Poleno software have been known to circulate. I strongly recommend that the authors double-check how scaling is applied when using the benchmark model, as this has a dramatic effect on the metrics for exposure class determination (leading to the old model systematically overestimating classes), the RMSE, and consequently the conclusions of the paper.
In addition to those main points, here are my other comments and suggestions in the order as they appear in the manuscript:
Line 71: “advanced instrument” I suggest removing “advanced” as this adds no information per se.
Line 75: “detections” remove “s”
Line 76: remove “advanced”
Line 81-82: Which “reference pollen database” are the authors referring to? How can this database be accessed?
Line 92: “These tree species” If I understand correctly you work at genus level throughout the paper. I thus suggest to use “genera” instead of “species”.
Line 109-111: Maybe compare hourly correlations you obtain with the results from Chappuis et al. 2020, Aerobiologia for a different automatic pollen monitor.
Line 119: “20 meters” what about surrounding buildings height /height of the urban layer if any.
Line 132: “airborne cytometer” to be replaced by “airflow cytometer”
Line 157-158: Why were Poaceae, particularly important for respiratory allergies disregarded in the study?
Line 221: How was the size of the buffer determined ?
Line 265: “Cruozy” -> “Crouzy”
Line 324: “Polleno” takes one “l”, check throughout the manuscript as this appears one more time on line 413.
Figure 3: I suggest to use a capital letter in front of each genus name
Line 340, 348 and 352: the massive RMSE differences might be related to improper scaling.
Figure 5, 7 and the S1 to S5 should be made self-contained by the caption for the reader to understand all the symbols used and different metrics represented.
Line 501-510: the case of rainfall could be discussed in more details as the result is a bit counter intuitive (rainfall is expected to scavenge airborne pollen).
Table 2: the results on PBL are interesting but again counter intuitive, how could that be explained? One might expect a compression effect.
Line 544 “Dissucion” typo
Whole discussion: correlation vs. causality issues, for example “crucial” Line 642, “stronger effect” Line 658, “influential” Line 660, “responsive”
Line 674 or “shaping “ Line 690”.
Line 717: Pine pollen might not be relevant for allergies.