Preprints
https://doi.org/10.5194/egusphere-2023-463
06 Apr 2023

Spatio-temporal modeling of air pollutant concentrations in Germany using machine learning

Vigneshkumar Balamurugan, Jia Chen, Adrian Wenzel, and Frank N. Keutsch

Abstract. Machine learning (ML) models are becoming a meaningful tool for modeling air pollutant concentrations. ML models are capable of learning and modeling complex non-linear interactions between variables, and they require less computational effort than chemical transport models (CTMs). In this study, we used gradient boosted tree (GBT) and multi-layer perceptron (MLP; neural network) algorithms to model near-surface nitrogen dioxide (NO2) and ozone (O3) concentrations over Germany at 0.1 degree spatial resolution and daily intervals.

We trained the ML models using TROPOMI satellite column measurements combined with information on emission sources, air pollutant precursors and meteorology as feature variables. The trained GBT models explained a major portion of the observed concentrations for both NO2 (R2 = 0.68–0.88, RMSE = 4.77–8.67 μg m-3) and O3 (R2 = 0.74–0.92, RMSE = 8.53–13.2 μg m-3). The trained MLP models performed worse than the trained GBT models for both NO2 (R2 = 0.46–0.82) and O3 (R2 = 0.42–0.9).
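As a reading aid for the R2 and RMSE scores reported above, the following minimal sketch shows how such scores can be computed from paired observed and modeled concentrations. It assumes that R2 denotes the coefficient of determination (the abstract does not state which definition is used), and the function names and sample values are hypothetical, not taken from the study.

```python
import math


def rmse(observed, predicted):
    """Root-mean-square error, in the same units as the inputs (e.g. ug m-3)."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared_errors) / len(observed))


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot (assumed definition)."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    if ss_tot == 0:
        raise ValueError("observed values have zero variance")
    return 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    # Made-up daily concentrations at one station, for illustration only.
    station_obs = [10.0, 20.0, 30.0, 40.0]
    model_pred = [12.0, 18.0, 33.0, 39.0]
    print("RMSE:", rmse(station_obs, model_pred))
    print("R2:", r_squared(station_obs, model_pred))
```

A higher R2 and a lower RMSE both indicate closer agreement with the station observations, which is the sense in which the GBT model is described as outperforming the MLP model.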

Our GBT model outperforms the CAMS model, a data-assimilated CTM, for NO2 but slightly underperforms it for O3. However, our NO2 and O3 ML models require less computational effort than CTMs. Therefore, we can analyze people’s exposure to near-surface NO2 and O3 with significantly less effort. During the study period (2018-04-30 to 2021-07-01), we found that around 36 % of people lived in locations where the WHO NO2 limit was exceeded on more than 25 % of days, while 90 % of the population resided in areas where the WHO O3 limit was exceeded on more than 25 % of days. Although metropolitan areas had high NO2 concentrations, rural areas, particularly in southern Germany, had high O3 concentrations.
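The exposure statistic above (the share of the population living where a limit is exceeded on more than 25 % of days) can be illustrated with a short sketch. It assumes gridded daily concentrations and a population count per grid cell; the data layout, the function name, and the limit value used in the example are hypothetical and are not taken from the abstract.

```python
def exposed_population_share(daily_conc_by_cell, population_by_cell,
                             limit, min_day_fraction=0.25):
    """Share of the population living in cells where `limit` is exceeded
    on more than `min_day_fraction` of the days with valid data."""
    exposed = 0.0
    total = 0.0
    for cell, series in daily_conc_by_cell.items():
        population = population_by_cell[cell]
        total += population
        # Skip missing days (None) when counting exceedances.
        valid_days = [c for c in series if c is not None]
        if not valid_days:
            continue
        exceedance_days = sum(1 for c in valid_days if c > limit)
        if exceedance_days / len(valid_days) > min_day_fraction:
            exposed += population
    return exposed / total if total else 0.0


if __name__ == "__main__":
    # Made-up values for three grid cells over four days.
    concentrations = {
        "cell_a": [30.0, 30.0, 10.0, 10.0],  # exceeds on 50 % of days
        "cell_b": [30.0, 10.0, 10.0, 10.0],  # exceeds on exactly 25 % of days
        "cell_c": [10.0, 10.0, 10.0, 10.0],  # never exceeds
    }
    population = {"cell_a": 100, "cell_b": 300, "cell_c": 600}
    # The limit of 25.0 is an illustrative threshold, not a value from the abstract.
    share = exposed_population_share(concentrations, population, limit=25.0)
    print(f"Exposed population share: {share:.0%}")
```

Note that the comparison is strict: a cell that exceeds the limit on exactly 25 % of days is not counted, matching the "more than 25 %" wording.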

Furthermore, our ML models can be used to evaluate the effectiveness of mitigation policies. The GBT model reproduced the changes in near-surface NO2 and O3 concentrations over Germany during the 2020 COVID-19 lockdown period: compared to 2019, and after accounting for meteorology, near-surface NO2 decreased significantly (by 23±5.3 %) and near-surface O3 increased slightly (by 1±4.6 %) over ten major German metropolitan areas. Finally, our O3 GBT model is highly transferable to other countries, at least to neighboring countries and locations where no measurements are available (R2 = 0.87–0.94), whereas our NO2 GBT model is moderately transferable (R2 = 0.32–0.64).

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Journal article(s) based on this preprint

14 Sep 2023
Spatiotemporal modeling of air pollutant concentrations in Germany using machine learning
Vigneshkumar Balamurugan, Jia Chen, Adrian Wenzel, and Frank N. Keutsch
Atmos. Chem. Phys., 23, 10267–10285, https://doi.org/10.5194/acp-23-10267-2023, 2023

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
  • RC1: 'Comment on egusphere-2023-463', Anonymous Referee #1, 01 Jun 2023
  • RC2: 'Comment on egusphere-2023-463', Anonymous Referee #2, 23 Jun 2023
  • AC1: 'Comment on egusphere-2023-463', Vigneshkumar Balamurugan, 28 Jul 2023

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
AR by Vigneshkumar Balamurugan on behalf of the Authors (28 Jul 2023): author's response, author's tracked changes, manuscript
ED: Publish as is (31 Jul 2023) by Harald Saathoff
AR by Vigneshkumar Balamurugan on behalf of the Authors (14 Aug 2023): manuscript

Viewed

Total article views: 517 (including HTML, PDF, and XML)
  • HTML: 368
  • PDF: 131
  • XML: 18
  • Total: 517
  • BibTeX: 7
  • EndNote: 7
Views and downloads (calculated since 06 Apr 2023)

Viewed (geographical distribution)

Total article views: 458 (including HTML, PDF, and XML) Thereof 458 with geography defined and 0 with unknown origin.

Latest update: 18 Sep 2024

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Short summary
In this study, we used machine learning models with a wide range of data sources, including meteorological data and satellite column measurements, to model NO2 and O3 concentrations. We investigated the spatial and temporal variability of these concentrations, as well as their drivers. Notably, the machine learning model established the relationship between NOx and O3. Although metropolitan regions are NO2 hotspots, rural areas have high O3 concentrations.