Spatio-temporal modeling of air pollutant concentrations in Germany using machine learning

Balamurugan, Vigneshkumar; Chen, Jia; Wenzel, Adrian; Keutsch, Frank N.

doi:https://doi.org/10.5194/egusphere-2023-463

Vigneshkumar Balamurugan, Jia Chen, Adrian Wenzel, and Frank N. Keutsch

Abstract. Machine learning (ML) models are becoming a meaningful tool for modeling air pollutant concentrations. ML models are capable of learning and modeling complex non-linear interactions between variables, and they require less computational effort than chemical transport models (CTMs). In this study, we used gradient boosted tree (GBT) and multi-layer perceptron (MLP; neural network) algorithms to model near-surface nitrogen dioxide (NO₂) and ozone (O₃) concentrations over Germany at 0.1 degree spatial resolution and daily intervals.

We trained the ML models using TROPOMI satellite column measurements combined with information on emission sources, air pollutant precursors and meteorology as feature variables. We found that the trained GBT model for NO₂ and O₃ explained a major portion of the observed concentrations (R² = 0.68–0.88, RMSE = 4.77–8.67 μg m^-3 and R² = 0.74–0.92, RMSE = 8.53–13.2 μg m^-3, respectively). The trained MLP model performed worse than the trained GBT model for both NO₂ and O₃ (R² = 0.46–0.82 and R² = 0.42–0.9, respectively).

Our NO₂ GBT model outperforms the CAMS model, a data-assimilated CTM, but slightly under-performs for O₃. However, our NO₂ and O₃ ML models require less computational effort than CTM. Therefore, we can analyze people’s exposure to near-surface NO₂ and O₃ with significantly less effort. During the study period (2018-04-30 and 2021-07-01), it was found that around 36 % of people lived in locations where the WHO NO₂ limit was exceeded for more than 25 % of the days, while 90 % of the population resided in areas where the WHO O₃ limit was surpassed for over 25 % of days. Although metropolitan areas had high NO₂ concentrations, rural areas, particularly in southern Germany, had high O₃ concentrations.

Furthermore, our ML models can be used to evaluate the effectiveness of mitigation policies. Near-surface NO₂ and O₃ concentrations changes during the 2020 COVID-19 lockdown period over Germany were indeed reproduced by the GBT model, with meteorology-accounted for near-surface NO₂ significantly decreased (by 23±5.3 %) and meteorology-accounted for near-surface O₃ slightly increased (by 1±4.6 %) over ten major German metropolitan areas, compared to 2019. Finally, our O₃ GBT model is highly transferable to other countries, at least to neighboring countries and locations where no measurements are available (R² = 0.87–0.94), whereas our NO₂ GBT model is moderately transferable (R² = 0.32–0.64).

Received: 13 Mar 2023 – Discussion started: 06 Apr 2023

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Download & links

Preprint (PDF, 13329 KB)

Download & links

Journal article(s) based on this preprint

14 Sep 2023

Spatiotemporal modeling of air pollutant concentrations in Germany using machine learning

Vigneshkumar Balamurugan, Jia Chen, Adrian Wenzel, and Frank N. Keutsch

Atmos. Chem. Phys., 23, 10267–10285, https://doi.org/10.5194/acp-23-10267-2023,https://doi.org/10.5194/acp-23-10267-2023, 2023

Short summary

Country	#	Views	%
United States of America	1	102	22
Germany	2	92	20
China	3	55	12
India	4	32	6
Austria	5	22	4


Total:	0
HTML:	0
PDF:	0
XML:	0

Spatio-temporal modeling of air pollutant concentrations in Germany using machine learning

Journal article(s) based on this preprint

Interactive discussion

Interactive discussion

Peer review completion

Journal article(s) based on this preprint

Viewed

Viewed (geographical distribution)

Cited

1 citations as recorded by crossref.