Preprints
https://doi.org/10.5194/egusphere-2026-74
https://doi.org/10.5194/egusphere-2026-74
15 Jan 2026
 | 15 Jan 2026
Status: this preprint is open for discussion and under review for Atmospheric Chemistry and Physics (ACP).

Deciphering the impacts of meteorology on surface ozone variability in eastern China using explainable machine learning models

Xingpei Ye, Lin Zhang, Xiaolin Wang, Ni Lu, Sebastian Hickman, Guo Luo, and Alex T. Archibald

Abstract. Understanding how meteorology influences surface ozone variability is critical for interpreting trends and designing effective air quality policies. This study employs explainable machine learning (XML) with SHapley Additive exPlanations (SHAP) to interpret daily ozone variations from 2013 to 2023 across three major regions in eastern China: North China Plain (NCP), Yangtze River Delta (YRD), and Pearl River Delta (PRD). An ensemble of five machine learning models (LightGBM, XGBoost, CatBoost, Random Forest, and Extra Trees) is trained using 14 meteorological variables and two temporal indicators. XML reveals nonlinear, region-specific ozone-meteorology relationships that are broadly consistent with physical understanding, while differences in SHAP attributions across algorithms highlight structural uncertainty arising from multicollinearity among input variables. We use SHAP-derived contributions to attribute warm-season ozone trends to meteorological versus non-meteorological drivers. Before 2019, ozone increases are mainly associated with the temporal proxy for non-meteorological influences (e.g., emission changes), whereas after 2019 meteorological variability dominates regional ozone trends. Exploiting the additive nature of SHAP, we develop a de-weathering framework that partitions daily ozone into a SHAP-based climatological baseline and a meteorology-induced ozone anomaly (MOA). Across all three regions, the magnitude of positive MOA events increases over 2013–2023, while their frequency and duration show no significant trends, indicating a strengthening meteorological amplification of pollution episodes rather than more frequent events. Our results demonstrate both the utility and limitations of XML for disentangling meteorological drivers of ozone pollution and provide new constraints on how meteorology shapes surface ozone under China’s clean air actions.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Xingpei Ye, Lin Zhang, Xiaolin Wang, Ni Lu, Sebastian Hickman, Guo Luo, and Alex T. Archibald

Status: open (until 26 Feb 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Xingpei Ye, Lin Zhang, Xiaolin Wang, Ni Lu, Sebastian Hickman, Guo Luo, and Alex T. Archibald
Xingpei Ye, Lin Zhang, Xiaolin Wang, Ni Lu, Sebastian Hickman, Guo Luo, and Alex T. Archibald

Viewed

Total article views: 43 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
30 12 1 43 5 3 2
  • HTML: 30
  • PDF: 12
  • XML: 1
  • Total: 43
  • Supplement: 5
  • BibTeX: 3
  • EndNote: 2
Views and downloads (calculated since 15 Jan 2026)
Cumulative views and downloads (calculated since 15 Jan 2026)

Viewed (geographical distribution)

Total article views: 43 (including HTML, PDF, and XML) Thereof 43 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 17 Jan 2026
Download
Short summary
This study investigates how meteorology influences long-term surface ozone trends and pollution events across three major regions in eastern China using an explainable machine learning framework. The results show physically interpretable yet model-dependent ozone-meteorology relationships, highlighting both the potential and the limitations of explainable machine learning for process understanding in atmospheric chemistry.
Share