Preprints
https://doi.org/10.5194/egusphere-2025-160
https://doi.org/10.5194/egusphere-2025-160
05 Mar 2025
 | 05 Mar 2025

Multi-Machine Learning Approaches to Modeling Small-Scale Source Attribution of Ozone Formation

Zheng Xiao, Yifeng Lu, and Guangli Xiu

Abstract. Accurate source apportionment of ozone (O3) precursors is crucial for implementing scientific O3 control strategies. While traditional approaches rely on complex calculations of volatile organic compounds (VOCs) and meteorological parameters, their applicability in real-time scenarios remains limited. Taking the Shanghai chemical industrial park as an example, we propose a novel two-step machine learning (ML) approach that integrates positive matrix factorization (PMF) with other ML methods to systematically quantify the spatiotemporal impacts of VOCs on O3 formation. Analysis of high- frequency data from 12 VOC monitoring stations (2021–2023) using six ML models revealed XGBoost as the optimal predictor (R2=0.644) for local VOC emissions. By combining SHapley Additive exPlanations (SHAP) with ML modeling, we precisely evaluated VOC-O3 relationships and located emission sources. Results identified solvent use (SU) and fuel evaporation (FE) as primary O3 formation contributors, followed by combustion sources (CS) and vehicle emissions (VE). PMF analysis further distinguished six VOC sources: petrochemical processes (PP), FE, CS, SU, polymer fabrication (PF), and VE. Temporal analysis revealed seasonal variations, with CS and FE dominant in spring/summer, while PF prevailed in autumn. This innovative framework demonstrates exceptional capability for rapid source identification and precise contribution quantification, establishing a new paradigm for high-resolution O3 source apportionment.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Zheng Xiao, Yifeng Lu, and Guangli Xiu

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2025-160', Anonymous Referee #1, 09 May 2025
  • CC1: 'Comment on egusphere-2025-160', Thomas Karl, 18 May 2025
  • RC2: 'Comment on egusphere-2025-160', Anonymous Referee #2, 25 May 2025
  • AC1: 'Comment on egusphere-2025-160', Guangli Xiu, 28 Jun 2025

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2025-160', Anonymous Referee #1, 09 May 2025
  • CC1: 'Comment on egusphere-2025-160', Thomas Karl, 18 May 2025
  • RC2: 'Comment on egusphere-2025-160', Anonymous Referee #2, 25 May 2025
  • AC1: 'Comment on egusphere-2025-160', Guangli Xiu, 28 Jun 2025
Zheng Xiao, Yifeng Lu, and Guangli Xiu
Zheng Xiao, Yifeng Lu, and Guangli Xiu

Viewed

Total article views: 480 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
360 101 19 480 40 15 35
  • HTML: 360
  • PDF: 101
  • XML: 19
  • Total: 480
  • Supplement: 40
  • BibTeX: 15
  • EndNote: 35
Views and downloads (calculated since 05 Mar 2025)
Cumulative views and downloads (calculated since 05 Mar 2025)

Viewed (geographical distribution)

Total article views: 477 (including HTML, PDF, and XML) Thereof 477 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 30 Aug 2025
Download
Short summary
This study innovates air pollution tracking in industrial areas by merging AI with traditional methods. Analyzing three years of data from a Shanghai chemical park, we identified ozone pollution sources and seasonal variations, revealing that chemical solvents and fuel vapors are key contributors. Our method enables faster, accurate source identification, aiding better air quality decisions in rapidly developing regions.
Share