Preprints
https://doi.org/10.5194/egusphere-2023-1531
10 Aug 2023

Diagnosing drivers of PM2.5 simulation biases from meteorology, chemical composition, and emission sources using an efficient machine learning method

Shuai Wang, Mengyuan Zhang, Yueqi Gao, Peng Wang, Qingyan Fu, and Hongliang Zhang

Abstract. Chemical transport models (CTMs) are widely used for air pollution modeling but suffer from significant biases due to uncertainties in simplified parameterizations, meteorological fields, and emission inventories. Accurate diagnosis of simulation biases is critical for improving models, interpreting results, and managing air quality efficiently, especially for simulations of fine particulate matter (PM2.5). In this study, an efficient method based on machine learning (ML) was designed to diagnose the drivers of Community Multiscale Air Quality (CMAQ) model biases in simulating PM2.5 concentrations from three perspectives: meteorology, chemical composition, and emission sources. The source-oriented CMAQ model was used to diagnose the influence of different emission sources on PM2.5 biases. The ML models showed good fitting ability, with a small performance gap between training and validation. The CMAQ model underestimated PM2.5 in 2019, with biases of -19.25 to -2.66 μg/m3, especially in winter and spring and during high-PM2.5 events. Among chemical components, secondary organic components made the largest contribution to PM2.5 simulation bias across regions and seasons (13.8–22.6 %). Relative humidity, cloud cover, and soil surface moisture were the main meteorological factors contributing to PM2.5 bias in the North China Plain, the Pearl River Delta, and the northwestern region, respectively. Both primary and secondary inorganic components from residential sources showed the largest contributions (12.05 % and 12.78 %), implying large uncertainties in this sector. ML-based methods are a valuable complement to traditional mechanism-based methods for model improvement, offering high efficiency and low reliance on prior information.
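
The abstract does not name the ML algorithm or the attribution technique, so the following is only a minimal sketch of the general idea: fit a model that predicts simulation bias from candidate drivers, then rank the drivers by how much the validation error grows when each one is shuffled (permutation importance). Everything here is an assumption made for illustration: the data are synthetic, the linear model is a stand-in for whatever ML model the authors used, the feature names are borrowed loosely from the abstract, and the coefficients are invented.

```python
import random

# Illustrative driver names only; the real predictor set is not given in the abstract.
FEATURES = ["relative_humidity", "cloud_cover", "wind_speed"]

def fit_linear(X, y):
    """Ordinary least squares with intercept, solved via the normal equations."""
    rows = [[1.0] + list(r) for r in X]
    k = len(rows[0])
    # Augmented matrix [A^T A | A^T y].
    A = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
         + [sum(r[i] * t for r, t in zip(rows, y))] for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(k):
        p = max(range(c, k), key=lambda i: abs(A[i][c]))
        A[c], A[p] = A[p], A[c]
        for i in range(k):
            if i != c:
                f = A[i][c] / A[c][c]
                A[i] = [a - f * b for a, b in zip(A[i], A[c])]
    return [A[i][k] / A[i][i] for i in range(k)]

def predict(coef, X):
    return [coef[0] + sum(c * v for c, v in zip(coef[1:], r)) for r in X]

def mse(y, yhat):
    return sum((a - b) ** 2 for a, b in zip(y, yhat)) / len(y)

def permutation_shares(coef, X, y, rng, repeats=5):
    """Percent share of each feature in the total error increase after shuffling."""
    base = mse(y, predict(coef, X))
    scores = []
    for j in range(len(X[0])):
        increase = 0.0
        for _ in range(repeats):
            col = [r[j] for r in X]
            rng.shuffle(col)
            X_perm = [r[:j] + [v] + r[j + 1:] for r, v in zip(X, col)]
            increase += mse(y, predict(coef, X_perm)) - base
        scores.append(max(increase / repeats, 0.0))  # clip noise-driven negatives
    total = sum(scores) or 1.0
    return [100.0 * s / total for s in scores]

# Synthetic "bias" (simulated minus observed PM2.5); the coefficients are invented.
rng = random.Random(0)
X = [[rng.random() for _ in FEATURES] for _ in range(400)]
y = [-8.0 * r[0] - 3.0 * r[1] + rng.gauss(0.0, 0.5) for r in X]

# Fit on one subset and attribute on a held-out subset.
X_train, y_train = X[:300], y[:300]
X_val, y_val = X[300:], y[300:]
coef = fit_linear(X_train, y_train)
shares = permutation_shares(coef, X_val, y_val, rng)

for name, share in sorted(zip(FEATURES, shares), key=lambda t: -t[1]):
    print(f"{name}: {share:.1f} %")
```

Computing the shares on held-out data rather than the training data keeps the ranking from rewarding features the model merely overfitted to; a tree-based or other nonlinear model could be swapped in for `fit_linear` and `predict` without changing the attribution step.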

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Journal article(s) based on this preprint

06 May 2024
Diagnosing drivers of PM2.5 simulation biases in China from meteorology, chemical composition, and emission sources using an efficient machine learning method
Shuai Wang, Mengyuan Zhang, Yueqi Gao, Peng Wang, Qingyan Fu, and Hongliang Zhang
Geosci. Model Dev., 17, 3617–3629, https://doi.org/10.5194/gmd-17-3617-2024, 2024

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
  • CEC1: 'Comment on egusphere-2023-1531', Juan Antonio Añel, 05 Sep 2023
    • AC1: 'Reply on CEC1', Hongliang Zhang, 07 Oct 2023
  • RC1: 'Comment on egusphere-2023-1531', Anonymous Referee #1, 06 Sep 2023
    • AC2: 'Reply on RC1', Hongliang Zhang, 07 Oct 2023
  • RC2: 'Comment on egusphere-2023-1531', Anonymous Referee #2, 15 Sep 2023
    • AC3: 'Reply on RC2', Hongliang Zhang, 07 Oct 2023

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
AR by Hongliang Zhang on behalf of the Authors (07 Oct 2023)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (18 Oct 2023) by Lele Shu
RR by Anonymous Referee #1 (03 Nov 2023)
RR by Anonymous Referee #3 (23 Nov 2023)
ED: Reconsider after major revisions (24 Nov 2023) by Lele Shu
AR by Hongliang Zhang on behalf of the Authors (21 Dec 2023)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (25 Dec 2023) by Lele Shu
RR by Anonymous Referee #4 (05 Jan 2024)
RR by Anonymous Referee #1 (12 Jan 2024)
ED: Reconsider after major revisions (18 Jan 2024) by Lele Shu
AR by Hongliang Zhang on behalf of the Authors (26 Feb 2024)  Author's response   Author's tracked changes   Manuscript 
ED: Publish subject to minor revisions (review by editor) (04 Mar 2024) by Lele Shu
AR by Hongliang Zhang on behalf of the Authors (05 Mar 2024)  Author's response   Author's tracked changes   Manuscript 
ED: Publish as is (06 Mar 2024) by Lele Shu
AR by Hongliang Zhang on behalf of the Authors (13 Mar 2024)  Manuscript 

Model code and software

Machine learning code and training datasets, Shuai Wang, https://zenodo.org/record/7907626

Viewed

Total article views: 546 (including HTML, PDF, and XML)
  • HTML: 375
  • PDF: 143
  • XML: 28
  • Total: 546
  • Supplement: 46
  • BibTeX: 18
  • EndNote: 22

Viewed (geographical distribution)

Total article views: 529 (including HTML, PDF, and XML) Thereof 529 with geography defined and 0 with unknown origin.
Latest update: 03 Sep 2024

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Short summary
Numerical models are widely used for air pollution modeling but suffer from significant biases. The machine learning model designed in this study is highly efficient at identifying the drivers of such biases. Meteorology (relative humidity and cloud cover), chemical composition (secondary organic components and dust aerosol), and emission sources (residential activities) are diagnosed as the main drivers of bias in modeling PM2.5, a typical air pollutant. The results will help improve numerical models.