Preprints
https://doi.org/10.5194/egusphere-2022-396
https://doi.org/10.5194/egusphere-2022-396
28 Jul 2022
 | 28 Jul 2022

Predicting peak daily maximum 8-hour ozone, and linkages to emissions and meteorology, in Southern California using machine learning methods

Ziqi Gao, Yifeng Wang, Petros Vasilakos, Cesunica E. Ivey, Khanh Do, and Armistead Goode Russell

Abstract. The growing abundance of data is conducive to using numerical methods to relate air quality, meteorology, and emissions to address which factors impact pollutant concentrations. Often, it is the extreme values that are of interest for health and regulatory purposes (e.g., the National Ambient Air Quality Standard for ozone uses the annual, maximum, daily 4th highest, 8-hour average (MDA8) ozone), though such values are the most challenging to predict using empirical models. We developed four different computational models, including the Generalized Additive Model (GAM), the Multivariate Adaptive Regression Splines, the Random Forest, and the Support Vector Regression, to develop observation-based relationships between the 4th highest MDA8 ozone in the South Coast Air Basin and precursor emissions, meteorological factors, and large-scale climate patterns. All models had similar predictive performance, though the GAM showed a relatively higher R2 value (0.96) with a lower root mean square error and mean bias.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Journal article(s) based on this preprint

16 Dec 2022
Predicting peak daily maximum 8 h ozone and linkages to emissions and meteorology in Southern California using machine learning methods (SoCAB-8HR V1.0)
Ziqi Gao, Yifeng Wang, Petros Vasilakos, Cesunica E. Ivey, Khanh Do, and Armistead G. Russell
Geosci. Model Dev., 15, 9015–9029, https://doi.org/10.5194/gmd-15-9015-2022,https://doi.org/10.5194/gmd-15-9015-2022, 2022
Short summary
Ziqi Gao, Yifeng Wang, Petros Vasilakos, Cesunica E. Ivey, Khanh Do, and Armistead Goode Russell

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2022-396', William Stockwell, 20 Sep 2022
  • RC2: 'Comment on egusphere-2022-396', Anonymous Referee #2, 01 Nov 2022

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2022-396', William Stockwell, 20 Sep 2022
  • RC2: 'Comment on egusphere-2022-396', Anonymous Referee #2, 01 Nov 2022

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
AR by Ziqi Gao on behalf of the Authors (24 Nov 2022)  Author's response   Author's tracked changes   Manuscript 
ED: Publish subject to technical corrections (02 Dec 2022) by Volker Grewe
AR by Ziqi Gao on behalf of the Authors (02 Dec 2022)  Manuscript 

Journal article(s) based on this preprint

16 Dec 2022
Predicting peak daily maximum 8 h ozone and linkages to emissions and meteorology in Southern California using machine learning methods (SoCAB-8HR V1.0)
Ziqi Gao, Yifeng Wang, Petros Vasilakos, Cesunica E. Ivey, Khanh Do, and Armistead G. Russell
Geosci. Model Dev., 15, 9015–9029, https://doi.org/10.5194/gmd-15-9015-2022,https://doi.org/10.5194/gmd-15-9015-2022, 2022
Short summary
Ziqi Gao, Yifeng Wang, Petros Vasilakos, Cesunica E. Ivey, Khanh Do, and Armistead Goode Russell

Data sets

Predicting peak daily maximum 8-hour ozone, and linkages to emissions and meteorology, in Southern California using machine learning methods Ziqi Gao https://doi.org/10.5281/zenodo.6892062

Model code and software

Predicting peak daily maximum 8-hour ozone, and linkages to emissions and meteorology, in Southern California using machine learning methods Ziqi Gao https://doi.org/10.5281/zenodo.6892066

Ziqi Gao, Yifeng Wang, Petros Vasilakos, Cesunica E. Ivey, Khanh Do, and Armistead Goode Russell

Viewed

Total article views: 492 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
347 129 16 492 36 3 5
  • HTML: 347
  • PDF: 129
  • XML: 16
  • Total: 492
  • Supplement: 36
  • BibTeX: 3
  • EndNote: 5
Views and downloads (calculated since 28 Jul 2022)
Cumulative views and downloads (calculated since 28 Jul 2022)

Viewed (geographical distribution)

Total article views: 479 (including HTML, PDF, and XML) Thereof 479 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 

Cited

Latest update: 11 Sep 2024
Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Short summary
While the national ambient air quality standard of ozone is based on the 3-year average of the 4th highest 8-hour maximum (MDA8) ozone concentrations, these predicted extreme values using numerical methods are always biased low. We built 4 computational models (GAM, MARS, Random Forest & SVR) to predict the 4th highest MDA8 ozone in Southern California using precursor emissions, meteorology, and climatological patterns. All models presented acceptable performance with GAM being the best.