Preprints
https://doi.org/10.5194/egusphere-2025-1735
https://doi.org/10.5194/egusphere-2025-1735
25 Apr 2025
 | 25 Apr 2025
Status: this preprint is open for discussion and under review for Atmospheric Chemistry and Physics (ACP).

Estimating surface sulfur dioxide concentrations from satellite data: Using chemical transport models vs. machine learning

Zachary Watson, Can Li, Fei Liu, Sean W. Freeman, Huanxin Zhang, Jun Wang, and Shan-Hu Lee

Abstract. Sulfur dioxide (SO2) is an important air pollutant that contributes to negative health effects, acid rain, and aerosol formation and growth. SO2 has been measured using ground-based air quality monitoring networks, but the routine monitoring sites are predominantly placed in urban areas, leaving large gaps in the network in less populated locations. Previous studies have used chemical transport models (CTMs) or machine learning techniques to estimate surface SO2 concentrations from satellite vertical column densities, but no direct comparisons between the methods have been made. In this study, we estimated surface SO2 concentrations using Ozone Monitoring Instrument (OMI) retrievals over eastern China from 2015–2018 utilizing GEOS-Chem simulations and an extreme gradient boosting machine learning model. Compared to the in situ measurements, the SO2 concentrations estimated from the CTM method had similar spatial distributions (r = 0.58) and intra- and interannual variations but were underestimated (slope = 0.24) with a relative percent error of ~75 % and had worsening performance over time. The machine learning method produced more accurate spatial distributions (r = 0.77) and temporal variations, a smaller discrepancy and bias (~30 %; slope = 0.69) and relatively stable performance over time. The machine learning method performed better than the GEOS-Chem method on smaller datasets and timescales with shorter temporal averaging periods. Ultimately, both methods were useful for estimating surface SO2 concentrations since the CTM-based method does not rely on in situ monitoring and produced more reasonable spatial distributions than the machine learning method over areas without surface monitoring data.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Share
Zachary Watson, Can Li, Fei Liu, Sean W. Freeman, Huanxin Zhang, Jun Wang, and Shan-Hu Lee

Status: open (until 06 Jun 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2025-1735', Anonymous Referee #1, 16 May 2025 reply
Zachary Watson, Can Li, Fei Liu, Sean W. Freeman, Huanxin Zhang, Jun Wang, and Shan-Hu Lee
Zachary Watson, Can Li, Fei Liu, Sean W. Freeman, Huanxin Zhang, Jun Wang, and Shan-Hu Lee

Viewed

Total article views: 129 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
105 17 7 129 12 4 3
  • HTML: 105
  • PDF: 17
  • XML: 7
  • Total: 129
  • Supplement: 12
  • BibTeX: 4
  • EndNote: 3
Views and downloads (calculated since 25 Apr 2025)
Cumulative views and downloads (calculated since 25 Apr 2025)

Viewed (geographical distribution)

Total article views: 144 (including HTML, PDF, and XML) Thereof 144 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 16 May 2025
Download
Short summary
Air pollutants like sulfur dioxide cause direct impacts on human health and the environment. Our work estimated surface concentrations from satellite data using atmospheric models and machine learning compared to an air quality monitoring network. We found that both methods can accurately determine the locations and changes in sulfur dioxide, but the machine learning method had better accuracy. Both methods are useful for monitoring air quality in locations without ground-based measurements.
Share