Preprints
https://doi.org/10.5194/egusphere-2023-1260
https://doi.org/10.5194/egusphere-2023-1260
23 Oct 2023
 | 23 Oct 2023

A close look at using national ground stations for the statistical modeling of NO2

Foeke Boersma and Meng Lu

Abstract. Air pollution causes a manifold of negative health and societal problems. It is therefore essential to model and predict air pollution over space. An increasing number of statistical models of air pollution have been developed using geospatial variables associated with air pollution emission and dispersion processes. However, the increasing number of air pollution models does not always equate to an increase in prediction accuracy and uncertainty reduction. An important aspect that is often disregarded is the spatial heterogeneity. In this study, we aim to evaluate and compare various spatial and non-spatial statistical and machine learning methods, with attention given to different spatial groups. Spatial groups are identified by the predictor variables. We found that prediction accuracy differs substantially in different spatial groups. Predictions in places close to roads with high populations show poor prediction accuracy, while prediction accuracy increases in low population density areas for both local and global models. Prediction accuracy is further increased in places that are far from roads for global models. This division into spatial groups also shows that global non-linear methods are capable of higher prediction accuracy than global linear methods. The spatial prediction patterns show that non-linear methods generally predict more smoothly than linear methods. Additionally, clusters of predicted air pollution differ within and between cities. Lastly, applying the same methods to the local dataset yields poor metrics, especially for the non-linear methods.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share

Journal article(s) based on this preprint

02 Oct 2025
A close look at using national ground stations for the statistical modeling of NO2
Foeke Boersma and Meng Lu
Geosci. Model Dev., 18, 6717–6735, https://doi.org/10.5194/gmd-18-6717-2025,https://doi.org/10.5194/gmd-18-6717-2025, 2025
Short summary
Foeke Boersma and Meng Lu

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2023-1260', Anonymous Referee #1, 09 Nov 2023
  • RC2: 'Comment on egusphere-2023-1260', Anonymous Referee #2, 14 Jan 2024
  • AC1: 'Comment on egusphere-2023-1260', Foeke Boersma, 25 Mar 2024

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2023-1260', Anonymous Referee #1, 09 Nov 2023
  • RC2: 'Comment on egusphere-2023-1260', Anonymous Referee #2, 14 Jan 2024
  • AC1: 'Comment on egusphere-2023-1260', Foeke Boersma, 25 Mar 2024

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
AR by Foeke Boersma on behalf of the Authors (26 Mar 2024)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (06 Apr 2024) by Klaus Klingmüller
RR by Anonymous Referee #1 (08 May 2024)
RR by Anonymous Referee #3 (27 May 2024)
ED: Reconsider after major revisions (12 Jun 2024) by Klaus Klingmüller
AR by Foeke Boersma on behalf of the Authors (12 Sep 2024)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (11 Oct 2024) by Klaus Klingmüller
RR by Anonymous Referee #4 (25 Oct 2024)
RR by Anonymous Referee #1 (27 Oct 2024)
ED: Reconsider after major revisions (14 Nov 2024) by Klaus Klingmüller
AR by Foeke Boersma on behalf of the Authors (24 Dec 2024)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (26 Jan 2025) by Klaus Klingmüller
RR by Anonymous Referee #4 (05 Feb 2025)
ED: Publish subject to minor revisions (review by editor) (06 Feb 2025) by Klaus Klingmüller
AR by Foeke Boersma on behalf of the Authors (04 Apr 2025)  Author's response   Author's tracked changes   Manuscript 
ED: Publish subject to minor revisions (review by editor) (09 Apr 2025) by Klaus Klingmüller
AR by Foeke Boersma on behalf of the Authors (11 Apr 2025)  Author's response   Author's tracked changes   Manuscript 
ED: Publish as is (15 Apr 2025) by Klaus Klingmüller
AR by Foeke Boersma on behalf of the Authors (23 Apr 2025)  Author's response   Manuscript 

Journal article(s) based on this preprint

02 Oct 2025
A close look at using national ground stations for the statistical modeling of NO2
Foeke Boersma and Meng Lu
Geosci. Model Dev., 18, 6717–6735, https://doi.org/10.5194/gmd-18-6717-2025,https://doi.org/10.5194/gmd-18-6717-2025, 2025
Short summary
Foeke Boersma and Meng Lu
Foeke Boersma and Meng Lu

Viewed

Total article views: 947 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
716 194 37 947 55 48 76
  • HTML: 716
  • PDF: 194
  • XML: 37
  • Total: 947
  • Supplement: 55
  • BibTeX: 48
  • EndNote: 76
Views and downloads (calculated since 23 Oct 2023)
Cumulative views and downloads (calculated since 23 Oct 2023)

Viewed (geographical distribution)

Total article views: 928 (including HTML, PDF, and XML) Thereof 928 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 02 Oct 2025
Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Short summary
Air pollution harms health and society. Understanding and predicting it is crucial. Various models are developed to model air pollution. However, the consistency exhibited by a model in different areas is commonly neglected. Our study accounts for this and shows lower accuracy near busy roads, but higher in less populated areas. Considering location characteristics in air pollution predictions is important in comparing statistical models and understanding the health-society-space relationship.
Share