the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
A close look at using national ground stations for the statistical modeling of NO2
Abstract. Air pollution causes a manifold of negative health and societal problems. It is therefore essential to model and predict air pollution over space. An increasing number of statistical models of air pollution have been developed using geospatial variables associated with air pollution emission and dispersion processes. However, the increasing number of air pollution models does not always equate to an increase in prediction accuracy and uncertainty reduction. An important aspect that is often disregarded is the spatial heterogeneity. In this study, we aim to evaluate and compare various spatial and non-spatial statistical and machine learning methods, with attention given to different spatial groups. Spatial groups are identified by the predictor variables. We found that prediction accuracy differs substantially in different spatial groups. Predictions in places close to roads with high populations show poor prediction accuracy, while prediction accuracy increases in low population density areas for both local and global models. Prediction accuracy is further increased in places that are far from roads for global models. This division into spatial groups also shows that global non-linear methods are capable of higher prediction accuracy than global linear methods. The spatial prediction patterns show that non-linear methods generally predict more smoothly than linear methods. Additionally, clusters of predicted air pollution differ within and between cities. Lastly, applying the same methods to the local dataset yields poor metrics, especially for the non-linear methods.
- Preprint
(2684 KB) - Metadata XML
-
Supplement
(6298 KB) - BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on egusphere-2023-1260', Anonymous Referee #1, 09 Nov 2023
General comments:
Currently, many studies are using spatially sparse fixed-site measurements to map air pollution on a large scale, ignoring the local spatial heterogeneities such as the intra-city variations. This article evaluated the performance of various algorithms across different scales and validated the accuracy separately in subsets categorized by road density and population density. They found that the model performance varied significantly at different spatial locations. The pattern was found to be different in “global” and local models. The comparison between “global” and local models in terms of intra-city distribution patterns is valuable. However, in its present form, I cannot recommend the article for publication. With substantial revision and restructuring, this article could be a useful addition to the existing literature.
The writing needs further improvement. The current version is not easy to read. First, this is too long. I appreciate the solid work of the authors. But please simplify the main text and consider moving some descriptions/figures to the Appendix. Keep only the core story in the main text and make sure the primary findings and the most important messages stand out. Second, consider restructuring the method/data and the result section. Third, the caption of figures and tables needs more details, including the unit of NO2. Fourth, Clarify definitions like “Far from road” vs “Rural”. Last, please pay attention to the tense usage.
Specific comments:
- The mobile measurements from Kerckhoffs et. al., 2019 were measured on the road. How can they validate the accuracy for the “far from road” group? Did you perform any adjustments?
- Table 1 describes the predictor features. Why not include land use proportions? Land Use Regression models are efficient and well-accepted methods in air pollution modeling.
- Figure 4. It would be better to plot the map of differences between the model tested and the benchmark (i.e., NO2 estimations from Kerckhoffs et. al., 2019). I would be curious about the difference in spatial distributions between the “global” and local models.
- A restructuring of the data/method section is recommended. Begin with the introduction of the data source, ensuring clarity on the source of the population information and road class when discussing spatial groups. Consider adding a table summarizing model input/algorithms for ease of understanding. Move some algorithm introductions to the appendix.
- Please explain why 20-fold cross-validation?
Technical corrections:
I have listed some specific points. But not limited to them.
Abstract:
The abstract attempts to encompass numerous findings but allocates insufficient space to elucidate the methodology and experimental setting. A substantial rephrasing of the abstract is needed.
Line 1-5, toing and froing, can be simplified.
Line 6, please provide more details about the meaning of “spatial heterogeneity” in this context.
Line 9-10 what is the local and global model? Define first, before using it.
Methodology:
Line 100-105, not clear. How do you divide the area? Purpose? What is the time frame of these national measurements? Frequency of measuring? Any preprocessing? More details are needed here. How do you define the less densely populated area? What is the source of the population density data?
Line 121, “rural”= “Far from roads”? Please keep the terminology consistent.
Line 123, the label of models should be provided as the legend in the figure instead of in the caption.
Line 130-135, unit of NO2 is missing. This paragraph is not informative. The values can be integrated into the figure 1.
Line 145, More details about kriging and accuracy are needed.
Line 160-165, is the traffic volume used as the annual average? Table 1. it would help readers to understand the data distribution by adding columns such as numbers and some statistics like mean, median, quantiles etc.
Line 168, the section title should begin with a capital letter, and further refinement is necessary in terms of formatting.
Line 190, not clear. Please do not refer to the citation but to the dataset you have described in section 2.1.
Line 195, rephrase please instead of a direct quote.
Line 196, details of the tuning strategy are missing.
Result and discussion:
Line 465, how do you compare the influence of predictors between cities? The feature importance is a relative value. The magnitude is not meaningful when compared to the other models.
Line 515, which is opposite to the common knowledge (see Hoek et. al., 2008). Can you explain why non-linear model predictions were smoother?
Citation: https://doi.org/10.5194/egusphere-2023-1260-RC1 - RC2: 'Comment on egusphere-2023-1260', Anonymous Referee #2, 14 Jan 2024
-
AC1: 'Comment on egusphere-2023-1260', Foeke Boersma, 25 Mar 2024
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2023/egusphere-2023-1260/egusphere-2023-1260-AC1-supplement.pdf
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
312 | 108 | 27 | 447 | 33 | 19 | 29 |
- HTML: 312
- PDF: 108
- XML: 27
- Total: 447
- Supplement: 33
- BibTeX: 19
- EndNote: 29
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1