the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Gridded surface O3, NOx, and CO abundances for model metrics from the South Korean ground station network
Abstract. We present gridded surface air quality datasets over South Korea for three key species – ozone (O3), carbon monoxide (CO), and nitrogen oxides (NOx) during the timeframe of the Korea–US Air Quality (KORUS–AQ) mission (May–June 2016). The tenth degree hourly averaged abundances are constructed from the 300+ air quality network sites using inverse distance weighting with simple declustering. Cross–comparing the interpolated fields against the site data that was used to create them reveals high prediction skill for O3 (80 %) throughout South Korea, and moderate skill (60 %) for CO and NOx on average in densely observed regions after individual mean bias corrections. The gridded O3 and CO interpolations predict the NASA DC–8 observations in the planetary boundary layer (PBL) with high skill (80 %) in the Seoul Metropolitan Area (SMA) after subtracting the mean bias. DC–8 NOx observations were much less predictable on account of consistently negative vertical gradients within the PBL. Our gridded products capture the mean and variability of O3 throughout South Korea, and of CO and surface NOx in most site–dense urban centres (SMA, Cheongju, Gwangju, Daegu, Changwon, and Busan).
- Preprint
(1718 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on egusphere-2024-1173', Anonymous Referee #1, 13 Oct 2024
This manuscript presented an IDW-based spatial interpolation method and an hourly gridded (0.1 x 0.1 deg) dataset for O3, CO, NOx. The gridded dataset was derived from the interpolated ground site observations in South Korea during the period of KORUS-AQ field campaign. The authors used this approach to mitigate the bias due to uneven density of the ground sites. The interpolation method and the gridded dataset were rigorously tested and analyzed in terms of bias and variability. The gridded dataset described in the manuscript will be useful to assess and improve models. The IDW-based interpolation approach is relatively straightforward and can also be used by researchers to better use the ground network observations. At the same time, it should be recognized that the IDW-based approach may not fully address the effect of microscale meteorological and local emissions, which both can be important under certain conditions. This reviewer believes it would benefit the readers if the authors can add more detailed discussions on the advantages and limitations the IDW-based approach, specifically discussing the statistical test results in the context of microscale meteorological conditions (e.g., wind speed and direction) and local emissions. Another important issue is the need to highlight the difference between weighted average and arithmetic average approaches in three grid cases with low, mid, and high Q values. This can be done by contrasting O3, CO, and NOx values between the gridded data presented in this manuscript and those computed from simple averages.
Specific comments:
- Line 90 – 91: The authors should clarify how the correlation was computed between different sites and discuss if different sites have similar temporal variation patterns with a phase shift.
- Section 3.2: This reviewer would like to raise a question if the better results obtained for O3 is partially attributed to that O3 abundance is not directly influenced by local emissions while the CO and NOx at a given site can be substantially affected by local emissions. It is possible that certain emission events are seen only in a few sites and the IDW interpolation would not be able to predict these observations in the leave-one-out tests. In this context, it would be helpful if the authors can state the limitation of the IDW interpolation under certain conditions.
- Section 4.2: It should be stated in this section that the DC-8 sampling may not be representative of the grids due to limitation of the flight patterns.
- The conclusion section should highlight the advantages and limitations of the IDW interpolation approach.
- The authors should consider adding global attributes and variable attributes (e.g., units) to the gridded netCDF file and make the file more CF compliant, e.g., using CF variable names, like time, lat, and lon. This will enhance the (re)usability and interoperability of the hourly gridded dataset.
Citation: https://doi.org/10.5194/egusphere-2024-1173-RC1 -
AC1: 'Reply on RC1', Calum Wilson, 21 Oct 2024
We thank the reviewer once again for investing their time into our manuscript, and we detail how we addressed the comments in the upcoming manuscript version.
This manuscript presented an IDW-based spatial interpolation method and an hourly gridded (0.1 x 0.1 deg) dataset for O3, CO, NOx. The gridded dataset was derived from the interpolated ground site observations in South Korea during the period of KORUS-AQ field campaign. The authors used this approach to mitigate the bias due to uneven density of the ground sites. The interpolation method and the gridded dataset were rigorously tested and analyzed in terms of bias and variability. The gridded dataset described in the manuscript will be useful to assess and improve models. The IDW-based interpolation approach is relatively straightforward and can also be used by researchers to better use the ground network observations. At the same time, it should be recognized that the IDW-based approach may not fully address the effect of microscale meteorological and local emissions, which both can be important under certain conditions. This reviewer believes it would benefit the readers if the authors can add more detailed discussions on the advantages and limitations the IDW-based approach, specifically discussing the statistical test results in the context of microscale meteorological conditions (e.g., wind speed and direction) and local emissions. Another important issue is the need to highlight the difference between weighted average and arithmetic average approaches in three grid cases with low, mid, and high Q values. This can be done by contrasting O3, CO, and NOx values between the gridded data presented in this manuscript and those computed from simple averages.
We have added a discussion of the IDW vs. Arithmetic Mean techniques, noting significant differences in e.g. the Seoul Metropolitan Area, where IDW was previously shown to achieve good predictability.
Line 90 – 91: The authors should clarify how the correlation was computed between different sites and discuss if different sites have similar temporal variation patterns with a phase shift.We have clarified how the correlation was computed and added further analysis of the site autocorrelations.
Section 3.2: This reviewer would like to raise a question if the better results obtained for O3 is partially attributed to that O3 abundance is not directly influenced by local emissions while the CO and NOx at a given site can be substantially affected by local emissions. It is possible that certain emission events are seen only in a few sites and the IDW interpolation would not be able to predict these observations in the leave-one-out tests. In this context, it would be helpful if the authors can state the limitation of the IDW interpolation under certain conditions.
We agree with the reviewer in our arguments on lines 164 to 165, but have added the clarification that O3 is not directly emitted, unlike the other species. We added a description of the IDW limitations in the conclusion.
Section 4.2: It should be stated in this section that the DC-8 sampling may not be representative of the grids due to limitation of the flight patterns.
We updated our manuscript to acknowledge this fact.
The conclusion section should highlight the advantages and limitations of the IDW interpolation approach.
We have added a discussion of IDW vs. arithmetic mean technique in the conclusion and described the limitations of IDW, noting how some alternative techniques could address these limitations.
The authors should consider adding global attributes and variable attributes (e.g., units) to the gridded netCDF file and make the file more CF compliant, e.g., using CF variable names, like time, lat, and lon. This will enhance the (re)usability and interoperability of the hourly gridded dataset.
Great idea, thanks. We have updated our datasets to better comply with the CF standard. We use more conventional variable and dimension aliases (e.g. lat, lon, time) along with units, long_name, and description attributes. We have added global attributes that specify where found the data and how we processed it.
Citation: https://doi.org/10.5194/egusphere-2024-1173-AC1
-
RC2: 'Comment on egusphere-2024-1173', Anonymous Referee #3, 11 Dec 2024
Review on “Gridded surface O3, NOx, and CO abundances for model metrics from the South Korean ground station network”
Traditionally, simple grid-cell averages have been used to test and analyze the results of regional air quality models. However, spatial heterogeneity, local isolated emission sources and data uncertainties have often raised questions about the representativeness of this approach. This study represents a valuable and scientifically innovative effort to assess the validity of such methods and to explore potential alternatives. The results are not only in line with the aims of the journal, but also have significant potential for wider scientific impact, as they can contribute to critical assessments such as determining the suitability of measurement sites, identifying observational biases, and estimating contributions from local isolated sources. Nevertheless, this manuscript raises several important issues that require further clarification and resolution before publication.
Chemical species such as O3, CO and NOx are known to exhibit significant spatial variability due to various chemical and physical factors, as well as local emission sources and observational biases. To better highlight the performance of the method proposed by the authors for these chemical species, would it not be more effective to compare them to meteorological or physical variables such as temperature, which tend to have less spatial variability and are less prone to observational bias? Analyzing these stable variables using the same method and using them as a reference might provide a clearer context for interpreting the variability of the chemical species, more in line with the authors' aims.
In many cases, treating O3 and NO2 together as Ox (= O3 + NO2) rather than separately provides more robust results when comparing model outputs with observations. Would it not be valuable to assess how the results differ when these species are analyzed as a single group (Ox) compared to analyzing O3 and NOx separately? Such an evaluation could provide additional insight into the robustness and reliability of the proposed method.
I agree that the method proposed in this study produces significantly better results compared to simple grid-cell averages. However, the conclusions drawn regarding the predictive accuracy of IDW interpolation and the effectiveness of leave-one-out cross-validation (LOOCV) raise some questions. While LOOCV provides a useful validation of the interpolation method, does it sufficiently address the inherent limitations of IDW in capturing spatial heterogeneity and clustering effects in areas of uneven data distribution and local outliers? For example, poor prediction accuracy in specific regions such as Gwangyang, Yeosu, Suncheon, and Ulsan may indicate limitations of the IDW approach itself, beyond what bias corrections or LOOCV validation can mitigate for isolated outliers. Would alternative interpolation methods, such as kriging or hybrid approaches, incorporating geostatistical models with spatial emission source distributions, better address these challenges? Such a comparison would help determine whether IDW is indeed the most suitable choice for this dataset
Page 2 line 37-38: It is unclear from the manuscript whether the data from the AirKorea monitoring network were obtained directly from the official data center such as NIER, where they are subject to QA/QC management, or whether they were downloaded directly from the AirKorea website (https://www.airkorea.or.kr/eng). If the data were obtained from the AirKorea website, it is important to note that the data available there are real-time observations with minimal QA/QC and may contain some errors. This could potentially contribute to the observed elevated 𝐸1(𝑡) values. Clarification of this aspect seems necessary.
It would be beneficial if the methods and results sections provided more detailed explanations on the following points:
- A comparison of prediction errors across different regions, particularly between data-dense and data-sparse areas, to evaluate the robustness of the interpolation method under varying observation densities. Perhaps Taehwa and Olympic Park can be used for this purpose?
- A clear explanation of how missing or erroneous observations were handled during the analysis. Additionally, an assessment of whether the chosen approach introduces bias into the interpolation results would strengthen the reliability of the conclusions.
- Sensitivity analyses on the parameters (β, D, L) to ensure that the selected values are truly optimal and not overly reliant on specific conditions within the dataset.
page 7. Line 168 Jiju -> Jinju ?
Citation: https://doi.org/10.5194/egusphere-2024-1173-RC2
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
178 | 80 | 23 | 281 | 2 | 4 |
- HTML: 178
- PDF: 80
- XML: 23
- Total: 281
- BibTeX: 2
- EndNote: 4
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1