Gridded surface O<sub>3</sub>, NO<sub>x</sub>, and CO abundances for model metrics from the South Korean ground station network

Wilson, Calum Patrick; Prather, Michael John

doi:https://doi.org/10.5194/egusphere-2024-1173

Preprints

https://doi.org/10.5194/egusphere-2024-1173

Preprints

22 Aug 2024

| 22 Aug 2024

Gridded surface O₃, NO_x, and CO abundances for model metrics from the South Korean ground station network

Calum Patrick Wilson and Michael John Prather

Abstract. We present gridded surface air quality datasets over South Korea for three key species – ozone (O₃), carbon monoxide (CO), and nitrogen oxides (NO_x) during the timeframe of the Korea–US Air Quality (KORUS–AQ) mission (May–June 2016). The tenth degree hourly averaged abundances are constructed from the 300+ air quality network sites using inverse distance weighting with simple declustering. Cross–comparing the interpolated fields against the site data that was used to create them reveals high prediction skill for O₃ (80 %) throughout South Korea, and moderate skill (60 %) for CO and NO_x on average in densely observed regions after individual mean bias corrections. The gridded O₃ and CO interpolations predict the NASA DC–8 observations in the planetary boundary layer (PBL) with high skill (80 %) in the Seoul Metropolitan Area (SMA) after subtracting the mean bias. DC–8 NO_x observations were much less predictable on account of consistently negative vertical gradients within the PBL. Our gridded products capture the mean and variability of O₃ throughout South Korea, and of CO and surface NO_x in most site–dense urban centres (SMA, Cheongju, Gwangju, Daegu, Changwon, and Busan).

Received: 18 Apr 2024 – Discussion started: 22 Aug 2024

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Download & links

Preprint (PDF, 1718 KB)

Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
Preprint (1718 KB)

Download & links

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Journal article(s) based on this preprint

23 Apr 2025

Gridded surface O₃, NO_x, and CO abundances for model metrics from the South Korean ground station network

Calum P. Wilson and Michael J. Prather

Atmos. Meas. Tech., 18, 1757–1769, https://doi.org/10.5194/amt-18-1757-2025,https://doi.org/10.5194/amt-18-1757-2025, 2025

Short summary

Calum Patrick Wilson and Michael John Prather

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2024-1173', Anonymous Referee #1, 13 Oct 2024
This manuscript presented an IDW-based spatial interpolation method and an hourly gridded (0.1 x 0.1 deg) dataset for O3, CO, NOx. The gridded dataset was derived from the interpolated ground site observations in South Korea during the period of KORUS-AQ field campaign. The authors used this approach to mitigate the bias due to uneven density of the ground sites. The interpolation method and the gridded dataset were rigorously tested and analyzed in terms of bias and variability. The gridded dataset described in the manuscript will be useful to assess and improve models. The IDW-based interpolation approach is relatively straightforward and can also be used by researchers to better use the ground network observations. At the same time, it should be recognized that the IDW-based approach may not fully address the effect of microscale meteorological and local emissions, which both can be important under certain conditions. This reviewer believes it would benefit the readers if the authors can add more detailed discussions on the advantages and limitations the IDW-based approach, specifically discussing the statistical test results in the context of microscale meteorological conditions (e.g., wind speed and direction) and local emissions. Another important issue is the need to highlight the difference between weighted average and arithmetic average approaches in three grid cases with low, mid, and high Q values. This can be done by contrasting O3, CO, and NOx values between the gridded data presented in this manuscript and those computed from simple averages.
Specific comments:
Line 90 – 91: The authors should clarify how the correlation was computed between different sites and discuss if different sites have similar temporal variation patterns with a phase shift.

Section 3.2: This reviewer would like to raise a question if the better results obtained for O3 is partially attributed to that O3 abundance is not directly influenced by local emissions while the CO and NOx at a given site can be substantially affected by local emissions. It is possible that certain emission events are seen only in a few sites and the IDW interpolation would not be able to predict these observations in the leave-one-out tests. In this context, it would be helpful if the authors can state the limitation of the IDW interpolation under certain conditions.

Section 4.2: It should be stated in this section that the DC-8 sampling may not be representative of the grids due to limitation of the flight patterns.

The conclusion section should highlight the advantages and limitations of the IDW interpolation approach.

The authors should consider adding global attributes and variable attributes (e.g., units) to the gridded netCDF file and make the file more CF compliant, e.g., using CF variable names, like time, lat, and lon. This will enhance the (re)usability and interoperability of the hourly gridded dataset.
Citation: https://doi.org/10.5194/egusphere-2024-1173-RC1
- AC1: 'Reply on RC1', Calum Wilson, 21 Oct 2024
  
  We thank the reviewer once again for investing their time into our manuscript, and we detail how we addressed the comments in the upcoming manuscript version.
  This manuscript presented an IDW-based spatial interpolation method and an hourly gridded (0.1 x 0.1 deg) dataset for O3, CO, NOx. The gridded dataset was derived from the interpolated ground site observations in South Korea during the period of KORUS-AQ field campaign. The authors used this approach to mitigate the bias due to uneven density of the ground sites. The interpolation method and the gridded dataset were rigorously tested and analyzed in terms of bias and variability. The gridded dataset described in the manuscript will be useful to assess and improve models. The IDW-based interpolation approach is relatively straightforward and can also be used by researchers to better use the ground network observations. At the same time, it should be recognized that the IDW-based approach may not fully address the effect of microscale meteorological and local emissions, which both can be important under certain conditions. This reviewer believes it would benefit the readers if the authors can add more detailed discussions on the advantages and limitations the IDW-based approach, specifically discussing the statistical test results in the context of microscale meteorological conditions (e.g., wind speed and direction) and local emissions. Another important issue is the need to highlight the difference between weighted average and arithmetic average approaches in three grid cases with low, mid, and high Q values. This can be done by contrasting O3, CO, and NOx values between the gridded data presented in this manuscript and those computed from simple averages.
  We have added a discussion of the IDW vs. Arithmetic Mean techniques, noting significant differences in e.g. the Seoul Metropolitan Area, where IDW was previously shown to achieve good predictability.
  
  Line 90 – 91: The authors should clarify how the correlation was computed between different sites and discuss if different sites have similar temporal variation patterns with a phase shift.
  We have clarified how the correlation was computed and added further analysis of the site autocorrelations.
  Section 3.2: This reviewer would like to raise a question if the better results obtained for O3 is partially attributed to that O3 abundance is not directly influenced by local emissions while the CO and NOx at a given site can be substantially affected by local emissions. It is possible that certain emission events are seen only in a few sites and the IDW interpolation would not be able to predict these observations in the leave-one-out tests. In this context, it would be helpful if the authors can state the limitation of the IDW interpolation under certain conditions.
  We agree with the reviewer in our arguments on lines 164 to 165, but have added the clarification that O3 is not directly emitted, unlike the other species. We added a description of the IDW limitations in the conclusion.
  Section 4.2: It should be stated in this section that the DC-8 sampling may not be representative of the grids due to limitation of the flight patterns.
  We updated our manuscript to acknowledge this fact.
  The conclusion section should highlight the advantages and limitations of the IDW interpolation approach.
  We have added a discussion of IDW vs. arithmetic mean technique in the conclusion and described the limitations of IDW, noting how some alternative techniques could address these limitations.
  The authors should consider adding global attributes and variable attributes (e.g., units) to the gridded netCDF file and make the file more CF compliant, e.g., using CF variable names, like time, lat, and lon. This will enhance the (re)usability and interoperability of the hourly gridded dataset.
  Great idea, thanks. We have updated our datasets to better comply with the CF standard. We use more conventional variable and dimension aliases (e.g. lat, lon, time) along with units, long_name, and description attributes. We have added global attributes that specify where found the data and how we processed it.
  
  Citation: https://doi.org/10.5194/egusphere-2024-1173-AC1
RC2:
'Comment on egusphere-2024-1173', Anonymous Referee #3, 11 Dec 2024
Review on “Gridded surface O₃, NOx, and CO abundances for model metrics from the South Korean ground station network”
Traditionally, simple grid-cell averages have been used to test and analyze the results of regional air quality models. However, spatial heterogeneity, local isolated emission sources and data uncertainties have often raised questions about the representativeness of this approach. This study represents a valuable and scientifically innovative effort to assess the validity of such methods and to explore potential alternatives. The results are not only in line with the aims of the journal, but also have significant potential for wider scientific impact, as they can contribute to critical assessments such as determining the suitability of measurement sites, identifying observational biases, and estimating contributions from local isolated sources. Nevertheless, this manuscript raises several important issues that require further clarification and resolution before publication.
Chemical species such as O₃, CO and NOx are known to exhibit significant spatial variability due to various chemical and physical factors, as well as local emission sources and observational biases. To better highlight the performance of the method proposed by the authors for these chemical species, would it not be more effective to compare them to meteorological or physical variables such as temperature, which tend to have less spatial variability and are less prone to observational bias? Analyzing these stable variables using the same method and using them as a reference might provide a clearer context for interpreting the variability of the chemical species, more in line with the authors' aims.
In many cases, treating O₃ and NO₂ together as Ox (= O₃ + NO₂) rather than separately provides more robust results when comparing model outputs with observations. Would it not be valuable to assess how the results differ when these species are analyzed as a single group (Ox) compared to analyzing O₃ and NOx separately? Such an evaluation could provide additional insight into the robustness and reliability of the proposed method.
I agree that the method proposed in this study produces significantly better results compared to simple grid-cell averages. However, the conclusions drawn regarding the predictive accuracy of IDW interpolation and the effectiveness of leave-one-out cross-validation (LOOCV) raise some questions. While LOOCV provides a useful validation of the interpolation method, does it sufficiently address the inherent limitations of IDW in capturing spatial heterogeneity and clustering effects in areas of uneven data distribution and local outliers? For example, poor prediction accuracy in specific regions such as Gwangyang, Yeosu, Suncheon, and Ulsan may indicate limitations of the IDW approach itself, beyond what bias corrections or LOOCV validation can mitigate for isolated outliers. Would alternative interpolation methods, such as kriging or hybrid approaches, incorporating geostatistical models with spatial emission source distributions, better address these challenges? Such a comparison would help determine whether IDW is indeed the most suitable choice for this dataset
Page 2 line 37-38: It is unclear from the manuscript whether the data from the AirKorea monitoring network were obtained directly from the official data center such as NIER, where they are subject to QA/QC management, or whether they were downloaded directly from the AirKorea website (https://www.airkorea.or.kr/eng). If the data were obtained from the AirKorea website, it is important to note that the data available there are real-time observations with minimal QA/QC and may contain some errors. This could potentially contribute to the observed elevated 𝐸1(𝑡) values. Clarification of this aspect seems necessary.
It would be beneficial if the methods and results sections provided more detailed explanations on the following points:
A comparison of prediction errors across different regions, particularly between data-dense and data-sparse areas, to evaluate the robustness of the interpolation method under varying observation densities. Perhaps Taehwa and Olympic Park can be used for this purpose?

A clear explanation of how missing or erroneous observations were handled during the analysis. Additionally, an assessment of whether the chosen approach introduces bias into the interpolation results would strengthen the reliability of the conclusions.

Sensitivity analyses on the parameters (β, D, L) to ensure that the selected values are truly optimal and not overly reliant on specific conditions within the dataset.

page 7. Line 168 Jiju -> Jinju ?
Citation: https://doi.org/10.5194/egusphere-2024-1173-RC2
- AC2: 'Reply on RC2', Calum Wilson, 04 Jan 2025
  
  Traditionally, simple grid-cell averages have been used to test and analyze the results of regional air quality models. However, spatial heterogeneity, local isolated emission sources and data uncertainties have often raised questions about the representativeness of this approach. This study represents a valuable and scientifically innovative effort to assess the validity of such methods and to explore potential alternatives. The results are not only in line with the aims of the journal, but also have significant potential for wider scientific impact, as they can contribute to critical assessments such as determining the suitability of measurement sites, identifying observational biases, and estimating contributions from local isolated sources. Nevertheless, this manuscript raises several important issues that require further clarification and resolution before publication.
  We thank you for your insights and your recognition of the value in gridding the site data. We offer our responses below.
  c1) Chemical species such as O3, CO and NOx are known to exhibit significant spatial variability due to various chemical and physical factors, as well as local emission sources and observational biases. To better highlight the performance of the method proposed by the authors for these chemical species, would it not be more effective to compare them to meteorological or physical variables such as temperature, which tend to have less spatial variability and are less prone to observational bias? Analyzing these stable variables using the same method and using them as a reference might provide a clearer context for interpreting the variability of the chemical species, more in line with the authors' aims.
  r1) Use of meteorological quantities is interesting but these are not available for the sites, and it brings new uncertainty as to the cause of biases that we cannot ascertain (e.g., surface and building albedo, urban heat island effects, orography). In the new Appendicised Figure C2 added to the paper, we examine the site predictability during daytime vs. nighttime. Daytime O3 predictability is higher than nighttime across South Korea by at least +10% points , and similarly a +5% increase is seen in urban (high emission) areas for NOx. This new work suggests that the comparatively turbulent daytime PBL mixes local (NOx) emissions more efficiently while nighttime stability produces sharper (and less predictable) concentration gradients. Discussion has been added to Section 3.2.1.
  c2) In many cases, treating O3 and NO2 together as Ox (= O3 + NO2) rather than separately provides more robust results when comparing model outputs with observations. Would it not be valuable to assess how the results differ when these species are analyzed as a single group (Ox) compared to analyzing O3 and NOx separately? Such an evaluation could provide additional insight into the robustness and reliability of the proposed method.
  r2) We kept O3 and NOx separate because they are fundamentally separate chemical-model diagnostics and their spatial variability has different patterns. Further, Ox is a derived quantity for the models and sometimes even includes higher nitrates (e.g., NO3 = 2 Ox).
  
  c3) I agree that the method proposed in this study produces significantly better results compared to simple grid-cell averages. However, the conclusions drawn regarding the predictive accuracy of IDW interpolation and the effectiveness of leave-one-out cross-validation (LOOCV) raise some questions. While LOOCV provides a useful validation of the interpolation method, does it sufficiently address the inherent limitations of IDW in capturing spatial heterogeneity and clustering effects in areas of uneven data distribution and local outliers? For example, poor prediction accuracy in specific regions such as Gwangyang, Yeosu, Suncheon, and Ulsan may indicate limitations of the IDW approach itself, beyond what bias corrections or LOOCV validation can mitigate for isolated outliers. Would alternative interpolation methods, such as kriging or hybrid approaches, incorporating geostatistical models with spatial emission source distributions, better address these challenges? Such a comparison would help determine whether IDW is indeed the most suitable choice for this dataset
  r3) A full parallel analysis with kriging and hybrid methods is beyond the scope of the work here, although it would be informative. It is reasonably certain that IDW (and other methods) outperform the arithmetic mean gridding approach that much of the community has been using. We explained why Kriging methods are inappropriate, essentially because we can’t establish a relationship between covariance and proximity for any species. It would be valuable to apply a land-use regression model for NOx in particular, since this could fix the projection of roadside NOx levels into wilderness sites such as Taehwa Forest and establish 1) whether the interpolation improves in coastal cities, and 2) how this improvement changes the gridded averages. Such endeavours require data and resources that are not immediately available to us, but we have noted these alternative techniques in the conclusion.
  c4) Page 2 line 37-38: It is unclear from the manuscript whether the data from the AirKorea monitoring network were obtained directly from the official data center such as NIER, where they are subject to QA/QC management, or whether they were downloaded directly from the AirKorea website (https://www.airkorea.or.kr/eng). If the data were obtained from the AirKorea website, it is important to note that the data available there are real-time observations with minimal QA/QC and may contain some errors. This could potentially contribute to the observed elevated 𝐸1(𝑡) values. Clarification of this aspect seems necessary.
  r4) This is an important distinction. https://www.airkorea.or.kr/ has often been cited as the original source of NIER air quality data (e.g. Crawford et al., 2021), but we could not access the data directly from the website, and instead used the access-controlled raw data from the KORUS-AQ period stored in the NASA archive: https://asdc.larc.nasa.gov/project/KORUS-AQ. We have now published our QA/QC flags for the raw data in our repository (https://doi.org/10.5061/dryad.sf7m0cgf5) and indicated this in Section 2, stating more explicitly where we got the data.
  It would be beneficial if the methods and results sections provided more detailed explanations on the following points:
  c5) A comparison of prediction errors across different regions, particularly between data-dense and data-sparse areas, to evaluate the robustness of the interpolation method under varying observation densities. Perhaps Taehwa and Olympic Park can be used for this purpose?
  r5) Our new additions to Figure A3 provide interesting insights into the interpolation robustness. By examining the improvement of the complete interpolation over the LOOCV interpolation, we highlight regions that were sufficiently sampled (little improvement, e.g. the Seoul Metropolitan Area) vs. undersampled (e.g. rural sites and some city districts). This discussion has been included in Section 3.2.1.
  c6) A clear explanation of how missing or erroneous observations were handled during the analysis. Additionally, an assessment of whether the chosen approach introduces bias into the interpolation results would strengthen the reliability of the conclusions.
  r6) This was an oversight in our methodology discussion: we were not explicit in how missing or erroneous data were treated by the interpolation and subsequent analysis. We have clarified this in Section 2.1. In summary, stations with missing data (or errors that we flagged) were excluded from the interpolation on an hourly basis and discounted from the cluster radius (D) of other stations for declustering purposes. Comparing LOOCV predictions with the complete interpolation provides a metric on the impact of station malfunction on predictability, which we have now charted in Figure A3.
  c7) Sensitivity analyses on the parameters (β, D, L) to ensure that the selected values are truly optimal and not overly reliant on specific conditions within the dataset.
  r7) We have added a statement on how we optimised the parameters: a recursive approach where we fix D and iterate through β, then fix β and iterate D, until a 2D minimum is reached (L is a time-saving constraint that does not significantly affect the global interpolation error above a certain threshold). We omitted sensitivity analysis because the 2D minimae are very similar for all species and shallow, i.e., the global interpolation error is not sensitive to changing the parameters.
  c8) page 7. Line 168 Jiju -> Jinju ?
  r8) Thank you for spotting this, it has been corrected.
  
  Citation: https://doi.org/10.5194/egusphere-2024-1173-AC2

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2024-1173', Anonymous Referee #1, 13 Oct 2024
This manuscript presented an IDW-based spatial interpolation method and an hourly gridded (0.1 x 0.1 deg) dataset for O3, CO, NOx. The gridded dataset was derived from the interpolated ground site observations in South Korea during the period of KORUS-AQ field campaign. The authors used this approach to mitigate the bias due to uneven density of the ground sites. The interpolation method and the gridded dataset were rigorously tested and analyzed in terms of bias and variability. The gridded dataset described in the manuscript will be useful to assess and improve models. The IDW-based interpolation approach is relatively straightforward and can also be used by researchers to better use the ground network observations. At the same time, it should be recognized that the IDW-based approach may not fully address the effect of microscale meteorological and local emissions, which both can be important under certain conditions. This reviewer believes it would benefit the readers if the authors can add more detailed discussions on the advantages and limitations the IDW-based approach, specifically discussing the statistical test results in the context of microscale meteorological conditions (e.g., wind speed and direction) and local emissions. Another important issue is the need to highlight the difference between weighted average and arithmetic average approaches in three grid cases with low, mid, and high Q values. This can be done by contrasting O3, CO, and NOx values between the gridded data presented in this manuscript and those computed from simple averages.
Specific comments:
Line 90 – 91: The authors should clarify how the correlation was computed between different sites and discuss if different sites have similar temporal variation patterns with a phase shift.

Section 3.2: This reviewer would like to raise a question if the better results obtained for O3 is partially attributed to that O3 abundance is not directly influenced by local emissions while the CO and NOx at a given site can be substantially affected by local emissions. It is possible that certain emission events are seen only in a few sites and the IDW interpolation would not be able to predict these observations in the leave-one-out tests. In this context, it would be helpful if the authors can state the limitation of the IDW interpolation under certain conditions.

Section 4.2: It should be stated in this section that the DC-8 sampling may not be representative of the grids due to limitation of the flight patterns.

The conclusion section should highlight the advantages and limitations of the IDW interpolation approach.

The authors should consider adding global attributes and variable attributes (e.g., units) to the gridded netCDF file and make the file more CF compliant, e.g., using CF variable names, like time, lat, and lon. This will enhance the (re)usability and interoperability of the hourly gridded dataset.
Citation: https://doi.org/10.5194/egusphere-2024-1173-RC1
- AC1: 'Reply on RC1', Calum Wilson, 21 Oct 2024
  
  We thank the reviewer once again for investing their time into our manuscript, and we detail how we addressed the comments in the upcoming manuscript version.
  This manuscript presented an IDW-based spatial interpolation method and an hourly gridded (0.1 x 0.1 deg) dataset for O3, CO, NOx. The gridded dataset was derived from the interpolated ground site observations in South Korea during the period of KORUS-AQ field campaign. The authors used this approach to mitigate the bias due to uneven density of the ground sites. The interpolation method and the gridded dataset were rigorously tested and analyzed in terms of bias and variability. The gridded dataset described in the manuscript will be useful to assess and improve models. The IDW-based interpolation approach is relatively straightforward and can also be used by researchers to better use the ground network observations. At the same time, it should be recognized that the IDW-based approach may not fully address the effect of microscale meteorological and local emissions, which both can be important under certain conditions. This reviewer believes it would benefit the readers if the authors can add more detailed discussions on the advantages and limitations the IDW-based approach, specifically discussing the statistical test results in the context of microscale meteorological conditions (e.g., wind speed and direction) and local emissions. Another important issue is the need to highlight the difference between weighted average and arithmetic average approaches in three grid cases with low, mid, and high Q values. This can be done by contrasting O3, CO, and NOx values between the gridded data presented in this manuscript and those computed from simple averages.
  We have added a discussion of the IDW vs. Arithmetic Mean techniques, noting significant differences in e.g. the Seoul Metropolitan Area, where IDW was previously shown to achieve good predictability.
  
  Line 90 – 91: The authors should clarify how the correlation was computed between different sites and discuss if different sites have similar temporal variation patterns with a phase shift.
  We have clarified how the correlation was computed and added further analysis of the site autocorrelations.
  Section 3.2: This reviewer would like to raise a question if the better results obtained for O3 is partially attributed to that O3 abundance is not directly influenced by local emissions while the CO and NOx at a given site can be substantially affected by local emissions. It is possible that certain emission events are seen only in a few sites and the IDW interpolation would not be able to predict these observations in the leave-one-out tests. In this context, it would be helpful if the authors can state the limitation of the IDW interpolation under certain conditions.
  We agree with the reviewer in our arguments on lines 164 to 165, but have added the clarification that O3 is not directly emitted, unlike the other species. We added a description of the IDW limitations in the conclusion.
  Section 4.2: It should be stated in this section that the DC-8 sampling may not be representative of the grids due to limitation of the flight patterns.
  We updated our manuscript to acknowledge this fact.
  The conclusion section should highlight the advantages and limitations of the IDW interpolation approach.
  We have added a discussion of IDW vs. arithmetic mean technique in the conclusion and described the limitations of IDW, noting how some alternative techniques could address these limitations.
  The authors should consider adding global attributes and variable attributes (e.g., units) to the gridded netCDF file and make the file more CF compliant, e.g., using CF variable names, like time, lat, and lon. This will enhance the (re)usability and interoperability of the hourly gridded dataset.
  Great idea, thanks. We have updated our datasets to better comply with the CF standard. We use more conventional variable and dimension aliases (e.g. lat, lon, time) along with units, long_name, and description attributes. We have added global attributes that specify where found the data and how we processed it.
  
  Citation: https://doi.org/10.5194/egusphere-2024-1173-AC1
RC2:
'Comment on egusphere-2024-1173', Anonymous Referee #3, 11 Dec 2024
Review on “Gridded surface O₃, NOx, and CO abundances for model metrics from the South Korean ground station network”
Traditionally, simple grid-cell averages have been used to test and analyze the results of regional air quality models. However, spatial heterogeneity, local isolated emission sources and data uncertainties have often raised questions about the representativeness of this approach. This study represents a valuable and scientifically innovative effort to assess the validity of such methods and to explore potential alternatives. The results are not only in line with the aims of the journal, but also have significant potential for wider scientific impact, as they can contribute to critical assessments such as determining the suitability of measurement sites, identifying observational biases, and estimating contributions from local isolated sources. Nevertheless, this manuscript raises several important issues that require further clarification and resolution before publication.
Chemical species such as O₃, CO and NOx are known to exhibit significant spatial variability due to various chemical and physical factors, as well as local emission sources and observational biases. To better highlight the performance of the method proposed by the authors for these chemical species, would it not be more effective to compare them to meteorological or physical variables such as temperature, which tend to have less spatial variability and are less prone to observational bias? Analyzing these stable variables using the same method and using them as a reference might provide a clearer context for interpreting the variability of the chemical species, more in line with the authors' aims.
In many cases, treating O₃ and NO₂ together as Ox (= O₃ + NO₂) rather than separately provides more robust results when comparing model outputs with observations. Would it not be valuable to assess how the results differ when these species are analyzed as a single group (Ox) compared to analyzing O₃ and NOx separately? Such an evaluation could provide additional insight into the robustness and reliability of the proposed method.
I agree that the method proposed in this study produces significantly better results compared to simple grid-cell averages. However, the conclusions drawn regarding the predictive accuracy of IDW interpolation and the effectiveness of leave-one-out cross-validation (LOOCV) raise some questions. While LOOCV provides a useful validation of the interpolation method, does it sufficiently address the inherent limitations of IDW in capturing spatial heterogeneity and clustering effects in areas of uneven data distribution and local outliers? For example, poor prediction accuracy in specific regions such as Gwangyang, Yeosu, Suncheon, and Ulsan may indicate limitations of the IDW approach itself, beyond what bias corrections or LOOCV validation can mitigate for isolated outliers. Would alternative interpolation methods, such as kriging or hybrid approaches, incorporating geostatistical models with spatial emission source distributions, better address these challenges? Such a comparison would help determine whether IDW is indeed the most suitable choice for this dataset
Page 2 line 37-38: It is unclear from the manuscript whether the data from the AirKorea monitoring network were obtained directly from the official data center such as NIER, where they are subject to QA/QC management, or whether they were downloaded directly from the AirKorea website (https://www.airkorea.or.kr/eng). If the data were obtained from the AirKorea website, it is important to note that the data available there are real-time observations with minimal QA/QC and may contain some errors. This could potentially contribute to the observed elevated 𝐸1(𝑡) values. Clarification of this aspect seems necessary.
It would be beneficial if the methods and results sections provided more detailed explanations on the following points:
A comparison of prediction errors across different regions, particularly between data-dense and data-sparse areas, to evaluate the robustness of the interpolation method under varying observation densities. Perhaps Taehwa and Olympic Park can be used for this purpose?

A clear explanation of how missing or erroneous observations were handled during the analysis. Additionally, an assessment of whether the chosen approach introduces bias into the interpolation results would strengthen the reliability of the conclusions.

Sensitivity analyses on the parameters (β, D, L) to ensure that the selected values are truly optimal and not overly reliant on specific conditions within the dataset.

page 7. Line 168 Jiju -> Jinju ?
Citation: https://doi.org/10.5194/egusphere-2024-1173-RC2
- AC2: 'Reply on RC2', Calum Wilson, 04 Jan 2025
  
  Traditionally, simple grid-cell averages have been used to test and analyze the results of regional air quality models. However, spatial heterogeneity, local isolated emission sources and data uncertainties have often raised questions about the representativeness of this approach. This study represents a valuable and scientifically innovative effort to assess the validity of such methods and to explore potential alternatives. The results are not only in line with the aims of the journal, but also have significant potential for wider scientific impact, as they can contribute to critical assessments such as determining the suitability of measurement sites, identifying observational biases, and estimating contributions from local isolated sources. Nevertheless, this manuscript raises several important issues that require further clarification and resolution before publication.
  We thank you for your insights and your recognition of the value in gridding the site data. We offer our responses below.
  c1) Chemical species such as O3, CO and NOx are known to exhibit significant spatial variability due to various chemical and physical factors, as well as local emission sources and observational biases. To better highlight the performance of the method proposed by the authors for these chemical species, would it not be more effective to compare them to meteorological or physical variables such as temperature, which tend to have less spatial variability and are less prone to observational bias? Analyzing these stable variables using the same method and using them as a reference might provide a clearer context for interpreting the variability of the chemical species, more in line with the authors' aims.
  r1) Use of meteorological quantities is interesting but these are not available for the sites, and it brings new uncertainty as to the cause of biases that we cannot ascertain (e.g., surface and building albedo, urban heat island effects, orography). In the new Appendicised Figure C2 added to the paper, we examine the site predictability during daytime vs. nighttime. Daytime O3 predictability is higher than nighttime across South Korea by at least +10% points , and similarly a +5% increase is seen in urban (high emission) areas for NOx. This new work suggests that the comparatively turbulent daytime PBL mixes local (NOx) emissions more efficiently while nighttime stability produces sharper (and less predictable) concentration gradients. Discussion has been added to Section 3.2.1.
  c2) In many cases, treating O3 and NO2 together as Ox (= O3 + NO2) rather than separately provides more robust results when comparing model outputs with observations. Would it not be valuable to assess how the results differ when these species are analyzed as a single group (Ox) compared to analyzing O3 and NOx separately? Such an evaluation could provide additional insight into the robustness and reliability of the proposed method.
  r2) We kept O3 and NOx separate because they are fundamentally separate chemical-model diagnostics and their spatial variability has different patterns. Further, Ox is a derived quantity for the models and sometimes even includes higher nitrates (e.g., NO3 = 2 Ox).
  
  c3) I agree that the method proposed in this study produces significantly better results compared to simple grid-cell averages. However, the conclusions drawn regarding the predictive accuracy of IDW interpolation and the effectiveness of leave-one-out cross-validation (LOOCV) raise some questions. While LOOCV provides a useful validation of the interpolation method, does it sufficiently address the inherent limitations of IDW in capturing spatial heterogeneity and clustering effects in areas of uneven data distribution and local outliers? For example, poor prediction accuracy in specific regions such as Gwangyang, Yeosu, Suncheon, and Ulsan may indicate limitations of the IDW approach itself, beyond what bias corrections or LOOCV validation can mitigate for isolated outliers. Would alternative interpolation methods, such as kriging or hybrid approaches, incorporating geostatistical models with spatial emission source distributions, better address these challenges? Such a comparison would help determine whether IDW is indeed the most suitable choice for this dataset
  r3) A full parallel analysis with kriging and hybrid methods is beyond the scope of the work here, although it would be informative. It is reasonably certain that IDW (and other methods) outperform the arithmetic mean gridding approach that much of the community has been using. We explained why Kriging methods are inappropriate, essentially because we can’t establish a relationship between covariance and proximity for any species. It would be valuable to apply a land-use regression model for NOx in particular, since this could fix the projection of roadside NOx levels into wilderness sites such as Taehwa Forest and establish 1) whether the interpolation improves in coastal cities, and 2) how this improvement changes the gridded averages. Such endeavours require data and resources that are not immediately available to us, but we have noted these alternative techniques in the conclusion.
  c4) Page 2 line 37-38: It is unclear from the manuscript whether the data from the AirKorea monitoring network were obtained directly from the official data center such as NIER, where they are subject to QA/QC management, or whether they were downloaded directly from the AirKorea website (https://www.airkorea.or.kr/eng). If the data were obtained from the AirKorea website, it is important to note that the data available there are real-time observations with minimal QA/QC and may contain some errors. This could potentially contribute to the observed elevated 𝐸1(𝑡) values. Clarification of this aspect seems necessary.
  r4) This is an important distinction. https://www.airkorea.or.kr/ has often been cited as the original source of NIER air quality data (e.g. Crawford et al., 2021), but we could not access the data directly from the website, and instead used the access-controlled raw data from the KORUS-AQ period stored in the NASA archive: https://asdc.larc.nasa.gov/project/KORUS-AQ. We have now published our QA/QC flags for the raw data in our repository (https://doi.org/10.5061/dryad.sf7m0cgf5) and indicated this in Section 2, stating more explicitly where we got the data.
  It would be beneficial if the methods and results sections provided more detailed explanations on the following points:
  c5) A comparison of prediction errors across different regions, particularly between data-dense and data-sparse areas, to evaluate the robustness of the interpolation method under varying observation densities. Perhaps Taehwa and Olympic Park can be used for this purpose?
  r5) Our new additions to Figure A3 provide interesting insights into the interpolation robustness. By examining the improvement of the complete interpolation over the LOOCV interpolation, we highlight regions that were sufficiently sampled (little improvement, e.g. the Seoul Metropolitan Area) vs. undersampled (e.g. rural sites and some city districts). This discussion has been included in Section 3.2.1.
  c6) A clear explanation of how missing or erroneous observations were handled during the analysis. Additionally, an assessment of whether the chosen approach introduces bias into the interpolation results would strengthen the reliability of the conclusions.
  r6) This was an oversight in our methodology discussion: we were not explicit in how missing or erroneous data were treated by the interpolation and subsequent analysis. We have clarified this in Section 2.1. In summary, stations with missing data (or errors that we flagged) were excluded from the interpolation on an hourly basis and discounted from the cluster radius (D) of other stations for declustering purposes. Comparing LOOCV predictions with the complete interpolation provides a metric on the impact of station malfunction on predictability, which we have now charted in Figure A3.
  c7) Sensitivity analyses on the parameters (β, D, L) to ensure that the selected values are truly optimal and not overly reliant on specific conditions within the dataset.
  r7) We have added a statement on how we optimised the parameters: a recursive approach where we fix D and iterate through β, then fix β and iterate D, until a 2D minimum is reached (L is a time-saving constraint that does not significantly affect the global interpolation error above a certain threshold). We omitted sensitivity analysis because the 2D minimae are very similar for all species and shallow, i.e., the global interpolation error is not sensitive to changing the parameters.
  c8) page 7. Line 168 Jiju -> Jinju ?
  r8) Thank you for spotting this, it has been corrected.
  
  Citation: https://doi.org/10.5194/egusphere-2024-1173-AC2

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload

AR by Calum Wilson on behalf of the Authors (04 Jan 2025) Author's response Author's tracked changes

EF by Daria Karpachova (07 Jan 2025) Manuscript

ED: Publish as is (04 Feb 2025) by Jochen Stutz

AR by Calum Wilson on behalf of the Authors (10 Feb 2025)

Journal article(s) based on this preprint

23 Apr 2025

Gridded surface O₃, NO_x, and CO abundances for model metrics from the South Korean ground station network

Calum P. Wilson and Michael J. Prather

Atmos. Meas. Tech., 18, 1757–1769, https://doi.org/10.5194/amt-18-1757-2025,https://doi.org/10.5194/amt-18-1757-2025, 2025

Short summary

Calum Patrick Wilson and Michael John Prather

Viewed

Total article views: 502 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
266	123	113	502	12	11

HTML: 266
PDF: 123
XML: 113
Total: 502
BibTeX: 12
EndNote: 11

Views and downloads (calculated since 22 Aug 2024)

Month	HTML	PDF	XML	Total
Aug 2024	63	22	3	88
Sep 2024	23	4	1	28
Oct 2024	62	33	4	99
Nov 2024	12	7	3	22
Dec 2024	37	26	40	103
Jan 2025	28	6	50	84
Feb 2025	20	2	11	33
Mar 2025	10	15	1	26
Apr 2025	11	8	0	19

Cumulative views and downloads (calculated since 22 Aug 2024)

Month	HTML	PDF	XML	Total
Aug 2024	63	22	3	88
Sep 2024	23	4	1	28
Oct 2024	62	33	4	99
Nov 2024	12	7	3	22
Dec 2024	37	26	40	103
Jan 2025	28	6	50	84
Feb 2025	20	2	11	33
Mar 2025	10	15	1	26
Apr 2025	11	8	0	19

Viewed (geographical distribution)

Total article views: 505 (including HTML, PDF, and XML) Thereof 505 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 23 Apr 2025

Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Preprint (1718 KB)
Metadata XML

Short summary

We evaluated how well we can infer air pollutant levels (ozone, carbon monoxide, and nitrogen oxides) between air quality stations throughout South Korea, finding good representation in most densely measured cities in spite of intense small-scale emission hotspots. Comparing observed air quality with gridded model output is desirable, and so we created gridded datasets over South Korea using air quality station measurements, which agreed with airborne measurements around Seoul.


Total:	0
HTML:	0
PDF:	0
XML:	0

Gridded surface O3, NOx, and CO abundances for model metrics from the South Korean ground station network

Journal article(s) based on this preprint

Interactive discussion

Interactive discussion

Peer review completion

Journal article(s) based on this preprint

Viewed

Viewed (geographical distribution)

Gridded surface O₃, NO_x, and CO abundances for model metrics from the South Korean ground station network