the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Influencing factors analysis and prediction of near-surface ozone in Henan Province from 2015 to 2022
Abstract. This study analyzed factors influencing near-surface ozone (O3) in Henan Province from 2015 to 2022 using real-time pollutant data from the China National Environmental Monitoring Centre and daily meteorological data from the Henan Provincial Ecological Environment Monitoring and Safety Center. Regression and machine learning models (including multiple linear regression (MLR), support vector machine (SVM), random forest (RF), ridge regression (RR), BP neural network, and extreme gradient boosting (XGBoost)) were used to predict O3 concentrations. The results showed that among the major pollutants (CO, NO2, SO2, PM2.5, and PM10), there was a consistent negative correlation with O3. Notably, NO2 had the strongest negative correlation (r = -0.825), while PM10 showed the weakest (r = -0.687). From the perspective of meteorological factors, temperature showed a strong positive correlation with O3, while wind speed, relative humidity, and precipitation showed weak negative correlations, influencing regional variations in O3 concentrations. Among the six prediction models constructed to predict O3 concentrations, the most accurate model for predicting concentrations for the next day was the extreme gradient boosting (XGBoost) model (R2 = 0.883). For the next 3 days, the random forest (RF) model demonstrated the highest accuracy (R2 = 0.704). Similarly, the random forest model (RF) also exhibited the highest accuracy for predicting the next 7 days (R2 = 0.651). In summary, over the past 7 years, there has been a strong correlation observed between O3 concentration and other major pollutants, as well as meteorological factors in Henan Province. Therefore, it is essential to implement targeted measures for O3 pollution prevention and control based on specific weather conditions.
- Preprint
(2293 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on egusphere-2024-3495', Anonymous Referee #1, 19 Mar 2025
To be honest, I'm very pleased to see a study focused on less previledged area rather than Beijing or London. However, as in my previous comments, the performance of boosting (e.g., XGBoost) has been well know for ozone estiamtion. This has been tested over multiple countries, including China. So what's the contribution of this work? Again, if the authors want to justify the 'new' approach or product of surface ozone, it's recommended to conduct intercomparisons with other exsiting products. Unfortunately, none of my comments before have been taken seriously by the authors -- no revision, rebuttal, nor explaining why. We need to see scientific or at least technical contributions and please justify it. Thanks.
Citation: https://doi.org/10.5194/egusphere-2024-3495-RC1 -
AC1: 'Reply on RC1', Bao Hao ming, 26 Apr 2025
感谢您的仔细阅读和宝贵意见。本文在前人研究的基础上,分析了 2015—2022 年河南省臭氧污染的时空分布特征和臭氧影响因素特征,并对未来臭氧浓度进行了短期预测,得出了关于河南省臭氧污染的一些结论。
利用时间序列分析和机器学习等多种预测模型对河南省臭氧浓度进行短期预测。综合评价不同预测模型的有效性,筛选出最优模型,为河南省臭氧污染防治提供科学依据。
在预测的 6 个回归模型中,O3未来 1 天的浓度XGBoost 模型的预测性能最好,RF 模型对 O 的预测性能最好3浓度,RF 模型对 O 的预测性能最佳3在接下来的 7 天内集中。升压(例如 XGBoost)的性能对于臭氧监测来说是众所周知的。然而,基于河南省的特殊条件,该实验为选择合适的模型来预测未来哪些天的臭氧浓度提供了参考。
目前,河南省对臭氧问题的研究还不够全面,用于预测臭氧浓度的模型类型有限,模型间预测精度的比较也较少。因此,本研究全面建立了从臭氧时空特征研究,到臭氧相关因素分析,再到臭氧浓度预测的完整臭氧污染研究流程,以期为河南臭氧等大气污染物防治 Province.In 后续研究提供参考, 需要继续训练机器学习模型,不断优化模型参数,实现对未来长期臭氧浓度的准确预测。Citation: https://doi.org/10.5194/egusphere-2024-3495-AC1 -
EC1: 'Reply on AC1', Jian Xu, 27 Apr 2025
Dear authors,
Thank you for your response to the reviewers' comments. However, I noticed that one of your replies was written in Chinese. As this is an international journal, and the open discussion is visible to all, we kindly ask that all communication, including responses to reviewers, be conducted in English.
Please revise the response accordingly and resubmit it for further processing.
Citation: https://doi.org/10.5194/egusphere-2024-3495-EC1 -
AC3: 'Reply on EC1', Bao Hao ming, 23 May 2025
感谢您的仔细阅读和宝贵意见。本文在前人研究的基础上,分析了 2015—2022 年河南省臭氧污染的时空分布特征和臭氧影响因素特征,并对未来臭氧浓度进行了短期预测,得出了关于河南省臭氧污染的一些结论。
利用时间序列分析和机器学习等多种预测模型对河南省臭氧浓度进行短期预测。综合评价不同预测模型的有效性,筛选出最优模型,为河南省臭氧污染防治提供科学依据。
在预测的 6 个回归模型中,O3未来 1 天的浓度XGBoost 模型的预测性能最好,RF 模型对 O 的预测性能最好3浓度,RF 模型对 O 的预测性能最佳3在接下来的 7 天内集中。升压(例如 XGBoost)的性能对于臭氧监测来说是众所周知的。然而,基于河南省的特殊条件,该实验为选择合适的模型来预测未来哪些天的臭氧浓度提供了参考。
目前,河南省对臭氧问题的研究还不够全面,用于预测臭氧浓度的模型类型有限,模型间预测精度的比较也较少。因此,本研究全面建立了从臭氧时空特征研究,到臭氧相关因素分析,再到臭氧浓度预测的完整臭氧污染研究流程,以期为河南臭氧等大气污染物防治 Province.In 后续研究提供参考, 需要继续训练机器学习模型,不断优化模型参数,实现对未来长期臭氧浓度的准确预测。Citation: https://doi.org/10.5194/egusphere-2024-3495-AC3 -
AC4: 'Reply on EC1', Bao Hao ming, 23 May 2025
Thank you for your careful reading and valuable comments.Based on previous studies, this study analyzes the temporal and spatial distribution characteristics of ozone pollution and the characteristics of ozone influencing factors in Henan Province from 2015 to 2022, and makes short-term predictions of ozone concentration in the future, and obtains some conclusions about ozone pollution in Henan Province.
Multiple prediction models such as time series analysis and machine learning are used to make short-term prediction of ozone concentration in Henan Province. The effectiveness of different prediction models is comprehensively evaluated and the optimal model is selected to provide scientific basis for the prevention and control of ozone pollution in Henan Province.
Among the six regression models predicted, O3 concentration for the next 1 day XGBoost model has the best prediction performance, and RF model has the best prediction performance for O3 concentration in the next 3 days, and RF model has the best prediction performance for O3 concentration in the next 7 days.The performance of boosting (e.g., XGBoost) has been well know for ozone estiamtion.However, based on the special conditions of Henan Province, the experiment provides a reference for selecting a suitable model for predicting the ozone concentration in which days in the future.
At present, the research on the ozone problem in Henan Province is not comprehensive enough, and the types of models used for the prediction of ozone concentration are limited, and the comparison of prediction accuracy between models is also less.Therefore, this study comprehensively establishes a complete ozone pollution research process from the study of temporal and spatial characteristics of ozone, to the analysis of ozone related factors, to the prediction of ozone concentration, so as to provide a reference for the prevention and control of ozone and other atmospheric pollutants in Henan Province.In the follow-up research, it is necessary to continue to train the machine learning model and continuously optimize the model parameters to achieve accurate prediction of future long-term ozone concentration.Citation: https://doi.org/10.5194/egusphere-2024-3495-AC4
-
AC3: 'Reply on EC1', Bao Hao ming, 23 May 2025
-
EC1: 'Reply on AC1', Jian Xu, 27 Apr 2025
-
AC1: 'Reply on RC1', Bao Hao ming, 26 Apr 2025
-
RC2: 'Comment on egusphere-2024-3495', Anonymous Referee #2, 30 Mar 2025
General comments
The manuscript presents the study devoted to the analysis of factors influencing ozone concentrations in Henan Province in 2015-2022 and estimates of the quality of several models to predict short-term ozone variability. In general, the idea of the research addresses relevant scientific questions within the scope of the journal, but the realization of the study suffers from various shortcomings.
To my mind, the scientific information presented in the manuscript - factors that influence the surface ozone concentration - are well known, the novelty of the research may be related to the geographical distribution (location) of the analysis (Henan Province) and the time period. Apart from analysis of the correlations between ozone and its precursors and meteorological conditions, authors compared different models for surface ozone prediction. But this comparison looks like statistical analysis only without significant conclusions on scientific concepts.
Authors introduced the role of ozone emissions and the importance of its monitoring and modelling. But these words are general, I am not convinced that prediction of ozone concentrations on short-term periods considered in the manuscript plays a significant role in controlling the air quality of the cities.
Manuscript suffers from the lack of scientific analysis; in my opinion, authors did not show that scientific significance of their research.
Minor comments
Line 25
Please, define the abbreviation “BP”.
Line 85
What does it mean – “to improve pollution”?
Line 93
“Prybutok 93 et al. (Victor, 2000)” – what does it mean?
Line 122
Please, give the definition of the abbreviation “MDA8”.
Line 160
“excluding Jiyuan” - authors mentioned several times this exclusion, and did not explain why it was excluded.
Line 251
Please, denote this abbreviation “GBDT”.
Lines 297-298
What are the errors of these estimates? Please, provide them with a confidential interval to assess the significance of these values. The same refers to the errors of determination of other trend estimates and correlation coefficients.
Line 382, Figure 5
The same colors on different parts of the figure represent different correlations. So, it can be confusing for the reader to analyze all parts together. For better perception of the figure, I recommend using different color scales for different meteorological factors.
Line 420
To my mind, all mentioned correlations between O3 concentration and wind speed are insignificant, so it does not make sense to analyze the changes in correlation coefficients in detail.
Line 465
If all these factors influence O3 pollution, how can you explain weak correlations between most of the factors and O3 concentration?
Citation: https://doi.org/10.5194/egusphere-2024-3495-RC2 -
AC2: 'Reply on RC2', Bao Hao ming, 26 Apr 2025
Thank you for your careful reading and valuable comments. Combining HrSOD data set and CAQRA data set, more detailed conclusions such as temporal and spatial characteristics, influencing factors and prediction results of ozone pollution are proposed and verified.
The ozone pollution situation of each city in Henan Province on an annual scale was calculated based on the time statistics method, and the conclusions of the relevant pollution degrees were obtained.
Based on the data of other pollutants and meteorological data of 17 prefecture-level cities in Henan Province from 2015 to 2022, the influence of variables on the spatial differentiation of ozone concentration in Henan Province was deeply analyzed, and the conclusion of the correlation between each influencing factor and the ozone concentration in Henan Province was obtained.
The data from January 1, 2022 to December 31, 2022 were selected to construct six regression models to predict the ozone concentrations in Henan Province in the short term of 1 day, 3 days and 7 days in the future, and the conclusions on the accuracy of each prediction model were obtained.
Based on these conclusions, a complete research process for ozone pollution was established, ranging from the study of the spatio-temporal characteristics of ozone, to the analysis of ozone-related factors, and to the prediction of ozone concentration. The best model was compared and analyzed to predict the ozone concentration more accurately and provide a reference for the prevention and control of ozone and other air pollutants in Henan Province.
Defined BP (Back Propagation).It means to reduce ozone pollution and has been modified to reduce O3 pollution.
It was the author Victor Prybutok, which has been changed to Victor.
The explanation has been given: MDA8 (Maximum Daily 8-Hour Average) : MDA8 represents the maximum daily 8-hour average concentration of ozone.
Previous studies focused on the ozone issue in Henan Province but did not study Jiyuan City. Moreover, the monitoring data could not obtain the ozone pollution data of Jiyuan City, which was added due to the availability of the data.
GBDT (Gradient Boosting Decision Tree) has been represented.
These errors represent the changing trends of major pollutants in Henan Province from 2015 to 2022. These changing trends indirectly reflect the degree of correlation between other pollutants and ozone.
Figure 5, after consideration, for the sake of the overall integrity of the pictures, the four pictures are still represented by the same color system. Whether there is a positive correlation or a negative correlation, dark colors represent high correlation and light colors represent low correlation.
The correlation between wind speed and ozone concentration in Henan Province from 2015 to 2022 was relatively weak. However, in other relevant literatures, wind speed is also an important indicator affecting ozone concentration and requires quantitative analysis.
Weak correlation indicates that a certain period of time may show a weak relationship with ozone, but the overall influence of related factors has led to the current situation of ozone pollution.
Citation: https://doi.org/10.5194/egusphere-2024-3495-AC2
-
AC2: 'Reply on RC2', Bao Hao ming, 26 Apr 2025
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
172 | 54 | 20 | 246 | 10 | 15 |
- HTML: 172
- PDF: 54
- XML: 20
- Total: 246
- BibTeX: 10
- EndNote: 15
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1