the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Machine learning significantly improves the simulation of hourly-to-yearly scale cloud nuclei concentration and radiative forcing in polluted atmosphere
Abstract. The accurate prediction of cloud condensation nuclei (CCN) number concentration (NCCN) on a large spatiotemporal scale is challenging but critical to evaluate the aerosol cloud interaction (ACI) effect. Combining multi-source dataset and the NCCN simulated by the Weather Research and Forecasting coupled with Chemistry (WRF-Chem) model, we have developed a new machine learning-based model which predicts well both regional and hourly-to-yearly scale NCCN at typical supersaturations in the North China Plain (NCP). We show that the prediction bias of NCCN compared to observations is reduced from -39 % with the WRF-Chem model to approximately -8 % with the new model. The greatest improvement is seen in polluted cases. The new model captures well the spatial variation and better describes long-term trends of NCCN than the WRF-Chem. More importantly, the study reveals a significant long-term decreasing trend of NCCN in NCP due to a rapid reduction in aerosol concentrations from 2014 to 2018, during which a series of strict emission reduction measures were implemented by the Chinese government. This reflects the climate benefit of pollution control. Our study further illustrates that the new model reduces the uncertainty in simulating cloud radiative forcing from an overestimation of 1.07±0.76 W m-2 to only 0.18±0.65 W m-2, illustrating the high sensitivity of climate forcing to changes in NCCN. This work offers a new modeling framework that has the potential to greatly improve the assessment of the ACI effect in current models, and guides the way to simulate CCN in other regions around the world.
- Preprint
(2398 KB) - Metadata XML
-
Supplement
(2763 KB) - BibTeX
- EndNote
Status: open (until 29 Jul 2025)
-
CEC1: 'Comment on egusphere-2025-1483 - No compliance with the policy of the journal', Juan Antonio Añel, 23 Jun 2025
reply
Dear authors,
Unfortunately, after checking your manuscript, it has come to our attention that it does not comply with our "Code and Data Policy".
https://www.geoscientific-model-development.net/policies/code_and_data_policy.htmlBeyond one Zenodo repository which contains the machine learning model and the meteorological input variables hosted in the RDA-NCAR, none of the other sites that you cite to get access to the code (e.g. WRF) or data, are valid repositories for scientific publication, and they do not comply with the requirements exposed in the policy of the journal.
Therefore, the current situation with your manuscript is irregular, as we can not accept manuscripts in Discussions that do not comply with our policy. Please, publish your code and data in one of the appropriate repositories according to our policy and reply as soon as possible to this comment with a modified 'Code and Data Availability' section for your manuscript, which must include the relevant information (link and permanent identifier (e.g., handle, DOI)) of the new repositories, and which you must include in a potentially reviewed manuscript.
I must note that if you do not fix this problem, we will have to reject your manuscript for publication in our journal.
Juan A. Añel
Geosci. Model Dev. Executive EditorCitation: https://doi.org/10.5194/egusphere-2025-1483-CEC1 -
AC1: 'Reply on CEC1', Fang Zhang, 23 Jun 2025
reply
Re: Thank you for your efforts and time on handling the paper. The source codes of WRF-Chem, Python and the Scikit-Learn machine learning library have been revised in the Code and Data availability. See as follows: “
Code and Data availability
The data and code are publicly accessible at https://zenodo.org/records/15523200 (Ren et al., 2025). This includes the machine learning code, the corresponding training and testing dataset (chemical compositions, gaseous pollutants, meteorological datasets and simulated CCN concentration from WRF-Chem) and the observation CCN concentrations, the script and namelist file used in WRF-Chem and the scripts used for plotting, supporting the findings of this study. The release version of WRF-Chem source code is archived on GitHub (https://github.com/wrf-model/WRF, last access: May, 2025). The release version of Python and the Scikit-Learn machine learning library are open source from https://github.com/python and https://github.com/scikit-learn.”
-
AC1: 'Reply on CEC1', Fang Zhang, 23 Jun 2025
reply
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
78 | 24 | 3 | 105 | 18 | 4 | 3 |
- HTML: 78
- PDF: 24
- XML: 3
- Total: 105
- Supplement: 18
- BibTeX: 4
- EndNote: 3
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1