Preprints
https://doi.org/10.5194/egusphere-2026-1347
https://doi.org/10.5194/egusphere-2026-1347
21 Apr 2026
 | 21 Apr 2026
Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

Machine learning significantly improves the simulation of hourly-to-yearly scale cloud nuclei concentration and radiative forcing in polluted atmosphere

Jingye Ren, Songjian Zou, Honghao Xu, Guiquan Liu, Zhe Wang, Anran Zhang, Chuanfeng Zhao, Min Hu, Dongjie Shang, Lizi Tang, Ru-Jin Huang, Yele Sun, and Fang Zhang

Abstract. The accurate prediction of cloud condensation nuclei (CCN) number concentration (NCCN) on a large spatiotemporal scale is challenging but critical to evaluate the aerosol cloud interaction effect. Combining multi-source dataset and the NCCN simulated by the Weather Research and Forecasting coupled with Chemistry (WRF-Chem) model, we have developed a Random Forest Regression method (RFRM) model which achieves well prediction of hourly-to-yearly scale NCCN at typical supersaturations in polluted North China Plain (NCP). We show that the prediction bias of NCCN compared to observations is reduced from -59 % with the WRF-Chem model to approximately -31 % with the RFRM model (the prediction precision is improved by 1.6 times accordingly) during the campaigns. The greatest improvement is seen in both very polluted and clean cases. The RFRM model captures well the spatial variation and better describes long-term trends of NCCN. More importantly, the prediction reveals a significant long-term decreasing trend of NCCN in NCP due to a rapid reduction in aerosol concentrations from 2014 to 2018, during which a series of strict emission reduction measures were implemented by the Chinese government. This reflects the climate benefit of pollution control. Our study further illustrates that the RFRM model reduces the uncertainty in simulating cloud radiative forcing from an overestimation of 1.89 ± 0.78 W m-2 to 0.81 ± 0.63 W m-2, illustrating the high sensitivity of climate forcing to changes in NCCN. This work offers a new modeling framework that guides the way to simulate CCN in other regions around the world and has the potential to effectively filling the observation gap of CCN concentrations.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Jingye Ren, Songjian Zou, Honghao Xu, Guiquan Liu, Zhe Wang, Anran Zhang, Chuanfeng Zhao, Min Hu, Dongjie Shang, Lizi Tang, Ru-Jin Huang, Yele Sun, and Fang Zhang

Status: open (until 16 Jun 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Jingye Ren, Songjian Zou, Honghao Xu, Guiquan Liu, Zhe Wang, Anran Zhang, Chuanfeng Zhao, Min Hu, Dongjie Shang, Lizi Tang, Ru-Jin Huang, Yele Sun, and Fang Zhang
Jingye Ren, Songjian Zou, Honghao Xu, Guiquan Liu, Zhe Wang, Anran Zhang, Chuanfeng Zhao, Min Hu, Dongjie Shang, Lizi Tang, Ru-Jin Huang, Yele Sun, and Fang Zhang
Metrics will be available soon.
Latest update: 21 Apr 2026
Download
Short summary
In this study, a new framework of cloud condensation nuclei (CCN) prediction in polluted region has been developed and it achieves well prediction of hourly-to-yearly scale across North China Plain. The study reveals the machine learning model can largely reduce the uncertainty in simulating cloud radiative forcing, illustrating the high sensitivity of climate forcing to changes in CCN. This improvement of our new model would be helpful to aerosols climate effect assessment in models.
Share