Technical Note: Deep-GF-PRM – A physics-informed deep learning framework for parameterizing aerosol hygroscopic growth factor probability density function
Abstract. The hygroscopic properties of atmospheric aerosols are crucial for quantifying their impact on radiation and cloud formation. They are often characterized by a growth factor probability density function (GF-PDF), which can be parameterized as a superposition of multiple Gaussian distributions. Conventionally, nonparametric inversion methods are developed to retrieve GF-PDF from the instrument responses, e.g., measurements of humidified tandem differential mobility analyzer. However, additional parametric fittings are required to extract modal parameters from the inverted GF-PDF, a process that is computationally intensive and susceptible to fitting errors.
In this study, we introduce Deep-GF-PRM, a deep learning framework that parameterizes the GF-PDF modal parameters directly from the instrument responses. The core of Deep-GF-PRM is a physics-informed neural network that embeds the instrument’s kernel function and physical constraints, creating end-to-end mapping of the GF-PDF modal parameters to the instrument response. Trained on a large dataset of synthetic instrument responses generated using a wide range of GF-PDFs and noise levels, Deep-GF-PRM accurately reproduces synthetic GF-PDFs and retrieves modal parameters with higher fidelity than conventional fitting approaches. The model is applied to real-world measurements, and yields results highly consistent with nonparametric inversions. Deep-GF-PRM thus provides an efficient and unsupervised solution for parameterizing aerosol hygroscopic properties.
The study by Lu et al. developed the Deep-GF-PRM framework for parameterizing GF-PDF in aerosol hygroscopicity studies. Trained by large synthetic instrument responses, this model is shown to quickly retrieve modal parameters that performs no worse than the conventional fitting approaches. Especially, this method advantages in its quick responses and ensured convergence, which is of urgent need for large-scale online data analysis. Overall, this study is well conducted in innovative ways, clearly presented, and presented a promising and powerful new tool. I've only a few minor concerns.
1. In this study, the GFmean is set as the major target of comparison. How about the other parameters (e.g., mode numbers, and detailed f, G, σ for each mode), especially when more than one GF-PDF modes are present due to the external mixing, etc.?
2. Can the authors briefly state / predict the applicability of this methods? For example, when the deviations tend to be larger (except when total counts are low and therefore low SNR)? What kind of parameters / instrumental design changes, or under which scenarios the applicability of this method needs special attention and / or to be re-examined first?