Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).
Evaluation and improvement of CAMS-derived CCN number concentrations using in-situ measurements
Yannick Emanuel Anders,Karoline Block,Mira Pöhlker,and Johannes Quaas
Abstract. Cloud condensation nuclei (CCN) are essential components of aerosol-cloud interactions (ACI). Thus, a precise knowledge about their number concentrations (Nccn) is crucial for climate models and ACI studies. This study presents a comprehensive evaluation of the recently published CAMS-derived total Nccn using direct observations from 25 ground-based sites. The analysis specifically focuses on the temporal variability, the applicability of CAMS-derived Nccn across different environments and pollution regimes and in particular, the sensitivity of CCN to supersaturation. For the latter aspect, a bias shift is identified in simulated Nccn that correlates to the ratio of the two dominant CCN species, likely reflecting assumptions in the underlying size distributions and/or emissions fractions. To address this issue, we developed an observation-based parametrization that is applied to CAMS-derived total Nccn without modifying aerosol size distributions or species concentrations. This approach substantially reduces biases leading the way to an improved version of CAMS-derived Nccn.
Received: 13 Mar 2026 – Discussion started: 30 Mar 2026
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
The paper addresses two relevant scientific modelling questions that are relevant to GMD, namely the performance of CAMS derived CCN number concentrations with respect to in-situ ground-station observations and possible corrections to improve that performance. Both the evaluation against a large set of ground-station observations from varied environments, and the proposed correction are novel and present important advances for the field.
While the evaluation against in-situ observations is very valuable and well documented, the level of agreement between CAMS and observations is overstated as follows:
Line 227 states that “simulated temporal variability agrees well for the majority of sites” (line 227), but R is less than 0.5 for almost half the evaluated stations for monthly averages.
Line 230 states that “CAMS Nccn reproduces observed variability very well on seasonal and annual time scale and is well suited for climatologies”, but the seasonal and annual agreement with in-situ observations are not directly assessed in the manuscript, only the 11 day and monthly agreement.
Reproducing variability between Dry and Wet season evaluation is mentioned but has explicitly not been shown (line 236)
Block et al 2024 is sited as evidence for good agreement (lines 233-234), but this uses only sites SGP, PVC, PGH, GRG, MAG, and only 2 out of these 5 sites have R>0.5
Regarding the method of comparison between CAMS and observations, CAMS derived NCCN are at 0UTC only, but the full diurnal cycle of in-situ observations is used (Section 2.3 Data Treatment). It is worth questioning whether diurnal cycles in observed NCCN lead to a bias here. The approach of longer temporal averaging taken here is not sufficient to remove such a bias. If hourly observations are available from the stations, it should be straightforward to test this.
The method for deriving the correction factor is well justified in terms of justifying an s-dependent k parameter, and in using empirical fits of the Nccn-s spectrum to derive this. Limitations in this method being unable to use the full 3-hourly CAMS reanalysis data or change the underlying size distribution are well justified. However, aspects of the method seem unjustified and sensitivities to the large uncertainties involved not fully investigated. Details of these are as follows:
The authors clearly note that CAMS derived CCN at latitudes above 70N and 70S are problematic because of the lack of data assimilation at these latitudes. The inclusion of these stations in the derivation of the global correction factor therefore seems poorly justified.
The k parameter is derived by grouping together median continental, coastal/polar and remote marine stations to find average factors for each environment. However, the variability of Sbest within coastal/polar and continental environment groups is greater than between the different environments (Fig. 4). This suggests that the grouping is not valid for this purpose.
The k factor is derived by averaging the median continental, coastal/polar and remote marine correction factors with equal weighting (line 333-335). While the authors are correct to try to account for the fact that the environmental distribution of stations with available observations does not accurately reflect the global distribution, there is no justification given for assuming that the global atmosphere is made of equal parts coast, ocean and continent.
There are large uncertainties/variability in the empirical fits for the k-factor (Fig. 4). Some understanding of how this uncertainty affects the resulting Nccn and the bias reduction is needed.
The PGH site is excluded from the derivation of k(s) because they lead to negative values. This is only noted in the caption for Fig. 6, and no physical justification is given for excluding these data. This is especially problematic given that PGH is the only site from southern Asia. That these data lead to an unphysical k(s) seems motivation to reevaluate aspects of the method used for this correction.
More minor comments are as follows:
The abstract fails to state the result of the evaluation of CAMS Nccn.
Line 218 “temporal presentability” - unclear terminology
Section 2.2 Observed Data - While the number of stations and different environments included in the evaluation is high, and reflects well what is available, it is of note that available observations are still biased toward Europe and North America, with Africa and Australasia remaining particularly underrepresented. This is not a fault of this study, but it would be helpful to note this bias and its’ potential implications in the manuscript.
The conclusions sections states both that there is no dependence of model performance on environment nor pollution but also that more polluted sites show a positive bias (lines 392-395). These are contradictory.
Why do the R values for the daily comparisons with SGP, PVC and PGH differ from those presented in Block et al 2024 (see table below)?
Daily 11day month Block2024 (daily)
SGP 0.069 0.133 0.709 0.24
PVC 0.296 (0.098) (0.360) 0.66
PGH 0.532 0.623 (0.480) 0.41
Line 219 typo “andcorrelations”
Fig A3 legend hard to distinguish between station and environment specific
Line 245 “CCNO are significantly overestimated” – does this means observations>CAMS or other way round? Similar at line 265
Line 247 “Most stations fall in the respective pollution regime category as defined by observations” unclear – I think the authors mean the pollution categorization from CAMS matches that from observations in most cases - needs to be stated more clearly.
6a Impossible to see which datapoints belong to the same station except at the edges, so impossible to see if k does decrease with s for each station as stated at line 320-323. Solid lines for environment specific medians not very visible in 6a either.
ATT station is used as an example of how the method works (Fig. A3) with some acknowledgement that the global average approach does not improve the predicted Nccn for all stations (lines 344-352). However, ATT has s-best in the middle of the range of all stations, and therefore is likely a case where the global average works best. It would be more justifiable to illustrated also cases that work less well.
Particles in the atmosphere can trigger the formation of cloud droplets, affecting cloud properties and climate. This study evaluates a new global dataset of these particles with measurements from 25 sites around the world. The variability in time and space and their conditional formation behaviour is analysed. The authors identify systematic biases and introduce a simple correction based on observations that greatly improves the dataset’s accuracy.
Particles in the atmosphere can trigger the formation of cloud droplets, affecting cloud...
The paper addresses two relevant scientific modelling questions that are relevant to GMD, namely the performance of CAMS derived CCN number concentrations with respect to in-situ ground-station observations and possible corrections to improve that performance. Both the evaluation against a large set of ground-station observations from varied environments, and the proposed correction are novel and present important advances for the field.
While the evaluation against in-situ observations is very valuable and well documented, the level of agreement between CAMS and observations is overstated as follows:
Regarding the method of comparison between CAMS and observations, CAMS derived NCCN are at 0UTC only, but the full diurnal cycle of in-situ observations is used (Section 2.3 Data Treatment). It is worth questioning whether diurnal cycles in observed NCCN lead to a bias here. The approach of longer temporal averaging taken here is not sufficient to remove such a bias. If hourly observations are available from the stations, it should be straightforward to test this.
The method for deriving the correction factor is well justified in terms of justifying an s-dependent k parameter, and in using empirical fits of the Nccn-s spectrum to derive this. Limitations in this method being unable to use the full 3-hourly CAMS reanalysis data or change the underlying size distribution are well justified. However, aspects of the method seem unjustified and sensitivities to the large uncertainties involved not fully investigated. Details of these are as follows:
More minor comments are as follows:
Daily 11day month Block2024 (daily)
SGP 0.069 0.133 0.709 0.24
PVC 0.296 (0.098) (0.360) 0.66
PGH 0.532 0.623 (0.480) 0.41