Survival analysis for droplet-freezing data: Kaplan–Meier confidence intervals and log-rank tests
Abstract. Droplet‑freezing assays underpin immersion‑mode ice‑nucleation research yet approaches to uncertainty quantification for fraction‑frozen curves and derived active‑site densities (ns(T)) are inconsistent. Further, there is not currently a rigorous method for significance testing the difference between fraction frozen curves. To address these issues, we recast droplet‑freezing measurements as survival data and apply analysis techniques typically used in medical statistics. Using the Kaplan–Meier estimator, we derive nonparametric confidence intervals for droplet fraction frozen and ns(T) without binning or model assumptions, matching Monte‑Carlo and studentized‑bootstrapped intervals on a literature volcanic ash ice nucleation dataset. Confidence intervals calculated for simulated datasets show precision improves with sample size and with steeper fraction frozen curves. Adapting the log-rank test, we introduce a method for comparing fraction frozen curves and demonstrate its application to literature and simulated droplet freezing datasets. We recommend reporting Kaplan–Meier confidence intervals on droplet freezing datasets and employing the log-rank test when comparing droplet fraction frozen curves.
In “Survival analysis for droplet-freezing data: Kaplan-Meier confidence intervals and log-rank tests,” Whale et al. present an approach to quantify uncertainty in droplet-freezing data and test differences in frozen fraction curves based on the non-parametric Kaplan-Meier survival function estimator. By applying this method, they are able to derive confidence intervals for frozen fraction curves and cumulative ice nucleation activity spectra. They also adapt the log-rank test to droplet freezing data to hypothesis test whether two frozen fraction curves are identical or not. They then demonstrate both methods on selected literature and simulated datasets. The method of calculating confidence intervals they present is rigorous, easy to use, and addresses a persistent problem of a lack of standardization in ice nucleation statistics. However, I have much more significant concerns surrounding the use of the log-rank test for ice nucleation data. Specifically, the hypothesis being tested by the log-rank test is limited in the context of ice nucleation, where in many samples there may be multiple different populations of ice active species that dominate at different temperatures in the frozen fraction spectrum. The requirement of proportional hazards is also a concern as many frozen fraction curves in literature do not meet this assumption, further limiting the usefulness of this test. Given that it can also only be applied to frozen fraction curves (and so generally cannot be used to compare across instruments or across samples at different concentrations), I am not convinced of the broad applicability that the authors imply when recommending this statistical test be used. When combined with the fact that the method of calculating confidence intervals has been previously published (Kinney et al., 2024), I am not convinced that this manuscript in its current form constitutes a sufficiently novel and significant contribution to the field. I encourage the authors to consider whether refinements mentioned in the manuscript that address the shortcomings of the log-rank test or other approaches to statistical testing in the context of survival analysis might be more appropriate and useful to a wider range of applications. Specific comments and suggestions are below.
Major comments:
Minor comments:
References
Cumming, G.: Inference by eye: Reading the overlap of independent confidence intervals, Stat. Med., 28, 205–220, https://doi.org/10.1002/sim.3471, 2009.
Fahy, W. D., Shalizi, C. R., and Sullivan, R. C.: A universally applicable method of calculating confidence bands for ice nucleation spectra derived from droplet freezing experiments, Atmospheric Meas. Tech., 15, 6819–6836, https://doi.org/10.5194/amt-15-6819-2022, 2022.
Jahl, L. G., Brubaker, T. A., Polen, M. J., Jahn, L. G., Cain, K. P., Bowers, B. B., Fahy, W. D., Graves, S., and Sullivan, R. C.: Atmospheric aging enhances the ice nucleation ability of biomass-burning aerosol, Sci. Adv., 7, eabd3440, https://doi.org/10.1126/sciadv.abd3440, 2021.
Kinney, N. L. H., Hepburn, C. A., Gibson, M. I., Ballesteros, D., and Whale, T. F.: High interspecific variability in ice nucleation activity suggests pollen ice nucleators are incidental, Biogeosciences, 21, 3201–3214, https://doi.org/10.5194/bg-21-3201-2024, 2024.
Losey, D. J., Sihvonen, S. K., Veghte, D. P., Chong, E., and Freedman, M. A.: Acidic processing of fly ash: chemical characterization, morphology, and immersion freezing, Environ. Sci. Process. Impacts, 20, 1581–1592, https://doi.org/10.1039/c8em00319j, 2018.
Sachs, M. C., Brand, A., and Gabriel, E. E.: Confidence bands in survival analysis, Br. J. Cancer, 127, 1636–1641, https://doi.org/10.1038/s41416-022-01920-5, 2022.
Whale, T. F., Holden, M. A., Wilson, T. W., O’Sullivan, D., and Murray, B. J.: The enhancement and suppression of immersion mode heterogeneous ice-nucleation by solutes, Chem. Sci., 9, 4142–4151, https://doi.org/10.1039/C7SC05421A, 2018.