DDA-BNN v1.0: A Morphology-Aware Surrogate Model for the Optical Properties of Black Carbon&ndash;Containing Particles

Beeler, Payton; Donald, Sam; Fierce, Laura

doi:10.5194/egusphere-2026-1270

Preprints

https://doi.org/10.5194/egusphere-2026-1270

Preprints

24 Mar 2026

| 24 Mar 2026

Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

DDA-BNN v1.0: A Morphology-Aware Surrogate Model for the Optical Properties of Black Carbon–Containing Particles

Payton Beeler, Sam Donald, and Laura Fierce

Abstract. Black carbon (BC) is the most strongly absorbing component of atmospheric aerosol and significantly impacts Earth's energy balance. The optical properties of BC-containing particles depend on particle-level variability in size, chemical composition, and internal morphology. Such particle-level details are not easily represented in large-scale atmospheric models. Existing parameterizations typically assume idealized particle geometries (e.g., homogeneous spheres or concentric core–shell spheres) and homogeneous mixing, which can yield biased predictions and provide no quantitative estimate of model-form uncertainty at the single-particle level. In this work, we present a probabilistic framework for predicting the optical properties of individual BC-containing particles using a hybrid Bayesian neural network (BNN) model trained on numerically exact discrete dipole approximation (DDA) simulations. The hybrid BNN is a flexible combination of deterministic and Bayesian layers allowing for more realistic treatment of particle optical properties and quantification of uncertainty. The hybrid BNN model predicts extinction efficiency, single-scattering albedo, and asymmetry parameter and returns predictive uncertainty that can be decomposed into aleatoric (data-driven variability) and epistemic (uncertainty due to limited training coverage) components. We show that the hybrid BNN outperforms homogeneous-sphere and core–shell Mie approximations for calculating extinction and scattering-sensitive quantities (Q_ext, SSA and g), while maintaining comparable accuracy for absorption-related metrics. We further demonstrate how epistemic uncertainty highlights under-sampled regions of particle parameter space, enabling targeted design of future DDA simulations that most effectively reduce model uncertainty. This uncertainty-aware surrogate provides a practical pathway for incorporating realistically complex particle morphologies into parameterizations of aerosol optical properties, which will ultimately improve the reliability of model-based assessments of BC impacts on the atmosphere.

Received: 13 Mar 2026 – Discussion started: 24 Mar 2026

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Payton Beeler, Sam Donald, and Laura Fierce

Status: open (until 21 May 2026)

Post a comment Subscribe to comment alert

CEC1:
'Comment on egusphere-2026-1270 - No compliance with the policy of the journal', Juan Antonio Añel, 28 Mar 2026 reply

Dear authors,
Unfortunately, after checking your manuscript, it has come to our attention that it does not comply with our "Code and Data Policy".
https://www.geoscientific-model-development.net/policies/code_and_data_policy.html
In your manuscript, you do not provide a repository containing the data used and produced in your work. The GMD review and publication process depends on reviewers and community commentators being able to access, during the discussion phase, the code and data on which a manuscript depends, and on ensuring the provenance of replicability of the published papers for years after their publication. Please, therefore, publish your data in one of the appropriate repositories and reply to this comment with the relevant information (link and a permanent identifier for it (e.g. DOI)) as soon as possible. We cannot have manuscripts under discussion that do not comply with our policy.
The 'Code and Data Availability’ section must also be modified to cite the new repository locations, and corresponding references added to the bibliography.
I must note that if you do not fix this problem, we cannot continue with the peer-review process or accept your manuscript for publication in GMD.
Juan A. Añel

Geosci. Model Dev. Executive Editor

Reply

Citation: https://doi.org/10.5194/egusphere-2026-1270-CEC1
- AC1:
  'Reply on CEC1', Laura Fierce, 30 Mar 2026 reply
  
  We have revised the manuscript to comply with the GMD Code and Data Policy.
  The DDA-BNN codebase, including the training pipeline and inference tools, is publicly available at:
  
  https://github.com/pnnl/DDA-BNN (MIT License).
  The exact version of the code used in this study has been permanently archived on Zenodo and assigned a DOI:
  
  https://doi.org/10.5281/zenodo.19324375
  
  (https://zenodo.org/records/19324375)
  The training dataset used in this work is publicly available and has also been permanently archived on Zenodo with a DOI:
  
  https://doi.org/10.5281/zenodo.19324185
  
  (https://zenodo.org/records/19324185)
  We have updated the “Code and Data Availability” section of the revised manuscript to include these repository links and DOIs, and have added the corresponding citations to the bibliography. All code and data are publicly accessible at the links above for the duration of the discussion phase.
  
  Reply
  
  Citation: https://doi.org/10.5194/egusphere-2026-1270-AC1
  - CEC2: 'Reply on AC1', Juan Antonio Añel, 30 Mar 2026 reply
    
    Dear authors,
    Thanks for addressing this issue so quickly. I have checked the repositories and we can consider now the current version of your manuscript in compliance with the code policy of the journal.
    Juan A. Añel
    
    Reply
    
    Citation: https://doi.org/10.5194/egusphere-2026-1270-CEC2
CC1: 'Details of the DDA simulations', Maxim A. Yurkin, 03 Apr 2026 reply

The authors have performed a lot of DDA simulations and carefully described the training and validation starting from the DDA dataset (available at Zenodo). However, I couldn't find a description of the ADDA parameters used for simulations. It would be great to specify them for the overall reproducibility, including the ADDA version, DDA formulation, and discretization. Mentioning that some parameters are set to default values will also be fine.
Putting all raw DDA output online would probably be an overkill, but the authors may consider sharing the scripts, which perform these runs (to build a dataset). They will necessarily contain all the command line options for ADDA. Total or per run computational requirements of the DDA can also be interesting for readers.
Related question is that of the uncertainty of the DDA simulations (expected errors). Is there any estimate on that, at least for a few representative cases? It can be obtained, e.g., by several steps of discretization refinement for the same problem. I suspect, that the corresponding DDA errors are small enough to not influence any conclusions or further steps in the manuscript. Still, it would be great to quantify them.
The latter seems especially natural, since the authors already perform an advanced error analysis, separating aleatoric and epistemic uncertainties. The DDA uncertainty probably fall into aleatoric class, but I am not sure if it needs any special consideration.
Finally, I have a minor stylistic note concerning the name of the ADDA code. The official guideline is not to deabbreviate it - see https://github.com/adda-team/adda/wiki/FAQ#what-is-the-official-name-of-the-code-what-does-a-stands-for . In other words, the standard naming is just ADDA.
Similar aspects were recently discussed in another GMD paper - https://doi.org/10.5194/gmd-19-887-2026 .

Reply

Citation: https://doi.org/10.5194/egusphere-2026-1270-CC1

Payton Beeler, Sam Donald, and Laura Fierce

Viewed

Total article views: 781 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
570	153	58	781	36	45

HTML: 570
PDF: 153
XML: 58
Total: 781
BibTeX: 36
EndNote: 45

Views and downloads (calculated since 24 Mar 2026)

Month	HTML	PDF	XML	Total
Mar 2026	434	97	40	571
Apr 2026	122	42	13	177
May 2026	14	14	5	33

Cumulative views and downloads (calculated since 24 Mar 2026)

Month	HTML	PDF	XML	Total
Mar 2026	434	97	40	571
Apr 2026	122	42	13	177
May 2026	14	14	5	33

Viewed (geographical distribution)

Total article views: 781 (including HTML, PDF, and XML) Thereof 781 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 16 May 2026

Short summary

We developed a surrogate model to predict how black carbon particles absorb and scatter light based on their size, shape, and mixing with other aerosol species. The model was trained on detailed physics simulations and also estimates uncertainty caused by limited training data and unresolved internal structures. It outperformed common simplified particle assumptions, especially for scattering, and can help target new simulations to improve predictions of black carbon’s radiative effects.


Total:	0
HTML:	0
PDF:	0
XML:	0