Preprints
https://doi.org/10.5194/egusphere-2026-1615
https://doi.org/10.5194/egusphere-2026-1615
08 Apr 2026
 | 08 Apr 2026
Status: this preprint is open for discussion and under review for Atmospheric Chemistry and Physics (ACP).

Assessment of Aerosol Iron Solubility using Global Dataset, Part II: Machine Learning and Deep Neural Network Coupled with SHapley Additive exPlanation Combined with Independent Component Analysis (SHAP-ICA)

Kohei Sakata, Minako Kurisu, and Yoshio Takahashi

Abstract. The supply of dissolved iron (d-Fe) can enhance marine CO2 fixation. Aerosols are one source of d-Fe to the ocean surface, but aerosol iron solubility (Fesol%) depends on emission sources and atmospheric alteration processes that remain poorly reproduced by global climate and chemical transport models. Although recent advances in machine and deep learning models can capture nonlinear relationships in observational datasets, applications to environmental samples are still limited and approaches for improving interpretability require further development. This study trained XGBoost and a deep neural network (DNN) using East Asian aerosol data and tested whether Fesol% and d-Fe concentrations in marine aerosols can be reproduced. The effects of individual features on Fesol% and d-Fe were quantified using SHapley Additive exPlanations (SHAP), and independent component analysis (ICA) was applied to SHAP values to extract independent components representing dominant controlling processes of Fesol%. East Asian Fesol% was reproduced well by both XGBoost and DNN. For marine aerosols, higher reproducibility was achieved by the DNN than by XGBoost, likely because deeper relationships among features can be learned. SHAP indicated that variability in Fesol% and d-Fe is primarily driven by chemical alteration of Fe in mineral dust and anthropogenic aerosols. ICA further suggested that additional processes, including heavy oil combustion, influence a subset of samples. Spatial variations in process contributions were visualized by mapping the influence of each independent component. This DNN-based framework can improve interpretation of both current results and future observational datasets.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Kohei Sakata, Minako Kurisu, and Yoshio Takahashi

Status: open (until 20 May 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Kohei Sakata, Minako Kurisu, and Yoshio Takahashi
Kohei Sakata, Minako Kurisu, and Yoshio Takahashi
Metrics will be available soon.
Latest update: 08 Apr 2026
Download
Short summary
Aerosols supply dissolved iron (d-Fe) to the ocean surface, where it can enhance marine CO2 fixation. Machine and deep learning can capture nonlinear relationships in observational datasets, but applications to atmospheric chemistry remain limited. Using East Asian aerosol data, this study trained XGBoost and a deep neural network to predict Fesol% and d-Fe in marine aerosols. SHAP and ICA showed that variability was governed mainly by chemical processing of mineral dust and anthropogenic Fe.
Share