Assessment of Aerosol Iron Solubility using Global Dataset, Part II: Machine Learning and Deep Neural Network Coupled with SHapley Additive exPlanation Combined with Independent Component Analysis (SHAP-ICA)

Sakata, Kohei; Kurisu, Minako; Takahashi, Yoshio

doi:10.5194/egusphere-2026-1615

Preprints

https://doi.org/10.5194/egusphere-2026-1615

Preprints

08 Apr 2026

| 08 Apr 2026

Status: this preprint is open for discussion and under review for Atmospheric Chemistry and Physics (ACP).

Assessment of Aerosol Iron Solubility using Global Dataset, Part II: Machine Learning and Deep Neural Network Coupled with SHapley Additive exPlanation Combined with Independent Component Analysis (SHAP-ICA)

Kohei Sakata, Minako Kurisu, and Yoshio Takahashi

Abstract. The supply of dissolved iron (d-Fe) can enhance marine CO₂ fixation. Aerosols are one source of d-Fe to the ocean surface, but aerosol iron solubility (Fe_sol%) depends on emission sources and atmospheric alteration processes that remain poorly reproduced by global climate and chemical transport models. Although recent advances in machine and deep learning models can capture nonlinear relationships in observational datasets, applications to environmental samples are still limited and approaches for improving interpretability require further development. This study trained XGBoost and a deep neural network (DNN) using East Asian aerosol data and tested whether Fe_sol% and d-Fe concentrations in marine aerosols can be reproduced. The effects of individual features on Fe_sol% and d-Fe were quantified using SHapley Additive exPlanations (SHAP), and independent component analysis (ICA) was applied to SHAP values to extract independent components representing dominant controlling processes of Fe_sol%. East Asian Fe_sol% was reproduced well by both XGBoost and DNN. For marine aerosols, higher reproducibility was achieved by the DNN than by XGBoost, likely because deeper relationships among features can be learned. SHAP indicated that variability in Fe_sol% and d-Fe is primarily driven by chemical alteration of Fe in mineral dust and anthropogenic aerosols. ICA further suggested that additional processes, including heavy oil combustion, influence a subset of samples. Spatial variations in process contributions were visualized by mapping the influence of each independent component. This DNN-based framework can improve interpretation of both current results and future observational datasets.

Received: 23 Mar 2026 – Discussion started: 08 Apr 2026

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 2958 KB)

Supplement (3633 KB)

Download & links

Kohei Sakata, Minako Kurisu, and Yoshio Takahashi

Status: open (until 20 May 2026)

Post a comment Subscribe to comment alert

Kohei Sakata, Minako Kurisu, and Yoshio Takahashi

Supplement

https://doi.org/10.5194/egusphere-2026-1615-supplement

Kohei Sakata, Minako Kurisu, and Yoshio Takahashi

Metrics will be available soon.

Latest update: 08 Apr 2026

Short summary

Aerosols supply dissolved iron (d-Fe) to the ocean surface, where it can enhance marine CO₂ fixation. Machine and deep learning can capture nonlinear relationships in observational datasets, but applications to atmospheric chemistry remain limited. Using East Asian aerosol data, this study trained XGBoost and a deep neural network to predict Fesol% and d-Fe in marine aerosols. SHAP and ICA showed that variability was governed mainly by chemical processing of mineral dust and anthropogenic Fe.