Preprints
https://doi.org/10.5194/egusphere-2025-6097
https://doi.org/10.5194/egusphere-2025-6097
30 Dec 2025
 | 30 Dec 2025
Status: this preprint is open for discussion and under review for Atmospheric Chemistry and Physics (ACP).

Technical note: Expanding ensemble forecasts with generative AI: application to volcanic clouds

Leonardo Mingari, Arnau Folch, Heribert Pascual, and Manuel Titos

Abstract. Ensemble-based modelling of the atmospheric dispersal of volcanic clouds enables more realistic forecasts by explicitly accounting for uncertainties in eruption source parameters, meteorological data, and systematic errors in transport models. Many ensemble applications, including quantification of forecast uncertainties, data assimilation, or probabilistic hazard assessments, require a large number of members to mitigate sampling errors and to properly capture probability distributions. However, running large ensembles with Volcanic Ash Transport and Dispersal (VATD) models can be computationally demanding, even for high-performance computing clusters. As a result, operational forecasting is typically restricted to smaller ensembles in order to fit time-to-solution requirements. In contrast, generative AI models can produce large volumes of physically-consistent data samples with minimal computational cost. In this work, a convolutional Variational AutoEncoder (VAE) is trained on an ensemble of 256 forecasts simulated with the FALL3D model and subsequently used to generate larger ensembles, effectively augmenting physics-based ensemble modelling capacity. Ensembles with up to 8192 members were generated nearly instantaneously using the trained neural network, with no reliance on HPC resources. The statistical properties of the expanded ensembles are characterised in detail, and the VAE performance is evaluated against a test dataset composed of 2048 numerical simulations. The VAE-generated ensembles closely approximate the actual (target) probability distribution as well as key sample statistics, such as ensemble mean and spread, with minimal degradation in the evaluation metrics. Finally, we discuss possible future applications of this work, including latent space data assimilation via deep learning.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Leonardo Mingari, Arnau Folch, Heribert Pascual, and Manuel Titos

Status: open (until 10 Feb 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Leonardo Mingari, Arnau Folch, Heribert Pascual, and Manuel Titos
Leonardo Mingari, Arnau Folch, Heribert Pascual, and Manuel Titos
Metrics will be available soon.
Latest update: 30 Dec 2025
Download
Short summary
Forecasting volcanic clouds is challenging because accurate modelling can require many numerical simulations which are computationally demanding. We use a generative AI model to create large ensembles of realistic forecasts quickly and inexpensively. This approach expands traditional modelling capabilities, helping scientists better understand uncertainty and improve volcanic cloud predictions, unleashing generative AI modelling for atmospheric transport of volcanic clouds.
Share