Preprints
https://doi.org/10.5194/egusphere-2025-4137
https://doi.org/10.5194/egusphere-2025-4137
04 Sep 2025
 | 04 Sep 2025
Status: this preprint is open for discussion and under review for Atmospheric Measurement Techniques (AMT).

A Machine Learning Method for Estimating Atmospheric Trace Gas Concentration Baselines

Kirstin Gerrand, Elena Fillola, Alistair J. Manning, Jgor Arduini, Paul B. Krummel, Chris R. Lunder, Jens Mühle, Simon O'Doherty, Sunyoung Park, Ronald G. Prinn, Stefan Reimann, Dickon Young, and Matthew Rigby

Abstract. Estimates of trace gas baseline mole fractions in high-frequency atmospheric measurement records are crucial for analysing long-term changes in atmospheric composition. Baseline mole fractions are those that would be observed far from emission sources (and hence are representative of background conditions) at specific latitudes in the atmosphere. Previous methods for inferring baseline mole fractions have used statistical or meteorological approaches, or, if available, co-measured tracer species thought only to be emitted from non-baseline wind sectors. Combinations of these techniques have also been employed in some applications. Statistical methods typically fit a baseline to the observations themselves, while meteorological methods use atmospheric models of varying complexity to categorise air mass origins. In this paper, we present a novel machine learning method for estimating trace gas baseline mole fractions, which benefits from the physical basis of model-based filtering without the need for running an expensive simulator. Our approach offers the accessibility and computational cost-effectiveness of statistical models, without the associated smoothing or difficulty in identifying rapid baseline variations. By training on historical Lagrangian particle dispersion model outputs, our model learns to predict baseline mole fractions directly from meteorological fields. This advancement opens new avenues for low-latency trace gas time series data analysis, reconstruction of historical baseline trends, and improved utilisation of tracer measurement air mass classification methods.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Kirstin Gerrand, Elena Fillola, Alistair J. Manning, Jgor Arduini, Paul B. Krummel, Chris R. Lunder, Jens Mühle, Simon O'Doherty, Sunyoung Park, Ronald G. Prinn, Stefan Reimann, Dickon Young, and Matthew Rigby

Status: open (until 10 Oct 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Kirstin Gerrand, Elena Fillola, Alistair J. Manning, Jgor Arduini, Paul B. Krummel, Chris R. Lunder, Jens Mühle, Simon O'Doherty, Sunyoung Park, Ronald G. Prinn, Stefan Reimann, Dickon Young, and Matthew Rigby
Kirstin Gerrand, Elena Fillola, Alistair J. Manning, Jgor Arduini, Paul B. Krummel, Chris R. Lunder, Jens Mühle, Simon O'Doherty, Sunyoung Park, Ronald G. Prinn, Stefan Reimann, Dickon Young, and Matthew Rigby
Metrics will be available soon.
Latest update: 04 Sep 2025
Download
Short summary
To analyse long-term trends in atmospheric trace gas concentrations, it is important to identify data points minimally affected by local pollution sources or air masses carried from other latitudes or altitudes. Traditional methods for detecting these “baselines” are computationally expensive or lack a basis in physical principles. This paper introduces a machine-learning method that uses meteorological data and offers significantly lower computational costs compared to physics-based techniques.
Share