Preprints
https://doi.org/10.5194/egusphere-2026-1562
https://doi.org/10.5194/egusphere-2026-1562
21 Apr 2026
 | 21 Apr 2026
Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

Transferable Hourly Ozone Forecasting with Transformers

Sindhu Vasireddy, Michael Langguth, and Martin Schultz

Abstract. We investigate the suitability of a transformer-based approach for air-quality forecasting, focusing on 4-day ahead hourly predictions of surface ozone (O3). The study employs Google’s Temporal Fusion Transformer (TFT) to integrate meteorological predictors, historical pollutant observations, and static station metadata, using an open source implementation with minimal domain-specific preprocessing. The analysis addresses two questions: (1) how efficiently a transformer model can be deployed for regional air quality forecasting, and (2) how well the learned representations transfer across geophysically distinct regions.

Model performance is evaluated against state-of-the-art regional chemical transport model Copernicus Atmosphere Monitoring Service (CAMS) ensemble forecast using observations from Germany. The TFT consistently achieves lower bias and higher forecast skill across all lead times. Suburban monitoring sites exhibit the highest skill relative to CAMS based on RMSE and SMAPE-based metrics. Urban stations show moderate skill against CAMS baseline, while rural stations have reduced skill in comparison but remain positive across the full 96 h forecast, with the strongest improvements observed at shorter lead times. Post–day-1 results indicate a clear separation of performance by station type; suggesting increasing performance stratification by station type beyond day 1, with larger relative gains at urban and suburban sites and smaller but consistently positive skill at rural locations.

Geographic transferability is assessed by adapting a model trained over Germany to South Korea by retraining region-specific metadata embeddings while preserving learned temporal representations. Forecast errors increase by only 5–10 %, indicating that the model captures meteorological drivers of O3 variability that generalize across contrasting anthropogenic and climatic regimes. Ablation experiments further demonstrate the robustness of the chosen experimental configuration for both forecasting performance and cross region transferability.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Sindhu Vasireddy, Michael Langguth, and Martin Schultz

Status: open (until 16 Jun 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Sindhu Vasireddy, Michael Langguth, and Martin Schultz
Metrics will be available soon.
Latest update: 22 Apr 2026
Download
Short summary
This study evaluates a transformer model for hourly air quality forecasting using past pollution, weather, and anthropogenic metadata (emissions, land use). It outperforms Copernicus Atmosphere Monitoring Service forecasts, especially in urban regions, with lower bias and improved stability. Trained in Germany, it transfers to South Korea with minimal adaptation, preserving geochemical relationships and showing strong cross-regional generalization.
Share