Correcting Aerosol Extinction Coefficient Vertical Structure Biases in GEOS-Chem via a Physics-Informed Transformer with Physical Mechanism Diagnosis

Xiong, Jiajun; Wang, Yi; Wang, Jun; Wang, Yanyu; Zhou, Meng; Tao, Minghui; Dong, Wenhui; Kim, Jhoon; Wang, Lunche

doi:10.5194/egusphere-2026-397

Preprints

https://doi.org/10.5194/egusphere-2026-397

Preprints

17 Feb 2026

| 17 Feb 2026

Status: this preprint is open for discussion and under review for Atmospheric Chemistry and Physics (ACP).

Correcting Aerosol Extinction Coefficient Vertical Structure Biases in GEOS-Chem via a Physics-Informed Transformer with Physical Mechanism Diagnosis

Jiajun Xiong, Yi Wang, Jun Wang, Yanyu Wang, Meng Zhou, Minghui Tao, Wenhui Dong, Jhoon Kim, and Lunche Wang

Abstract. We propose a physics-informed Transformer framework to correct biases in the Aerosol Extinction Coefficient (AEC, km^-1) profiles simulated by GEOS-Chem. Unlike standard Transformer, our framework features a dual-stream architecture with explicit physical constraints. It employs Gated Feature Fusion to integrate vertical structures (combining GEOS-Chem priors with MERRA-2 profiles) by dynamically identifying height-dependent drivers, and leverages Cross-Attention to incorporate MERRA-2 surface environmental constraints for modulating AEC vertical reconstruction with synoptic contexts. This approach effectively predicts systematic biases relative to Cloud-Aerosol Lidar with Orthogonal Polarization satellite observations and resolves AEC profiles, surpassing methods retrieving only aerosol layer heights. "Leave-One-Year-Out" validation over East Asia during 2017–2019 demonstrates significant AEC fidelity improvements, increasing R from 0.49–0.53 in the GEOS-Chem simulations to 0.66–0.73 and reducing RMSE by approximately 25 %. The model effectively mitigates over-diffusion, significantly reducing AEC simulation biases in the critical near-surface layer while restoring smoothed biomass burning and dust plumes. Additionally, it exhibits robust cross-continental transferability, reproducing bias patterns over North American domain (R=0.70) without retraining, confirming the internalization of universal physicochemical relationships linking atmospheric states to simulation biases. Furthermore, interpretability analysis establishes a feedback loop from data-driven correction to physical model improvement. The model identifies temperature and sensible heat flux as primary drivers to constrain boundary layer mixing, and uses environmental proxies (e.g., vegetation indices) to diagnose deficiencies in dust uplift and secondary aerosol formation. These insights provide a physical basis for refining parameterization schemes in chemical transport models.

Received: 24 Jan 2026 – Discussion started: 17 Feb 2026

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 7550 KB)

Supplement (7441 KB)

Download & links

Jiajun Xiong, Yi Wang, Jun Wang, Yanyu Wang, Meng Zhou, Minghui Tao, Wenhui Dong, Jhoon Kim, and Lunche Wang

Status: open (until 31 Mar 2026)

Post a comment Subscribe to comment alert

RC1: 'Comment on egusphere-2026-397', Anonymous Referee #1, 09 Mar 2026 reply

This manuscript presents a sophisticated physics-informed Transformer framework to correct GEOS-Chem aerosol extinction coefficient profiles using CALIOP observations. The study is ambitious, methodologically advanced, and addresses an important problem in bridging chemical transport models (CTMs) and vertically resolved lidar observations. The reported improvements in correlation and RMSE, along with cross-continental transferability tests, are promising. However, several issues require clarification before the scientific contribution and methodological advantage can be properly evaluated as follows.
First, the scientific objective requires clearer framing. CALIOP observations are used to define simulation bias during training, but they are not included as inputs during inference. Therefore, the framework is not performing data assimilation, but rather learning a state-dependent mapping between atmospheric variables and historical GEOS-Chem biases. If the goal is to generate corrected AEC fields when CALIOP is unavailable, the method should be clearly described as a supervised bias-correction model conditioned on CTM state and meteorology, and its limitations should be acknowledged. For example, if key emissions (e.g., wildfire events) are missing in GEOS-Chem and not represented in the input features, the model cannot reconstruct those missing signals. The correction is inherently constrained by the information content of the CTM and meteorological predictors. The manuscript should therefore distinguish more carefully between correcting systematic state-dependent biases and compensating for missing physical processes. Clarifying this distinction would strengthen the scientific positioning of the study.
Second, the model architecture appears to rely on instantaneous vertical profiles and meteorological context, without explicit time-series modeling. It is unclear whether any temporal continuity, lagged predictors, or time-window averaging is incorporated into the inputs. A precise description of the temporal collocation strategy between GEOS-Chem and CALIOP is necessary to assess the robustness of the results. In addition, the manuscript does not discuss how diurnal variability in aerosol vertical structure is handled. Given the strong diurnal cycle of boundary layer evolution, turbulent mixing, hygroscopic growth, and photochemistry, aerosol extinction can vary substantially on hourly timescales. It should be clarified whether simple hour-by-hour matching is sufficient, or whether a temporal window similar to those used in traditional data assimilation frameworks, was considered to reduce representativeness errors. Without such analysis, it remains uncertain whether the reported improvements reflect stable bias correction or sensitivity to sampling timing and diurnal variability.
Third, the proposed architecture includes multiple advanced components. While the performance improvements are reported relative to the original GEOS-Chem simulation, there is no comparison with simpler machine learning baselines. It is therefore unclear whether the reported gains arise from the Transformer architecture itself, from the inclusion of additional meteorological predictors, or simply from the supervised bias-learning framework. To justify the methodological novelty, the study should include comparisons with at least one conventional model, such as a multilayer perceptron, a CNN-based model, or a tree-based regression approach. Ideally, ablation experiments isolating the contributions of the cross-attention module and gated fusion mechanism would further demonstrate the necessity of the proposed architecture. Without such benchmarks, it is difficult to assess whether the architectural complexity is warranted.

Reply

Citation: https://doi.org/10.5194/egusphere-2026-397-RC1

Jiajun Xiong, Yi Wang, Jun Wang, Yanyu Wang, Meng Zhou, Minghui Tao, Wenhui Dong, Jhoon Kim, and Lunche Wang

Supplement

https://doi.org/10.5194/egusphere-2026-397-supplement

Jiajun Xiong, Yi Wang, Jun Wang, Yanyu Wang, Meng Zhou, Minghui Tao, Wenhui Dong, Jhoon Kim, and Lunche Wang

Viewed

Total article views: 175 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	Supplement	BibTeX	EndNote
103	66	6	175	28	5	11

HTML: 103
PDF: 66
XML: 6
Total: 175
Supplement: 28
BibTeX: 5
EndNote: 11

Views and downloads (calculated since 17 Feb 2026)

Month	HTML	PDF	XML	Total
Feb 2026	72	51	6	129
Mar 2026	31	15	0	46

Cumulative views and downloads (calculated since 17 Feb 2026)

Month	HTML	PDF	XML	Total
Feb 2026	72	51	6	129
Mar 2026	31	15	0	46

Viewed (geographical distribution)

Total article views: 197 (including HTML, PDF, and XML) Thereof 197 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 09 Mar 2026

Short summary

Current models struggle to simulate aerosol extinction profiles accurately. We introduce a physics-informed deep learning framework combining model simulations with satellite data to reconstruct precise three-dimensional aerosol fields. This method significantly reduces biases over East Asia and works effectively in North America without retraining. Crucially, it acts as a diagnostic tool to identify specific physical flaws in models, guiding improvements for climate research.


Total:	0
HTML:	0
PDF:	0
XML:	0