Machine learning interatomic potentials with accurate long-range interactions for molecular dynamics collision simulations of atmospherically-relevant molecules

Neefjes, Ivo; Kubečka, Jakub; Elm, Jonas

doi:10.5194/egusphere-2026-696

Preprints

https://doi.org/10.5194/egusphere-2026-696

16 Feb 2026

Status: this preprint is open for discussion and under review for Atmospheric Chemistry and Physics (ACP).

Abstract. Molecular collisions and subsequent clustering events are fundamental to atmospheric cluster formation. Accurately modeling these processes requires interatomic potentials that capture long-range forces governing collision kinetics and short-range quantum effects driving reactivity. In this work, we evaluate the AIMNet2 and PaiNN machine learning architectures trained on GFN1-xTB and ωB97X-3c data for molecular collisions involving sulfuric acid.

The models exhibit low mean absolute errors in energies and forces and accurately reproduce potentials of mean force relative to GFN1-xTB. Comparing models trained on GFN1-xTB and ωB97X-3c data reveals that while increasing the electronic structure theory level significantly alters the potential energy surface in the binding region, it has negligible impact on the long-range shoulder and collision rate coefficients. Notably, PaiNN demonstrates superior performance in reproducing binding and repulsive regions, making it highly effective for sampling stable cluster configurations.
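As a point of reference, the mean absolute errors mentioned above can be computed along the following lines. This is a minimal sketch, not the authors' evaluation script: the function name, array shapes, and toy numbers are illustrative assumptions, and the units are whatever the reference data use.

```python
import numpy as np

def mean_absolute_error(predicted, reference):
    """Mean absolute error over all elements of two same-shaped arrays."""
    predicted = np.asarray(predicted, dtype=float)
    reference = np.asarray(reference, dtype=float)
    if predicted.shape != reference.shape:
        raise ValueError("predicted and reference must have the same shape")
    return float(np.mean(np.abs(predicted - reference)))

# Hypothetical toy data (not from the paper): 2 structures, 3 atoms each.
ref_energies = np.array([-10.0, -12.0])   # one energy per structure
ml_energies = np.array([-10.1, -11.8])
ref_forces = np.zeros((2, 3, 3))          # (structure, atom, xyz component)
ml_forces = np.full((2, 3, 3), 0.05)

energy_mae = mean_absolute_error(ml_energies, ref_energies)
force_mae = mean_absolute_error(ml_forces, ref_forces)
print(f"energy MAE: {energy_mae:.3f}, force MAE: {force_mae:.3f}")
```

Force errors are averaged here over every Cartesian component of every atom, which is one common convention; per-atom force-vector norms are another and give different numbers.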

However, discrepancies are observed in collision dynamics. While AIMNet2 accurately reproduces reference collision rates across all systems, PaiNN underestimates the rate for the charged sulfuric acid–bisulfate system by ~50 %. This error originates from the model's local atomic environment approximation, which neglects long-range attractive forces at large intermolecular distances. Comparisons with the OPLS-AA force field demonstrate that simple fixed partial charges are sufficient to describe these interactions.
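The effect of a local environment approximation can be illustrated with a toy calculation. The sketch below is not the authors' method and does not use PaiNN, AIMNet2, or OPLS-AA parameters: it sums Coulomb interactions between hypothetical fixed partial charges on an ion and a neutral dipolar fragment, once in full and once with a hard distance cutoff. The charges, geometry, and the 5 Å cutoff are assumptions chosen only for illustration, and a real message-passing model has a more complicated effective range than a single hard cutoff.

```python
import numpy as np

# Coulomb constant in kcal mol^-1 Å e^-2 (standard value in these units).
COULOMB_K = 332.0637

def pair_coulomb(pos_a, q_a, pos_b, q_b, cutoff=None):
    """Coulomb energy between fixed point charges on two fragments.

    If `cutoff` is given, pairs farther apart than `cutoff` (in Å)
    contribute nothing, mimicking a strictly local model.
    """
    diff = pos_a[:, None, :] - pos_b[None, :, :]
    r = np.linalg.norm(diff, axis=-1)              # pairwise distances
    energy = COULOMB_K * q_a[:, None] * q_b[None, :] / r
    if cutoff is not None:
        energy = np.where(r <= cutoff, energy, 0.0)
    return float(energy.sum())

# Hypothetical fragments: a neutral dipole and an anion 15 Å away.
dipole_pos = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]])
dipole_q = np.array([-0.5, 0.5])
ion_pos = np.array([[15.0, 0.0, 0.0]])
ion_q = np.array([-1.0])

full = pair_coulomb(dipole_pos, dipole_q, ion_pos, ion_q)
truncated = pair_coulomb(dipole_pos, dipole_q, ion_pos, ion_q, cutoff=5.0)
print(f"full: {full:.3f} kcal/mol, truncated: {truncated:.3f} kcal/mol")
```

In this toy setup the full sum is attractive at 15 Å, while the truncated sum is exactly zero, so the approaching fragments would feel no pull until they come within the cutoff.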

Our results highlight that while local equivariant models like PaiNN offer exceptional accuracy for thermodynamics, correctly simulating collision kinetics in systems with strong long-range interactions requires models that explicitly account for forces beyond the local environment, such as AIMNet2.

Received: 05 Feb 2026 – Discussion started: 16 Feb 2026

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 3103 KB)

Supplement (3474 KB)

Status: open (until 05 Apr 2026)

RC1: 'Comment on egusphere-2026-696', Anonymous Referee #1, 09 Mar 2026

The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2026/egusphere-2026-696/egusphere-2026-696-RC1-supplement.pdf

Citation: https://doi.org/10.5194/egusphere-2026-696-RC1

RC2: 'Comment on egusphere-2026-696', Anonymous Referee #2, 10 Mar 2026

The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2026/egusphere-2026-696/egusphere-2026-696-RC2-supplement.pdf

Citation: https://doi.org/10.5194/egusphere-2026-696-RC2

Supplement

https://doi.org/10.5194/egusphere-2026-696-supplement

Data sets

neefjes26_long_range_NN Ivo Neefjes, Jakub Kubečka, and Jonas Elm https://github.com/elmjonas/ACDB/tree/master/Articles/neefjes26_long_range_NN

Model code and software

neefjes26_long_range_NN Ivo Neefjes, Jakub Kubečka, and Jonas Elm https://github.com/elmjonas/ACDB/tree/master/Articles/neefjes26_long_range_NN

Viewed

Total article views: 155 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	Supplement	BibTeX	EndNote
97	49	9	155	16	10	16

Views and downloads (calculated since 16 Feb 2026)

Month	HTML	PDF	XML	Total
Feb 2026	66	40	9	115
Mar 2026	31	9	0	40

Cumulative views and downloads (calculated since 16 Feb 2026)

Month	HTML	PDF	XML	Total
Feb 2026	66	40	9	115
Mar 2026	97	49	9	155

Viewed (geographical distribution)

Total article views: 159 (including HTML, PDF, and XML) Thereof 159 with geography defined and 0 with unknown origin.

Latest update: 10 Mar 2026

Short summary

Atmospheric particles impact climate and health. Most particles form through gas molecules colliding and sticking together. We use molecular dynamics accelerated by machine learning to study this process. We found that standard machine learning models often fail to capture the long-range forces driving collisions, and models with explicit long-range corrections are needed. This work provides a blueprint for accurate simulations of particle formation.

