Preprints
https://doi.org/10.5194/egusphere-2026-696
https://doi.org/10.5194/egusphere-2026-696
16 Feb 2026
 | 16 Feb 2026
Status: this preprint is open for discussion and under review for Atmospheric Chemistry and Physics (ACP).

Machine learning interatomic potentials with accurate long-range interactions for molecular dynamics collision simulations of atmospherically-relevant molecules

Ivo Neefjes, Jakub Kubečka, and Jonas Elm

Abstract. Molecular collisions and subsequent clustering events are fundamental to atmospheric cluster formation. Accurately modeling these processes requires interatomic potentials that capture long-range forces governing collision kinetics and short-range quantum effects driving reactivity. In this work, we evaluate the AIMNet2 and PaiNN machine learning architectures trained on GFN1-xTB and ωB97X-3c data for molecular collisions involving sulfuric acid.

The models exhibit low mean absolute errors in energies and forces and accurately reproduce potentials of mean force relative to GFN1-xTB. Comparing models trained on GFN1-xTB and ωB97X-3c data reveals that while increasing the electronic structure theory level significantly alters the potential energy surface in the binding region, it has negligible impact on the long-range shoulder and collision rate coefficients. Notably, PaiNN demonstrates superior performance in reproducing binding and repulsive regions, making it highly effective for sampling stable cluster configurations.

However, discrepancies are observed in collision dynamics. While AIMNet2 accurately reproduces reference collision rates across all systems, PaiNN underestimates the rate for the charged sulfuric acid–bisulfate system by ~50 %. This error originates from the model's local atomic environment approximation, which neglects long-range attractive forces at large intermolecular distances. Comparisons with the OPLS-AA force field demonstrate that simple fixed partial charges are sufficient to describe these interactions.

Our results highlight that while local equivariant models like PaiNN offer exceptional accuracy for thermodynamics, correctly simulating collision kinetics in systems with strong long-range interactions requires models that explicitly account for forces beyond the local environment, such as AIMNet2.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Ivo Neefjes, Jakub Kubečka, and Jonas Elm

Status: open (until 30 Mar 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Ivo Neefjes, Jakub Kubečka, and Jonas Elm

Data sets

neefjes26_long_range_NN Ivo Neefjes, Jakub Kubečka, and Jonas Elm https://github.com/elmjonas/ACDB/tree/master/Articles/neefjes26_long_range_NN

Model code and software

neefjes26_long_range_NN Ivo Neefjes, Jakub Kubečka, and Jonas Elm https://github.com/elmjonas/ACDB/tree/master/Articles/neefjes26_long_range_NN

Ivo Neefjes, Jakub Kubečka, and Jonas Elm

Viewed

Total article views: 34 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
28 5 1 34 3 2 1
  • HTML: 28
  • PDF: 5
  • XML: 1
  • Total: 34
  • Supplement: 3
  • BibTeX: 2
  • EndNote: 1
Views and downloads (calculated since 16 Feb 2026)
Cumulative views and downloads (calculated since 16 Feb 2026)

Viewed (geographical distribution)

Total article views: 29 (including HTML, PDF, and XML) Thereof 29 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 17 Feb 2026
Download
Short summary
Atmospheric particles impact climate and health. Most particles form through gas molecules colliding and sticking together. We use molecular dynamics accelerated by machine learning to study this process. We found that standard machine learning models often fail to capture the long-range forces driving collisions, and models with explicit long-range corrections are needed. This work provides a blueprint for accurate simulations of particle formation.
Share