Cargando…

Accurate global machine learning force fields for molecules with hundreds of atoms

Global machine learning force fields, with the capacity to capture collective interactions in molecular systems, now scale up to a few dozen atoms due to considerable growth of model complexity with system size. For larger molecules, locality assumptions are introduced, with the consequence that non...

Descripción completa

Detalles Bibliográficos
Autores principales: Chmiela, Stefan, Vassilev-Galindo, Valentin, Unke, Oliver T., Kabylda, Adil, Sauceda, Huziel E., Tkatchenko, Alexandre, Müller, Klaus-Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Association for the Advancement of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9833674/
https://www.ncbi.nlm.nih.gov/pubmed/36630510
http://dx.doi.org/10.1126/sciadv.adf0873
_version_ 1784868294450216960
author Chmiela, Stefan
Vassilev-Galindo, Valentin
Unke, Oliver T.
Kabylda, Adil
Sauceda, Huziel E.
Tkatchenko, Alexandre
Müller, Klaus-Robert
author_facet Chmiela, Stefan
Vassilev-Galindo, Valentin
Unke, Oliver T.
Kabylda, Adil
Sauceda, Huziel E.
Tkatchenko, Alexandre
Müller, Klaus-Robert
author_sort Chmiela, Stefan
collection PubMed
description Global machine learning force fields, with the capacity to capture collective interactions in molecular systems, now scale up to a few dozen atoms due to considerable growth of model complexity with system size. For larger molecules, locality assumptions are introduced, with the consequence that nonlocal interactions are not described. Here, we develop an exact iterative approach to train global symmetric gradient domain machine learning (sGDML) force fields (FFs) for several hundred atoms, without resorting to any potentially uncontrolled approximations. All atomic degrees of freedom remain correlated in the global sGDML FF, allowing the accurate description of complex molecules and materials that present phenomena with far-reaching characteristic correlation lengths. We assess the accuracy and efficiency of sGDML on a newly developed MD22 benchmark dataset containing molecules from 42 to 370 atoms. The robustness of our approach is demonstrated in nanosecond path-integral molecular dynamics simulations for supramolecular complexes in the MD22 dataset.
format Online
Article
Text
id pubmed-9833674
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Association for the Advancement of Science
record_format MEDLINE/PubMed
spelling pubmed-98336742023-01-18 Accurate global machine learning force fields for molecules with hundreds of atoms Chmiela, Stefan Vassilev-Galindo, Valentin Unke, Oliver T. Kabylda, Adil Sauceda, Huziel E. Tkatchenko, Alexandre Müller, Klaus-Robert Sci Adv Physical and Materials Sciences Global machine learning force fields, with the capacity to capture collective interactions in molecular systems, now scale up to a few dozen atoms due to considerable growth of model complexity with system size. For larger molecules, locality assumptions are introduced, with the consequence that nonlocal interactions are not described. Here, we develop an exact iterative approach to train global symmetric gradient domain machine learning (sGDML) force fields (FFs) for several hundred atoms, without resorting to any potentially uncontrolled approximations. All atomic degrees of freedom remain correlated in the global sGDML FF, allowing the accurate description of complex molecules and materials that present phenomena with far-reaching characteristic correlation lengths. We assess the accuracy and efficiency of sGDML on a newly developed MD22 benchmark dataset containing molecules from 42 to 370 atoms. The robustness of our approach is demonstrated in nanosecond path-integral molecular dynamics simulations for supramolecular complexes in the MD22 dataset. American Association for the Advancement of Science 2023-01-11 /pmc/articles/PMC9833674/ /pubmed/36630510 http://dx.doi.org/10.1126/sciadv.adf0873 Text en Copyright © 2023 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC). https://creativecommons.org/licenses/by-nc/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license (https://creativecommons.org/licenses/by-nc/4.0/) , which permits use, distribution, and reproduction in any medium, so long as the resultant use is not for commercial advantage and provided the original work is properly cited.
spellingShingle Physical and Materials Sciences
Chmiela, Stefan
Vassilev-Galindo, Valentin
Unke, Oliver T.
Kabylda, Adil
Sauceda, Huziel E.
Tkatchenko, Alexandre
Müller, Klaus-Robert
Accurate global machine learning force fields for molecules with hundreds of atoms
title Accurate global machine learning force fields for molecules with hundreds of atoms
title_full Accurate global machine learning force fields for molecules with hundreds of atoms
title_fullStr Accurate global machine learning force fields for molecules with hundreds of atoms
title_full_unstemmed Accurate global machine learning force fields for molecules with hundreds of atoms
title_short Accurate global machine learning force fields for molecules with hundreds of atoms
title_sort accurate global machine learning force fields for molecules with hundreds of atoms
topic Physical and Materials Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9833674/
https://www.ncbi.nlm.nih.gov/pubmed/36630510
http://dx.doi.org/10.1126/sciadv.adf0873
work_keys_str_mv AT chmielastefan accurateglobalmachinelearningforcefieldsformoleculeswithhundredsofatoms
AT vassilevgalindovalentin accurateglobalmachinelearningforcefieldsformoleculeswithhundredsofatoms
AT unkeolivert accurateglobalmachinelearningforcefieldsformoleculeswithhundredsofatoms
AT kabyldaadil accurateglobalmachinelearningforcefieldsformoleculeswithhundredsofatoms
AT saucedahuziele accurateglobalmachinelearningforcefieldsformoleculeswithhundredsofatoms
AT tkatchenkoalexandre accurateglobalmachinelearningforcefieldsformoleculeswithhundredsofatoms
AT mullerklausrobert accurateglobalmachinelearningforcefieldsformoleculeswithhundredsofatoms