Cargando…

Gaussian Process Regression Models for Predicting Atomic Energies and Multipole Moments

[Image: see text] Developing a force field is a difficult task because its design is typically pulled in opposite directions by speed and accuracy. FFLUX breaks this trend by utilizing Gaussian process regression (GPR) to predict, at ab initio accuracy, atomic energies and multipole moments as obtai...

Descripción completa

Detalles Bibliográficos
Autores principales: Burn, Matthew J., Popelier, Paul L. A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9979601/
https://www.ncbi.nlm.nih.gov/pubmed/36757024
http://dx.doi.org/10.1021/acs.jctc.2c00731
_version_ 1784899756910182400
author Burn, Matthew J.
Popelier, Paul L. A.
author_facet Burn, Matthew J.
Popelier, Paul L. A.
author_sort Burn, Matthew J.
collection PubMed
description [Image: see text] Developing a force field is a difficult task because its design is typically pulled in opposite directions by speed and accuracy. FFLUX breaks this trend by utilizing Gaussian process regression (GPR) to predict, at ab initio accuracy, atomic energies and multipole moments as obtained from the quantum theory of atoms in molecules (QTAIM). This work demonstrates that the in-house FFLUX training pipeline can generate successful GPR models for six representative molecules: peptide-capped glycine and alanine, glucose, paracetamol, aspirin, and ibuprofen. The molecules were sufficiently distorted to represent configurations from an AMBER-GAFF2 molecular dynamics run. All internal degrees of freedom were covered corresponding to 93 dimensions in the case of the largest molecule ibuprofen (33 atoms). Benefiting from active learning, the GPR models contain only about 2000 training points and return largely sub-kcal mol(–1) prediction errors for the validation sets. A proof of concept has been reached for transferring the model produced through active learning on one atomic property to that of the remaining atomic properties. The prediction of electrostatic interaction can be assessed at the intermolecular level, and the vast majority of interactions have a root-mean-square error of less than 0.1 kJ mol(–1) with a maximum value of ∼1 kJ mol(–1) for a glycine and paracetamol dimer.
format Online
Article
Text
id pubmed-9979601
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-99796012023-03-03 Gaussian Process Regression Models for Predicting Atomic Energies and Multipole Moments Burn, Matthew J. Popelier, Paul L. A. J Chem Theory Comput [Image: see text] Developing a force field is a difficult task because its design is typically pulled in opposite directions by speed and accuracy. FFLUX breaks this trend by utilizing Gaussian process regression (GPR) to predict, at ab initio accuracy, atomic energies and multipole moments as obtained from the quantum theory of atoms in molecules (QTAIM). This work demonstrates that the in-house FFLUX training pipeline can generate successful GPR models for six representative molecules: peptide-capped glycine and alanine, glucose, paracetamol, aspirin, and ibuprofen. The molecules were sufficiently distorted to represent configurations from an AMBER-GAFF2 molecular dynamics run. All internal degrees of freedom were covered corresponding to 93 dimensions in the case of the largest molecule ibuprofen (33 atoms). Benefiting from active learning, the GPR models contain only about 2000 training points and return largely sub-kcal mol(–1) prediction errors for the validation sets. A proof of concept has been reached for transferring the model produced through active learning on one atomic property to that of the remaining atomic properties. The prediction of electrostatic interaction can be assessed at the intermolecular level, and the vast majority of interactions have a root-mean-square error of less than 0.1 kJ mol(–1) with a maximum value of ∼1 kJ mol(–1) for a glycine and paracetamol dimer. American Chemical Society 2023-02-09 /pmc/articles/PMC9979601/ /pubmed/36757024 http://dx.doi.org/10.1021/acs.jctc.2c00731 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Burn, Matthew J.
Popelier, Paul L. A.
Gaussian Process Regression Models for Predicting Atomic Energies and Multipole Moments
title Gaussian Process Regression Models for Predicting Atomic Energies and Multipole Moments
title_full Gaussian Process Regression Models for Predicting Atomic Energies and Multipole Moments
title_fullStr Gaussian Process Regression Models for Predicting Atomic Energies and Multipole Moments
title_full_unstemmed Gaussian Process Regression Models for Predicting Atomic Energies and Multipole Moments
title_short Gaussian Process Regression Models for Predicting Atomic Energies and Multipole Moments
title_sort gaussian process regression models for predicting atomic energies and multipole moments
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9979601/
https://www.ncbi.nlm.nih.gov/pubmed/36757024
http://dx.doi.org/10.1021/acs.jctc.2c00731
work_keys_str_mv AT burnmatthewj gaussianprocessregressionmodelsforpredictingatomicenergiesandmultipolemoments
AT popelierpaulla gaussianprocessregressionmodelsforpredictingatomicenergiesandmultipolemoments