Cargando…
Gaussian Process Regression Models for Predicting Atomic Energies and Multipole Moments
[Image: see text] Developing a force field is a difficult task because its design is typically pulled in opposite directions by speed and accuracy. FFLUX breaks this trend by utilizing Gaussian process regression (GPR) to predict, at ab initio accuracy, atomic energies and multipole moments as obtai...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2023
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9979601/ https://www.ncbi.nlm.nih.gov/pubmed/36757024 http://dx.doi.org/10.1021/acs.jctc.2c00731 |
_version_ | 1784899756910182400 |
---|---|
author | Burn, Matthew J. Popelier, Paul L. A. |
author_facet | Burn, Matthew J. Popelier, Paul L. A. |
author_sort | Burn, Matthew J. |
collection | PubMed |
description | [Image: see text] Developing a force field is a difficult task because its design is typically pulled in opposite directions by speed and accuracy. FFLUX breaks this trend by utilizing Gaussian process regression (GPR) to predict, at ab initio accuracy, atomic energies and multipole moments as obtained from the quantum theory of atoms in molecules (QTAIM). This work demonstrates that the in-house FFLUX training pipeline can generate successful GPR models for six representative molecules: peptide-capped glycine and alanine, glucose, paracetamol, aspirin, and ibuprofen. The molecules were sufficiently distorted to represent configurations from an AMBER-GAFF2 molecular dynamics run. All internal degrees of freedom were covered corresponding to 93 dimensions in the case of the largest molecule ibuprofen (33 atoms). Benefiting from active learning, the GPR models contain only about 2000 training points and return largely sub-kcal mol(–1) prediction errors for the validation sets. A proof of concept has been reached for transferring the model produced through active learning on one atomic property to that of the remaining atomic properties. The prediction of electrostatic interaction can be assessed at the intermolecular level, and the vast majority of interactions have a root-mean-square error of less than 0.1 kJ mol(–1) with a maximum value of ∼1 kJ mol(–1) for a glycine and paracetamol dimer. |
format | Online Article Text |
id | pubmed-9979601 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-99796012023-03-03 Gaussian Process Regression Models for Predicting Atomic Energies and Multipole Moments Burn, Matthew J. Popelier, Paul L. A. J Chem Theory Comput [Image: see text] Developing a force field is a difficult task because its design is typically pulled in opposite directions by speed and accuracy. FFLUX breaks this trend by utilizing Gaussian process regression (GPR) to predict, at ab initio accuracy, atomic energies and multipole moments as obtained from the quantum theory of atoms in molecules (QTAIM). This work demonstrates that the in-house FFLUX training pipeline can generate successful GPR models for six representative molecules: peptide-capped glycine and alanine, glucose, paracetamol, aspirin, and ibuprofen. The molecules were sufficiently distorted to represent configurations from an AMBER-GAFF2 molecular dynamics run. All internal degrees of freedom were covered corresponding to 93 dimensions in the case of the largest molecule ibuprofen (33 atoms). Benefiting from active learning, the GPR models contain only about 2000 training points and return largely sub-kcal mol(–1) prediction errors for the validation sets. A proof of concept has been reached for transferring the model produced through active learning on one atomic property to that of the remaining atomic properties. The prediction of electrostatic interaction can be assessed at the intermolecular level, and the vast majority of interactions have a root-mean-square error of less than 0.1 kJ mol(–1) with a maximum value of ∼1 kJ mol(–1) for a glycine and paracetamol dimer. American Chemical Society 2023-02-09 /pmc/articles/PMC9979601/ /pubmed/36757024 http://dx.doi.org/10.1021/acs.jctc.2c00731 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Burn, Matthew J. Popelier, Paul L. A. Gaussian Process Regression Models for Predicting Atomic Energies and Multipole Moments |
title | Gaussian Process
Regression Models for Predicting
Atomic Energies and Multipole Moments |
title_full | Gaussian Process
Regression Models for Predicting
Atomic Energies and Multipole Moments |
title_fullStr | Gaussian Process
Regression Models for Predicting
Atomic Energies and Multipole Moments |
title_full_unstemmed | Gaussian Process
Regression Models for Predicting
Atomic Energies and Multipole Moments |
title_short | Gaussian Process
Regression Models for Predicting
Atomic Energies and Multipole Moments |
title_sort | gaussian process
regression models for predicting
atomic energies and multipole moments |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9979601/ https://www.ncbi.nlm.nih.gov/pubmed/36757024 http://dx.doi.org/10.1021/acs.jctc.2c00731 |
work_keys_str_mv | AT burnmatthewj gaussianprocessregressionmodelsforpredictingatomicenergiesandmultipolemoments AT popelierpaulla gaussianprocessregressionmodelsforpredictingatomicenergiesandmultipolemoments |