Cargando…

The Good, the Bad, and the Ugly: “HiPen”, a New Dataset for Validating (S)QM/MM Free Energy Simulations

Indirect (S)QM/MM free energy simulations (FES) are vital to efficiently incorporating sufficient sampling and accurate (QM) energetic evaluations when estimating free energies of practical/experimental interest. Connecting between levels of theory, i.e., calculating [Formula: see text] , remains to...

Descripción completa

Detalles Bibliográficos
Autores principales: Kearns, Fiona L., Warrensford, Luke, Boresch, Stefan, Woodcock, H. Lee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6413162/
https://www.ncbi.nlm.nih.gov/pubmed/30769826
http://dx.doi.org/10.3390/molecules24040681
_version_ 1783402772880687104
author Kearns, Fiona L.
Warrensford, Luke
Boresch, Stefan
Woodcock, H. Lee
author_facet Kearns, Fiona L.
Warrensford, Luke
Boresch, Stefan
Woodcock, H. Lee
author_sort Kearns, Fiona L.
collection PubMed
description Indirect (S)QM/MM free energy simulations (FES) are vital to efficiently incorporating sufficient sampling and accurate (QM) energetic evaluations when estimating free energies of practical/experimental interest. Connecting between levels of theory, i.e., calculating [Formula: see text] , remains to be the most challenging step within an indirect FES protocol. To improve calculations of [Formula: see text] , we must: (1) compare the performance of all FES methods currently available; and (2) compile and maintain datasets of [Formula: see text] calculated for a wide-variety of molecules so that future practitioners may replicate or improve upon the current state-of-the-art. Towards these two aims, we introduce a new dataset, “HiPen”, which tabulates [Formula: see text] (the free energy associated with switching from an [Formula: see text] to an [Formula: see text] molecular description using the 3ob parameter set in gas phase), calculated for 22 drug-like small molecules. We compare the calculation of this value using free energy perturbation, Bennett’s acceptance ratio, Jarzynski’s equation, and Crooks’ equation. We also predict the reliability of each calculated [Formula: see text] by evaluating several convergence criteria including sample size hysteresis, overlap statistics, and bias metric ([Formula: see text]). Within the total dataset, three distinct categories of molecules emerge: the “good” molecules, for which we can obtain converged [Formula: see text] using Jarzynski’s equation; “bad” molecules which require Crooks’ equation to obtain a converged [Formula: see text]; and “ugly” molecules for which we cannot obtain reliably converged [Formula: see text] with either Jarzynski’s or Crooks’ equations. We discuss, in depth, results from several example molecules in each of these categories and describe how dihedral discrepancies between levels of theory cause convergence failures even for these gas phase free energy simulations.
format Online
Article
Text
id pubmed-6413162
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64131622019-03-29 The Good, the Bad, and the Ugly: “HiPen”, a New Dataset for Validating (S)QM/MM Free Energy Simulations Kearns, Fiona L. Warrensford, Luke Boresch, Stefan Woodcock, H. Lee Molecules Article Indirect (S)QM/MM free energy simulations (FES) are vital to efficiently incorporating sufficient sampling and accurate (QM) energetic evaluations when estimating free energies of practical/experimental interest. Connecting between levels of theory, i.e., calculating [Formula: see text] , remains to be the most challenging step within an indirect FES protocol. To improve calculations of [Formula: see text] , we must: (1) compare the performance of all FES methods currently available; and (2) compile and maintain datasets of [Formula: see text] calculated for a wide-variety of molecules so that future practitioners may replicate or improve upon the current state-of-the-art. Towards these two aims, we introduce a new dataset, “HiPen”, which tabulates [Formula: see text] (the free energy associated with switching from an [Formula: see text] to an [Formula: see text] molecular description using the 3ob parameter set in gas phase), calculated for 22 drug-like small molecules. We compare the calculation of this value using free energy perturbation, Bennett’s acceptance ratio, Jarzynski’s equation, and Crooks’ equation. We also predict the reliability of each calculated [Formula: see text] by evaluating several convergence criteria including sample size hysteresis, overlap statistics, and bias metric ([Formula: see text]). Within the total dataset, three distinct categories of molecules emerge: the “good” molecules, for which we can obtain converged [Formula: see text] using Jarzynski’s equation; “bad” molecules which require Crooks’ equation to obtain a converged [Formula: see text]; and “ugly” molecules for which we cannot obtain reliably converged [Formula: see text] with either Jarzynski’s or Crooks’ equations. We discuss, in depth, results from several example molecules in each of these categories and describe how dihedral discrepancies between levels of theory cause convergence failures even for these gas phase free energy simulations. MDPI 2019-02-14 /pmc/articles/PMC6413162/ /pubmed/30769826 http://dx.doi.org/10.3390/molecules24040681 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kearns, Fiona L.
Warrensford, Luke
Boresch, Stefan
Woodcock, H. Lee
The Good, the Bad, and the Ugly: “HiPen”, a New Dataset for Validating (S)QM/MM Free Energy Simulations
title The Good, the Bad, and the Ugly: “HiPen”, a New Dataset for Validating (S)QM/MM Free Energy Simulations
title_full The Good, the Bad, and the Ugly: “HiPen”, a New Dataset for Validating (S)QM/MM Free Energy Simulations
title_fullStr The Good, the Bad, and the Ugly: “HiPen”, a New Dataset for Validating (S)QM/MM Free Energy Simulations
title_full_unstemmed The Good, the Bad, and the Ugly: “HiPen”, a New Dataset for Validating (S)QM/MM Free Energy Simulations
title_short The Good, the Bad, and the Ugly: “HiPen”, a New Dataset for Validating (S)QM/MM Free Energy Simulations
title_sort good, the bad, and the ugly: “hipen”, a new dataset for validating (s)qm/mm free energy simulations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6413162/
https://www.ncbi.nlm.nih.gov/pubmed/30769826
http://dx.doi.org/10.3390/molecules24040681
work_keys_str_mv AT kearnsfional thegoodthebadandtheuglyhipenanewdatasetforvalidatingsqmmmfreeenergysimulations
AT warrensfordluke thegoodthebadandtheuglyhipenanewdatasetforvalidatingsqmmmfreeenergysimulations
AT boreschstefan thegoodthebadandtheuglyhipenanewdatasetforvalidatingsqmmmfreeenergysimulations
AT woodcockhlee thegoodthebadandtheuglyhipenanewdatasetforvalidatingsqmmmfreeenergysimulations
AT kearnsfional goodthebadandtheuglyhipenanewdatasetforvalidatingsqmmmfreeenergysimulations
AT warrensfordluke goodthebadandtheuglyhipenanewdatasetforvalidatingsqmmmfreeenergysimulations
AT boreschstefan goodthebadandtheuglyhipenanewdatasetforvalidatingsqmmmfreeenergysimulations
AT woodcockhlee goodthebadandtheuglyhipenanewdatasetforvalidatingsqmmmfreeenergysimulations