Cargando…

The Good, the Bad, and the Ugly: “HiPen”, a New Dataset for Validating (S)QM/MM Free Energy Simulations

Indirect (S)QM/MM free energy simulations (FES) are vital to efficiently incorporating sufficient sampling and accurate (QM) energetic evaluations when estimating free energies of practical/experimental interest. Connecting between levels of theory, i.e., calculating [Formula: see text] , remains to...

Descripción completa

Detalles Bibliográficos
Autores principales: Kearns, Fiona L., Warrensford, Luke, Boresch, Stefan, Woodcock, H. Lee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6413162/
https://www.ncbi.nlm.nih.gov/pubmed/30769826
http://dx.doi.org/10.3390/molecules24040681
Descripción
Sumario:Indirect (S)QM/MM free energy simulations (FES) are vital to efficiently incorporating sufficient sampling and accurate (QM) energetic evaluations when estimating free energies of practical/experimental interest. Connecting between levels of theory, i.e., calculating [Formula: see text] , remains to be the most challenging step within an indirect FES protocol. To improve calculations of [Formula: see text] , we must: (1) compare the performance of all FES methods currently available; and (2) compile and maintain datasets of [Formula: see text] calculated for a wide-variety of molecules so that future practitioners may replicate or improve upon the current state-of-the-art. Towards these two aims, we introduce a new dataset, “HiPen”, which tabulates [Formula: see text] (the free energy associated with switching from an [Formula: see text] to an [Formula: see text] molecular description using the 3ob parameter set in gas phase), calculated for 22 drug-like small molecules. We compare the calculation of this value using free energy perturbation, Bennett’s acceptance ratio, Jarzynski’s equation, and Crooks’ equation. We also predict the reliability of each calculated [Formula: see text] by evaluating several convergence criteria including sample size hysteresis, overlap statistics, and bias metric ([Formula: see text]). Within the total dataset, three distinct categories of molecules emerge: the “good” molecules, for which we can obtain converged [Formula: see text] using Jarzynski’s equation; “bad” molecules which require Crooks’ equation to obtain a converged [Formula: see text]; and “ugly” molecules for which we cannot obtain reliably converged [Formula: see text] with either Jarzynski’s or Crooks’ equations. We discuss, in depth, results from several example molecules in each of these categories and describe how dihedral discrepancies between levels of theory cause convergence failures even for these gas phase free energy simulations.