Surrogate Based Genetic Algorithm Method for Efficient Identification of Low-Energy Peptide Structures

Bibliographic Details
Main authors: Villard, Justin, Kılıç, Murat, Rothlisberger, Ursula
Format: Online Article Text
Language: English
Published: American Chemical Society 2023
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9933449/
https://www.ncbi.nlm.nih.gov/pubmed/36692853
http://dx.doi.org/10.1021/acs.jctc.2c01078
Description
Summary: Identification of the most stable structure(s) of a system is a prerequisite for the calculation of any of its properties from first principles. However, even for relatively small molecules, exhaustive explorations of the potential energy surface (PES) are severely hampered by the dimensionality bottleneck. In this work, we address the challenging task of efficiently sampling realistic low-lying peptide coordinates by resorting to a surrogate based genetic algorithm (GA)/density functional theory (DFT) approach (sGADFT) in which promising candidates provided by the GA are ultimately optimized with DFT. We provide a benchmark of several computational methods (GAFF, AMOEBApro13, PM6, PM7, DFTB3-D3(BJ)) as possible prescanning surrogates and apply sGADFT to two test case systems that are (i) two isomer families of the protonated Gly-Pro-Gly-Gly tetrapeptide (A. Masson; J. Am. Soc. Mass Spectrom. 2015, 26, 1444−1454; PMID 26091889) and (ii) the doubly protonated cyclic decapeptide gramicidin S (N. S. Nagornova; J. Am. Chem. Soc. 2010, 132, 4040−4041; PMID 20201525). We show that our GA procedure can correctly identify low-energy minima in as little as a few hours. Subsequent refinement of surrogate low-energy structures within a given energy threshold (≤10 kcal/mol (i), ≤5 kcal/mol (ii)) via DFT relaxation invariably led to the identification of the most stable structures as determined from high-resolution infrared (IR) spectroscopy at low temperature. The sGADFT method therefore constitutes a highly efficient route for the screening of realistic low-lying peptide structures in the gas phase, as needed, for instance, for the interpretation and assignment of experimental IR spectra.
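
The two-stage idea in the summary (a cheap surrogate drives a GA search, and only candidates within an energy threshold of the surrogate minimum are passed on for expensive refinement) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the toy cosine energy standing in for a surrogate such as GAFF or PM7, the GA operators, and all parameter values are assumptions made for illustration, and energies are in arbitrary units rather than kcal/mol. The DFT relaxation step is not shown.

```python
import math
import random


def surrogate_energy(dihedrals):
    """Toy stand-in for a cheap surrogate energy (hypothetical, arbitrary units)."""
    return sum(
        1.0 - math.cos(phi - 1.0) + 0.5 * (1.0 - math.cos(3.0 * phi))
        for phi in dihedrals
    )


def run_ga(n_dihedrals=6, pop_size=30, generations=40, mutation_rate=0.2, seed=0):
    """Evolve dihedral-angle vectors; return (energy, dihedrals) pairs, lowest first."""
    rng = random.Random(seed)
    population = [
        [rng.uniform(-math.pi, math.pi) for _ in range(n_dihedrals)]
        for _ in range(pop_size)
    ]
    for _ in range(generations):
        population.sort(key=surrogate_energy)
        # Elitism: the fitter half survives unchanged, so the best energy never worsens.
        parents = population[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_dihedrals)  # one-point crossover
            child = a[:cut] + b[cut:]
            # Mutation: occasionally replace an angle with a fresh random value.
            child = [
                rng.uniform(-math.pi, math.pi) if rng.random() < mutation_rate else phi
                for phi in child
            ]
            children.append(child)
        population = parents + children
    return sorted((surrogate_energy(ind), ind) for ind in population)


def select_within_threshold(scored, threshold):
    """Keep candidates whose energy lies within `threshold` of the lowest one."""
    e_min = scored[0][0]
    return [(e, ind) for e, ind in scored if e - e_min <= threshold]


if __name__ == "__main__":
    scored = run_ga(seed=1)
    candidates = select_within_threshold(scored, threshold=1.0)
    # In a workflow like the one summarized above, these candidates would be
    # handed to the expensive refinement step (DFT relaxation, not shown here).
    print(f"{len(candidates)} of {len(scored)} candidates kept for refinement")
```

The threshold is applied relative to the lowest surrogate energy because surrogate and reference rankings can differ; a wider window keeps more candidates at the cost of more refinement runs.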