Cargando…

Surrogate Based Genetic Algorithm Method for Efficient Identification of Low-Energy Peptide Structures

[Image: see text] Identification of the most stable structure(s) of a system is a prerequisite for the calculation of any of its properties from first-principles. However, even for relatively small molecules, exhaustive explorations of the potential energy surface (PES) are severely hampered by the...

Descripción completa

Detalles Bibliográficos
Autores principales: Villard, Justin, Kılıç, Murat, Rothlisberger, Ursula
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9933449/
https://www.ncbi.nlm.nih.gov/pubmed/36692853
http://dx.doi.org/10.1021/acs.jctc.2c01078
_version_ 1784889679971090432
author Villard, Justin
Kılıç, Murat
Rothlisberger, Ursula
author_facet Villard, Justin
Kılıç, Murat
Rothlisberger, Ursula
author_sort Villard, Justin
collection PubMed
description [Image: see text] Identification of the most stable structure(s) of a system is a prerequisite for the calculation of any of its properties from first-principles. However, even for relatively small molecules, exhaustive explorations of the potential energy surface (PES) are severely hampered by the dimensionality bottleneck. In this work, we address the challenging task of efficiently sampling realistic low-lying peptide coordinates by resorting to a surrogate based genetic algorithm (GA)/density functional theory (DFT) approach (sGADFT) in which promising candidates provided by the GA are ultimately optimized with DFT. We provide a benchmark of several computational methods (GAFF, AMOEBApro13, PM6, PM7, DFTB3-D3(BJ)) as possible prescanning surrogates and apply sGADFT to two test case systems that are (i) two isomer families of the protonated Gly-Pro-Gly-Gly tetrapeptide ( A. Masson; J. Am. Soc. Mass Spectrom.2015, 26, 1444−145426091889) and (ii) the doubly protonated cyclic decapeptide gramicidin S ( N. S. Nagornova; J. Am. Chem. Soc.2010, 132, 4040−404120201525). We show that our GA procedure can correctly identify low-energy minima in as little as a few hours. Subsequent refinement of surrogate low-energy structures within a given energy threshold (≤10 kcal/mol (i), ≤5 kcal/mol (ii)) via DFT relaxation invariably led to the identification of the most stable structures as determined from high-resolution infrared (IR) spectroscopy at low temperature. The sGADFT method therefore constitutes a highly efficient route for the screening of realistic low-lying peptide structures in the gas phase as needed for instance for the interpretation and assignment of experimental IR spectra.
format Online
Article
Text
id pubmed-9933449
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-99334492023-02-17 Surrogate Based Genetic Algorithm Method for Efficient Identification of Low-Energy Peptide Structures Villard, Justin Kılıç, Murat Rothlisberger, Ursula J Chem Theory Comput [Image: see text] Identification of the most stable structure(s) of a system is a prerequisite for the calculation of any of its properties from first-principles. However, even for relatively small molecules, exhaustive explorations of the potential energy surface (PES) are severely hampered by the dimensionality bottleneck. In this work, we address the challenging task of efficiently sampling realistic low-lying peptide coordinates by resorting to a surrogate based genetic algorithm (GA)/density functional theory (DFT) approach (sGADFT) in which promising candidates provided by the GA are ultimately optimized with DFT. We provide a benchmark of several computational methods (GAFF, AMOEBApro13, PM6, PM7, DFTB3-D3(BJ)) as possible prescanning surrogates and apply sGADFT to two test case systems that are (i) two isomer families of the protonated Gly-Pro-Gly-Gly tetrapeptide ( A. Masson; J. Am. Soc. Mass Spectrom.2015, 26, 1444−145426091889) and (ii) the doubly protonated cyclic decapeptide gramicidin S ( N. S. Nagornova; J. Am. Chem. Soc.2010, 132, 4040−404120201525). We show that our GA procedure can correctly identify low-energy minima in as little as a few hours. Subsequent refinement of surrogate low-energy structures within a given energy threshold (≤10 kcal/mol (i), ≤5 kcal/mol (ii)) via DFT relaxation invariably led to the identification of the most stable structures as determined from high-resolution infrared (IR) spectroscopy at low temperature. The sGADFT method therefore constitutes a highly efficient route for the screening of realistic low-lying peptide structures in the gas phase as needed for instance for the interpretation and assignment of experimental IR spectra. American Chemical Society 2023-01-24 /pmc/articles/PMC9933449/ /pubmed/36692853 http://dx.doi.org/10.1021/acs.jctc.2c01078 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Villard, Justin
Kılıç, Murat
Rothlisberger, Ursula
Surrogate Based Genetic Algorithm Method for Efficient Identification of Low-Energy Peptide Structures
title Surrogate Based Genetic Algorithm Method for Efficient Identification of Low-Energy Peptide Structures
title_full Surrogate Based Genetic Algorithm Method for Efficient Identification of Low-Energy Peptide Structures
title_fullStr Surrogate Based Genetic Algorithm Method for Efficient Identification of Low-Energy Peptide Structures
title_full_unstemmed Surrogate Based Genetic Algorithm Method for Efficient Identification of Low-Energy Peptide Structures
title_short Surrogate Based Genetic Algorithm Method for Efficient Identification of Low-Energy Peptide Structures
title_sort surrogate based genetic algorithm method for efficient identification of low-energy peptide structures
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9933449/
https://www.ncbi.nlm.nih.gov/pubmed/36692853
http://dx.doi.org/10.1021/acs.jctc.2c01078
work_keys_str_mv AT villardjustin surrogatebasedgeneticalgorithmmethodforefficientidentificationoflowenergypeptidestructures
AT kılıcmurat surrogatebasedgeneticalgorithmmethodforefficientidentificationoflowenergypeptidestructures
AT rothlisbergerursula surrogatebasedgeneticalgorithmmethodforefficientidentificationoflowenergypeptidestructures