Cargando…
Surrogate Based Genetic Algorithm Method for Efficient Identification of Low-Energy Peptide Structures
[Image: see text] Identification of the most stable structure(s) of a system is a prerequisite for the calculation of any of its properties from first-principles. However, even for relatively small molecules, exhaustive explorations of the potential energy surface (PES) are severely hampered by the...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2023
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9933449/ https://www.ncbi.nlm.nih.gov/pubmed/36692853 http://dx.doi.org/10.1021/acs.jctc.2c01078 |
_version_ | 1784889679971090432 |
---|---|
author | Villard, Justin Kılıç, Murat Rothlisberger, Ursula |
author_facet | Villard, Justin Kılıç, Murat Rothlisberger, Ursula |
author_sort | Villard, Justin |
collection | PubMed |
description | [Image: see text] Identification of the most stable structure(s) of a system is a prerequisite for the calculation of any of its properties from first-principles. However, even for relatively small molecules, exhaustive explorations of the potential energy surface (PES) are severely hampered by the dimensionality bottleneck. In this work, we address the challenging task of efficiently sampling realistic low-lying peptide coordinates by resorting to a surrogate based genetic algorithm (GA)/density functional theory (DFT) approach (sGADFT) in which promising candidates provided by the GA are ultimately optimized with DFT. We provide a benchmark of several computational methods (GAFF, AMOEBApro13, PM6, PM7, DFTB3-D3(BJ)) as possible prescanning surrogates and apply sGADFT to two test case systems that are (i) two isomer families of the protonated Gly-Pro-Gly-Gly tetrapeptide ( A. Masson; J. Am. Soc. Mass Spectrom.2015, 26, 1444−145426091889) and (ii) the doubly protonated cyclic decapeptide gramicidin S ( N. S. Nagornova; J. Am. Chem. Soc.2010, 132, 4040−404120201525). We show that our GA procedure can correctly identify low-energy minima in as little as a few hours. Subsequent refinement of surrogate low-energy structures within a given energy threshold (≤10 kcal/mol (i), ≤5 kcal/mol (ii)) via DFT relaxation invariably led to the identification of the most stable structures as determined from high-resolution infrared (IR) spectroscopy at low temperature. The sGADFT method therefore constitutes a highly efficient route for the screening of realistic low-lying peptide structures in the gas phase as needed for instance for the interpretation and assignment of experimental IR spectra. |
format | Online Article Text |
id | pubmed-9933449 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-99334492023-02-17 Surrogate Based Genetic Algorithm Method for Efficient Identification of Low-Energy Peptide Structures Villard, Justin Kılıç, Murat Rothlisberger, Ursula J Chem Theory Comput [Image: see text] Identification of the most stable structure(s) of a system is a prerequisite for the calculation of any of its properties from first-principles. However, even for relatively small molecules, exhaustive explorations of the potential energy surface (PES) are severely hampered by the dimensionality bottleneck. In this work, we address the challenging task of efficiently sampling realistic low-lying peptide coordinates by resorting to a surrogate based genetic algorithm (GA)/density functional theory (DFT) approach (sGADFT) in which promising candidates provided by the GA are ultimately optimized with DFT. We provide a benchmark of several computational methods (GAFF, AMOEBApro13, PM6, PM7, DFTB3-D3(BJ)) as possible prescanning surrogates and apply sGADFT to two test case systems that are (i) two isomer families of the protonated Gly-Pro-Gly-Gly tetrapeptide ( A. Masson; J. Am. Soc. Mass Spectrom.2015, 26, 1444−145426091889) and (ii) the doubly protonated cyclic decapeptide gramicidin S ( N. S. Nagornova; J. Am. Chem. Soc.2010, 132, 4040−404120201525). We show that our GA procedure can correctly identify low-energy minima in as little as a few hours. Subsequent refinement of surrogate low-energy structures within a given energy threshold (≤10 kcal/mol (i), ≤5 kcal/mol (ii)) via DFT relaxation invariably led to the identification of the most stable structures as determined from high-resolution infrared (IR) spectroscopy at low temperature. The sGADFT method therefore constitutes a highly efficient route for the screening of realistic low-lying peptide structures in the gas phase as needed for instance for the interpretation and assignment of experimental IR spectra. American Chemical Society 2023-01-24 /pmc/articles/PMC9933449/ /pubmed/36692853 http://dx.doi.org/10.1021/acs.jctc.2c01078 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Villard, Justin Kılıç, Murat Rothlisberger, Ursula Surrogate Based Genetic Algorithm Method for Efficient Identification of Low-Energy Peptide Structures |
title | Surrogate Based
Genetic Algorithm Method for Efficient
Identification of Low-Energy Peptide Structures |
title_full | Surrogate Based
Genetic Algorithm Method for Efficient
Identification of Low-Energy Peptide Structures |
title_fullStr | Surrogate Based
Genetic Algorithm Method for Efficient
Identification of Low-Energy Peptide Structures |
title_full_unstemmed | Surrogate Based
Genetic Algorithm Method for Efficient
Identification of Low-Energy Peptide Structures |
title_short | Surrogate Based
Genetic Algorithm Method for Efficient
Identification of Low-Energy Peptide Structures |
title_sort | surrogate based
genetic algorithm method for efficient
identification of low-energy peptide structures |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9933449/ https://www.ncbi.nlm.nih.gov/pubmed/36692853 http://dx.doi.org/10.1021/acs.jctc.2c01078 |
work_keys_str_mv | AT villardjustin surrogatebasedgeneticalgorithmmethodforefficientidentificationoflowenergypeptidestructures AT kılıcmurat surrogatebasedgeneticalgorithmmethodforefficientidentificationoflowenergypeptidestructures AT rothlisbergerursula surrogatebasedgeneticalgorithmmethodforefficientidentificationoflowenergypeptidestructures |