Cargando…

A random forest learning assisted “divide and conquer” approach for peptide conformation search

Computational determination of peptide conformations is challenging as it is a problem of finding minima in a high-dimensional space. The “divide and conquer” approach is promising for reliably reducing the search space size. A random forest learning model is proposed here to expand the scope of app...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Xin, Yang, Bing, Lin, Zijing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5995823/
https://www.ncbi.nlm.nih.gov/pubmed/29891960
http://dx.doi.org/10.1038/s41598-018-27167-w
_version_ 1783330678965796864
author Chen, Xin
Yang, Bing
Lin, Zijing
author_facet Chen, Xin
Yang, Bing
Lin, Zijing
author_sort Chen, Xin
collection PubMed
description Computational determination of peptide conformations is challenging as it is a problem of finding minima in a high-dimensional space. The “divide and conquer” approach is promising for reliably reducing the search space size. A random forest learning model is proposed here to expand the scope of applicability of the “divide and conquer” approach. A random forest classification algorithm is used to characterize the distributions of the backbone φ-ψ units (“words”). A random forest supervised learning model is developed to analyze the combinations of the φ-ψ units (“grammar”). It is found that amino acid residues may be grouped as equivalent “words”, while the φ-ψ combinations in low-energy peptide conformations follow a distinct “grammar”. The finding of equivalent words empowers the “divide and conquer” method with the flexibility of fragment substitution. The learnt grammar is used to improve the efficiency of the “divide and conquer” method by removing unfavorable φ-ψ combinations without the need of dedicated human effort. The machine learning assisted search method is illustrated by efficiently searching the conformations of GGG/AAA/GGGG/AAAA/GGGGG through assembling the structures of GFG/GFGG. Moreover, the computational cost of the new method is shown to increase rather slowly with the peptide length.
format Online
Article
Text
id pubmed-5995823
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-59958232018-06-21 A random forest learning assisted “divide and conquer” approach for peptide conformation search Chen, Xin Yang, Bing Lin, Zijing Sci Rep Article Computational determination of peptide conformations is challenging as it is a problem of finding minima in a high-dimensional space. The “divide and conquer” approach is promising for reliably reducing the search space size. A random forest learning model is proposed here to expand the scope of applicability of the “divide and conquer” approach. A random forest classification algorithm is used to characterize the distributions of the backbone φ-ψ units (“words”). A random forest supervised learning model is developed to analyze the combinations of the φ-ψ units (“grammar”). It is found that amino acid residues may be grouped as equivalent “words”, while the φ-ψ combinations in low-energy peptide conformations follow a distinct “grammar”. The finding of equivalent words empowers the “divide and conquer” method with the flexibility of fragment substitution. The learnt grammar is used to improve the efficiency of the “divide and conquer” method by removing unfavorable φ-ψ combinations without the need of dedicated human effort. The machine learning assisted search method is illustrated by efficiently searching the conformations of GGG/AAA/GGGG/AAAA/GGGGG through assembling the structures of GFG/GFGG. Moreover, the computational cost of the new method is shown to increase rather slowly with the peptide length. Nature Publishing Group UK 2018-06-11 /pmc/articles/PMC5995823/ /pubmed/29891960 http://dx.doi.org/10.1038/s41598-018-27167-w Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Chen, Xin
Yang, Bing
Lin, Zijing
A random forest learning assisted “divide and conquer” approach for peptide conformation search
title A random forest learning assisted “divide and conquer” approach for peptide conformation search
title_full A random forest learning assisted “divide and conquer” approach for peptide conformation search
title_fullStr A random forest learning assisted “divide and conquer” approach for peptide conformation search
title_full_unstemmed A random forest learning assisted “divide and conquer” approach for peptide conformation search
title_short A random forest learning assisted “divide and conquer” approach for peptide conformation search
title_sort random forest learning assisted “divide and conquer” approach for peptide conformation search
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5995823/
https://www.ncbi.nlm.nih.gov/pubmed/29891960
http://dx.doi.org/10.1038/s41598-018-27167-w
work_keys_str_mv AT chenxin arandomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch
AT yangbing arandomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch
AT linzijing arandomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch
AT chenxin randomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch
AT yangbing randomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch
AT linzijing randomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch