Cargando…

A random forest learning assisted “divide and conquer” approach for peptide conformation search

Computational determination of peptide conformations is challenging as it is a problem of finding minima in a high-dimensional space. The “divide and conquer” approach is promising for reliably reducing the search space size. A random forest learning model is proposed here to expand the scope of app...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chen, Xin, Yang, Bing, Lin, Zijing
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2018
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5995823/ https://www.ncbi.nlm.nih.gov/pubmed/29891960 http://dx.doi.org/10.1038/s41598-018-27167-w

_version_	1783330678965796864
author	Chen, Xin Yang, Bing Lin, Zijing
author_facet	Chen, Xin Yang, Bing Lin, Zijing
author_sort	Chen, Xin
collection	PubMed
description	Computational determination of peptide conformations is challenging as it is a problem of finding minima in a high-dimensional space. The “divide and conquer” approach is promising for reliably reducing the search space size. A random forest learning model is proposed here to expand the scope of applicability of the “divide and conquer” approach. A random forest classification algorithm is used to characterize the distributions of the backbone φ-ψ units (“words”). A random forest supervised learning model is developed to analyze the combinations of the φ-ψ units (“grammar”). It is found that amino acid residues may be grouped as equivalent “words”, while the φ-ψ combinations in low-energy peptide conformations follow a distinct “grammar”. The finding of equivalent words empowers the “divide and conquer” method with the flexibility of fragment substitution. The learnt grammar is used to improve the efficiency of the “divide and conquer” method by removing unfavorable φ-ψ combinations without the need of dedicated human effort. The machine learning assisted search method is illustrated by efficiently searching the conformations of GGG/AAA/GGGG/AAAA/GGGGG through assembling the structures of GFG/GFGG. Moreover, the computational cost of the new method is shown to increase rather slowly with the peptide length.
format	Online Article Text
id	pubmed-5995823
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-59958232018-06-21 A random forest learning assisted “divide and conquer” approach for peptide conformation search Chen, Xin Yang, Bing Lin, Zijing Sci Rep Article Computational determination of peptide conformations is challenging as it is a problem of finding minima in a high-dimensional space. The “divide and conquer” approach is promising for reliably reducing the search space size. A random forest learning model is proposed here to expand the scope of applicability of the “divide and conquer” approach. A random forest classification algorithm is used to characterize the distributions of the backbone φ-ψ units (“words”). A random forest supervised learning model is developed to analyze the combinations of the φ-ψ units (“grammar”). It is found that amino acid residues may be grouped as equivalent “words”, while the φ-ψ combinations in low-energy peptide conformations follow a distinct “grammar”. The finding of equivalent words empowers the “divide and conquer” method with the flexibility of fragment substitution. The learnt grammar is used to improve the efficiency of the “divide and conquer” method by removing unfavorable φ-ψ combinations without the need of dedicated human effort. The machine learning assisted search method is illustrated by efficiently searching the conformations of GGG/AAA/GGGG/AAAA/GGGGG through assembling the structures of GFG/GFGG. Moreover, the computational cost of the new method is shown to increase rather slowly with the peptide length. Nature Publishing Group UK 2018-06-11 /pmc/articles/PMC5995823/ /pubmed/29891960 http://dx.doi.org/10.1038/s41598-018-27167-w Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle	Article Chen, Xin Yang, Bing Lin, Zijing A random forest learning assisted “divide and conquer” approach for peptide conformation search
title	A random forest learning assisted “divide and conquer” approach for peptide conformation search
title_full	A random forest learning assisted “divide and conquer” approach for peptide conformation search
title_fullStr	A random forest learning assisted “divide and conquer” approach for peptide conformation search
title_full_unstemmed	A random forest learning assisted “divide and conquer” approach for peptide conformation search
title_short	A random forest learning assisted “divide and conquer” approach for peptide conformation search
title_sort	random forest learning assisted “divide and conquer” approach for peptide conformation search
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5995823/ https://www.ncbi.nlm.nih.gov/pubmed/29891960 http://dx.doi.org/10.1038/s41598-018-27167-w
work_keys_str_mv	AT chenxin arandomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch AT yangbing arandomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch AT linzijing arandomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch AT chenxin randomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch AT yangbing randomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch AT linzijing randomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch

A random forest learning assisted “divide and conquer” approach for peptide conformation search

Ejemplares similares