Cargando…
A random forest learning assisted “divide and conquer” approach for peptide conformation search
Computational determination of peptide conformations is challenging as it is a problem of finding minima in a high-dimensional space. The “divide and conquer” approach is promising for reliably reducing the search space size. A random forest learning model is proposed here to expand the scope of app...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5995823/ https://www.ncbi.nlm.nih.gov/pubmed/29891960 http://dx.doi.org/10.1038/s41598-018-27167-w |
_version_ | 1783330678965796864 |
---|---|
author | Chen, Xin Yang, Bing Lin, Zijing |
author_facet | Chen, Xin Yang, Bing Lin, Zijing |
author_sort | Chen, Xin |
collection | PubMed |
description | Computational determination of peptide conformations is challenging as it is a problem of finding minima in a high-dimensional space. The “divide and conquer” approach is promising for reliably reducing the search space size. A random forest learning model is proposed here to expand the scope of applicability of the “divide and conquer” approach. A random forest classification algorithm is used to characterize the distributions of the backbone φ-ψ units (“words”). A random forest supervised learning model is developed to analyze the combinations of the φ-ψ units (“grammar”). It is found that amino acid residues may be grouped as equivalent “words”, while the φ-ψ combinations in low-energy peptide conformations follow a distinct “grammar”. The finding of equivalent words empowers the “divide and conquer” method with the flexibility of fragment substitution. The learnt grammar is used to improve the efficiency of the “divide and conquer” method by removing unfavorable φ-ψ combinations without the need of dedicated human effort. The machine learning assisted search method is illustrated by efficiently searching the conformations of GGG/AAA/GGGG/AAAA/GGGGG through assembling the structures of GFG/GFGG. Moreover, the computational cost of the new method is shown to increase rather slowly with the peptide length. |
format | Online Article Text |
id | pubmed-5995823 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-59958232018-06-21 A random forest learning assisted “divide and conquer” approach for peptide conformation search Chen, Xin Yang, Bing Lin, Zijing Sci Rep Article Computational determination of peptide conformations is challenging as it is a problem of finding minima in a high-dimensional space. The “divide and conquer” approach is promising for reliably reducing the search space size. A random forest learning model is proposed here to expand the scope of applicability of the “divide and conquer” approach. A random forest classification algorithm is used to characterize the distributions of the backbone φ-ψ units (“words”). A random forest supervised learning model is developed to analyze the combinations of the φ-ψ units (“grammar”). It is found that amino acid residues may be grouped as equivalent “words”, while the φ-ψ combinations in low-energy peptide conformations follow a distinct “grammar”. The finding of equivalent words empowers the “divide and conquer” method with the flexibility of fragment substitution. The learnt grammar is used to improve the efficiency of the “divide and conquer” method by removing unfavorable φ-ψ combinations without the need of dedicated human effort. The machine learning assisted search method is illustrated by efficiently searching the conformations of GGG/AAA/GGGG/AAAA/GGGGG through assembling the structures of GFG/GFGG. Moreover, the computational cost of the new method is shown to increase rather slowly with the peptide length. Nature Publishing Group UK 2018-06-11 /pmc/articles/PMC5995823/ /pubmed/29891960 http://dx.doi.org/10.1038/s41598-018-27167-w Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Chen, Xin Yang, Bing Lin, Zijing A random forest learning assisted “divide and conquer” approach for peptide conformation search |
title | A random forest learning assisted “divide and conquer” approach for peptide conformation search |
title_full | A random forest learning assisted “divide and conquer” approach for peptide conformation search |
title_fullStr | A random forest learning assisted “divide and conquer” approach for peptide conformation search |
title_full_unstemmed | A random forest learning assisted “divide and conquer” approach for peptide conformation search |
title_short | A random forest learning assisted “divide and conquer” approach for peptide conformation search |
title_sort | random forest learning assisted “divide and conquer” approach for peptide conformation search |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5995823/ https://www.ncbi.nlm.nih.gov/pubmed/29891960 http://dx.doi.org/10.1038/s41598-018-27167-w |
work_keys_str_mv | AT chenxin arandomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch AT yangbing arandomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch AT linzijing arandomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch AT chenxin randomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch AT yangbing randomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch AT linzijing randomforestlearningassisteddivideandconquerapproachforpeptideconformationsearch |