Cargando…
Sorting protein decoys by machine-learning-to-rank
Much progress has been made in Protein structure prediction during the last few decades. As the predicted models can span a broad range of accuracy spectrum, the accuracy of quality estimation becomes one of the key elements of successful protein structure prediction. Over the past years, a number o...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4987638/ https://www.ncbi.nlm.nih.gov/pubmed/27530967 http://dx.doi.org/10.1038/srep31571 |
_version_ | 1782448333581713408 |
---|---|
author | Jing, Xiaoyang Wang, Kai Lu, Ruqian Dong, Qiwen |
author_facet | Jing, Xiaoyang Wang, Kai Lu, Ruqian Dong, Qiwen |
author_sort | Jing, Xiaoyang |
collection | PubMed |
description | Much progress has been made in Protein structure prediction during the last few decades. As the predicted models can span a broad range of accuracy spectrum, the accuracy of quality estimation becomes one of the key elements of successful protein structure prediction. Over the past years, a number of methods have been developed to address this issue, and these methods could be roughly divided into three categories: the single-model methods, clustering-based methods and quasi single-model methods. In this study, we develop a single-model method MQAPRank based on the learning-to-rank algorithm firstly, and then implement a quasi single-model method Quasi-MQAPRank. The proposed methods are benchmarked on the 3DRobot and CASP11 dataset. The five-fold cross-validation on the 3DRobot dataset shows the proposed single model method outperforms other methods whose outputs are taken as features of the proposed method, and the quasi single-model method can further enhance the performance. On the CASP11 dataset, the proposed methods also perform well compared with other leading methods in corresponding categories. In particular, the Quasi-MQAPRank method achieves a considerable performance on the CASP11 Best150 dataset. |
format | Online Article Text |
id | pubmed-4987638 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-49876382016-08-30 Sorting protein decoys by machine-learning-to-rank Jing, Xiaoyang Wang, Kai Lu, Ruqian Dong, Qiwen Sci Rep Article Much progress has been made in Protein structure prediction during the last few decades. As the predicted models can span a broad range of accuracy spectrum, the accuracy of quality estimation becomes one of the key elements of successful protein structure prediction. Over the past years, a number of methods have been developed to address this issue, and these methods could be roughly divided into three categories: the single-model methods, clustering-based methods and quasi single-model methods. In this study, we develop a single-model method MQAPRank based on the learning-to-rank algorithm firstly, and then implement a quasi single-model method Quasi-MQAPRank. The proposed methods are benchmarked on the 3DRobot and CASP11 dataset. The five-fold cross-validation on the 3DRobot dataset shows the proposed single model method outperforms other methods whose outputs are taken as features of the proposed method, and the quasi single-model method can further enhance the performance. On the CASP11 dataset, the proposed methods also perform well compared with other leading methods in corresponding categories. In particular, the Quasi-MQAPRank method achieves a considerable performance on the CASP11 Best150 dataset. Nature Publishing Group 2016-08-17 /pmc/articles/PMC4987638/ /pubmed/27530967 http://dx.doi.org/10.1038/srep31571 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Jing, Xiaoyang Wang, Kai Lu, Ruqian Dong, Qiwen Sorting protein decoys by machine-learning-to-rank |
title | Sorting protein decoys by machine-learning-to-rank |
title_full | Sorting protein decoys by machine-learning-to-rank |
title_fullStr | Sorting protein decoys by machine-learning-to-rank |
title_full_unstemmed | Sorting protein decoys by machine-learning-to-rank |
title_short | Sorting protein decoys by machine-learning-to-rank |
title_sort | sorting protein decoys by machine-learning-to-rank |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4987638/ https://www.ncbi.nlm.nih.gov/pubmed/27530967 http://dx.doi.org/10.1038/srep31571 |
work_keys_str_mv | AT jingxiaoyang sortingproteindecoysbymachinelearningtorank AT wangkai sortingproteindecoysbymachinelearningtorank AT luruqian sortingproteindecoysbymachinelearningtorank AT dongqiwen sortingproteindecoysbymachinelearningtorank |