Cargando…
RRCRank: a fusion method using rank strategy for residue-residue contact prediction
BACKGROUND: In structural biology area, protein residue-residue contacts play a crucial role in protein structure prediction. Some researchers have found that the predicted residue-residue contacts could effectively constrain the conformational search space, which is significant for de novo protein...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5581475/ https://www.ncbi.nlm.nih.gov/pubmed/28865433 http://dx.doi.org/10.1186/s12859-017-1811-9 |
_version_ | 1783261056854917120 |
---|---|
author | Jing, Xiaoyang Dong, Qiwen Lu, Ruqian |
author_facet | Jing, Xiaoyang Dong, Qiwen Lu, Ruqian |
author_sort | Jing, Xiaoyang |
collection | PubMed |
description | BACKGROUND: In structural biology area, protein residue-residue contacts play a crucial role in protein structure prediction. Some researchers have found that the predicted residue-residue contacts could effectively constrain the conformational search space, which is significant for de novo protein structure prediction. In the last few decades, related researchers have developed various methods to predict residue-residue contacts, especially, significant performance has been achieved by using fusion methods in recent years. In this work, a novel fusion method based on rank strategy has been proposed to predict contacts. Unlike the traditional regression or classification strategies, the contact prediction task is regarded as a ranking task. First, two kinds of features are extracted from correlated mutations methods and ensemble machine-learning classifiers, and then the proposed method uses the learning-to-rank algorithm to predict contact probability of each residue pair. RESULTS: First, we perform two benchmark tests for the proposed fusion method (RRCRank) on CASP11 dataset and CASP12 dataset respectively. The test results show that the RRCRank method outperforms other well-developed methods, especially for medium and short range contacts. Second, in order to verify the superiority of ranking strategy, we predict contacts by using the traditional regression and classification strategies based on the same features as ranking strategy. Compared with these two traditional strategies, the proposed ranking strategy shows better performance for three contact types, in particular for long range contacts. Third, the proposed RRCRank has been compared with several state-of-the-art methods in CASP11 and CASP12. The results show that the RRCRank could achieve comparable prediction precisions and is better than three methods in most assessment metrics. CONCLUSIONS: The learning-to-rank algorithm is introduced to develop a novel rank-based method for the residue-residue contact prediction of proteins, which achieves state-of-the-art performance based on the extensive assessment. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1811-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5581475 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-55814752017-09-06 RRCRank: a fusion method using rank strategy for residue-residue contact prediction Jing, Xiaoyang Dong, Qiwen Lu, Ruqian BMC Bioinformatics Research Article BACKGROUND: In structural biology area, protein residue-residue contacts play a crucial role in protein structure prediction. Some researchers have found that the predicted residue-residue contacts could effectively constrain the conformational search space, which is significant for de novo protein structure prediction. In the last few decades, related researchers have developed various methods to predict residue-residue contacts, especially, significant performance has been achieved by using fusion methods in recent years. In this work, a novel fusion method based on rank strategy has been proposed to predict contacts. Unlike the traditional regression or classification strategies, the contact prediction task is regarded as a ranking task. First, two kinds of features are extracted from correlated mutations methods and ensemble machine-learning classifiers, and then the proposed method uses the learning-to-rank algorithm to predict contact probability of each residue pair. RESULTS: First, we perform two benchmark tests for the proposed fusion method (RRCRank) on CASP11 dataset and CASP12 dataset respectively. The test results show that the RRCRank method outperforms other well-developed methods, especially for medium and short range contacts. Second, in order to verify the superiority of ranking strategy, we predict contacts by using the traditional regression and classification strategies based on the same features as ranking strategy. Compared with these two traditional strategies, the proposed ranking strategy shows better performance for three contact types, in particular for long range contacts. Third, the proposed RRCRank has been compared with several state-of-the-art methods in CASP11 and CASP12. The results show that the RRCRank could achieve comparable prediction precisions and is better than three methods in most assessment metrics. CONCLUSIONS: The learning-to-rank algorithm is introduced to develop a novel rank-based method for the residue-residue contact prediction of proteins, which achieves state-of-the-art performance based on the extensive assessment. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1811-9) contains supplementary material, which is available to authorized users. BioMed Central 2017-09-02 /pmc/articles/PMC5581475/ /pubmed/28865433 http://dx.doi.org/10.1186/s12859-017-1811-9 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Jing, Xiaoyang Dong, Qiwen Lu, Ruqian RRCRank: a fusion method using rank strategy for residue-residue contact prediction |
title | RRCRank: a fusion method using rank strategy for residue-residue contact prediction |
title_full | RRCRank: a fusion method using rank strategy for residue-residue contact prediction |
title_fullStr | RRCRank: a fusion method using rank strategy for residue-residue contact prediction |
title_full_unstemmed | RRCRank: a fusion method using rank strategy for residue-residue contact prediction |
title_short | RRCRank: a fusion method using rank strategy for residue-residue contact prediction |
title_sort | rrcrank: a fusion method using rank strategy for residue-residue contact prediction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5581475/ https://www.ncbi.nlm.nih.gov/pubmed/28865433 http://dx.doi.org/10.1186/s12859-017-1811-9 |
work_keys_str_mv | AT jingxiaoyang rrcrankafusionmethodusingrankstrategyforresidueresiduecontactprediction AT dongqiwen rrcrankafusionmethodusingrankstrategyforresidueresiduecontactprediction AT luruqian rrcrankafusionmethodusingrankstrategyforresidueresiduecontactprediction |