Cargando…
Identification of residue pairing in interacting β-strands from a predicted residue contact map
BACKGROUND: Despite the rapid progress of protein residue contact prediction, predicted residue contact maps frequently contain many errors. However, information of residue pairing in β strands could be extracted from a noisy contact map, due to the presence of characteristic contact patterns in β-β...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5907701/ https://www.ncbi.nlm.nih.gov/pubmed/29673311 http://dx.doi.org/10.1186/s12859-018-2150-1 |
_version_ | 1783315587797090304 |
---|---|
author | Mao, Wenzhi Wang, Tong Zhang, Wenxuan Gong, Haipeng |
author_facet | Mao, Wenzhi Wang, Tong Zhang, Wenxuan Gong, Haipeng |
author_sort | Mao, Wenzhi |
collection | PubMed |
description | BACKGROUND: Despite the rapid progress of protein residue contact prediction, predicted residue contact maps frequently contain many errors. However, information of residue pairing in β strands could be extracted from a noisy contact map, due to the presence of characteristic contact patterns in β-β interactions. This information may benefit the tertiary structure prediction of mainly β proteins. In this work, we propose a novel ridge-detection-based β-β contact predictor to identify residue pairing in β strands from any predicted residue contact map. RESULTS: Our algorithm RDb(2)C adopts ridge detection, a well-developed technique in computer image processing, to capture consecutive residue contacts, and then utilizes a novel multi-stage random forest framework to integrate the ridge information and additional features for prediction. Starting from the predicted contact map of CCMpred, RDb(2)C remarkably outperforms all state-of-the-art methods on two conventional test sets of β proteins (BetaSheet916 and BetaSheet1452), and achieves F1-scores of ~ 62% and ~ 76% at the residue level and strand level, respectively. Taking the prediction of the more advanced RaptorX-Contact as input, RDb(2)C achieves impressively higher performance, with F1-scores reaching ~ 76% and ~ 86% at the residue level and strand level, respectively. In a test of structural modeling using the top 1 L predicted contacts as constraints, for 61 mainly β proteins, the average TM-score achieves 0.442 when using the raw RaptorX-Contact prediction, but increases to 0.506 when using the improved prediction by RDb(2)C. CONCLUSION: Our method can significantly improve the prediction of β-β contacts from any predicted residue contact maps. Prediction results of our algorithm could be directly applied to effectively facilitate the practical structure prediction of mainly β proteins. AVAILABILITY: All source data and codes are available at http://166.111.152.91/Downloads.html or the GitHub address of https://github.com/wzmao/RDb2C. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2150-1) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5907701 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-59077012018-04-30 Identification of residue pairing in interacting β-strands from a predicted residue contact map Mao, Wenzhi Wang, Tong Zhang, Wenxuan Gong, Haipeng BMC Bioinformatics Methodology Article BACKGROUND: Despite the rapid progress of protein residue contact prediction, predicted residue contact maps frequently contain many errors. However, information of residue pairing in β strands could be extracted from a noisy contact map, due to the presence of characteristic contact patterns in β-β interactions. This information may benefit the tertiary structure prediction of mainly β proteins. In this work, we propose a novel ridge-detection-based β-β contact predictor to identify residue pairing in β strands from any predicted residue contact map. RESULTS: Our algorithm RDb(2)C adopts ridge detection, a well-developed technique in computer image processing, to capture consecutive residue contacts, and then utilizes a novel multi-stage random forest framework to integrate the ridge information and additional features for prediction. Starting from the predicted contact map of CCMpred, RDb(2)C remarkably outperforms all state-of-the-art methods on two conventional test sets of β proteins (BetaSheet916 and BetaSheet1452), and achieves F1-scores of ~ 62% and ~ 76% at the residue level and strand level, respectively. Taking the prediction of the more advanced RaptorX-Contact as input, RDb(2)C achieves impressively higher performance, with F1-scores reaching ~ 76% and ~ 86% at the residue level and strand level, respectively. In a test of structural modeling using the top 1 L predicted contacts as constraints, for 61 mainly β proteins, the average TM-score achieves 0.442 when using the raw RaptorX-Contact prediction, but increases to 0.506 when using the improved prediction by RDb(2)C. CONCLUSION: Our method can significantly improve the prediction of β-β contacts from any predicted residue contact maps. Prediction results of our algorithm could be directly applied to effectively facilitate the practical structure prediction of mainly β proteins. AVAILABILITY: All source data and codes are available at http://166.111.152.91/Downloads.html or the GitHub address of https://github.com/wzmao/RDb2C. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2150-1) contains supplementary material, which is available to authorized users. BioMed Central 2018-04-19 /pmc/articles/PMC5907701/ /pubmed/29673311 http://dx.doi.org/10.1186/s12859-018-2150-1 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Mao, Wenzhi Wang, Tong Zhang, Wenxuan Gong, Haipeng Identification of residue pairing in interacting β-strands from a predicted residue contact map |
title | Identification of residue pairing in interacting β-strands from a predicted residue contact map |
title_full | Identification of residue pairing in interacting β-strands from a predicted residue contact map |
title_fullStr | Identification of residue pairing in interacting β-strands from a predicted residue contact map |
title_full_unstemmed | Identification of residue pairing in interacting β-strands from a predicted residue contact map |
title_short | Identification of residue pairing in interacting β-strands from a predicted residue contact map |
title_sort | identification of residue pairing in interacting β-strands from a predicted residue contact map |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5907701/ https://www.ncbi.nlm.nih.gov/pubmed/29673311 http://dx.doi.org/10.1186/s12859-018-2150-1 |
work_keys_str_mv | AT maowenzhi identificationofresiduepairingininteractingbstrandsfromapredictedresiduecontactmap AT wangtong identificationofresiduepairingininteractingbstrandsfromapredictedresiduecontactmap AT zhangwenxuan identificationofresiduepairingininteractingbstrandsfromapredictedresiduecontactmap AT gonghaipeng identificationofresiduepairingininteractingbstrandsfromapredictedresiduecontactmap |