Cargando…

Identification of residue pairing in interacting β-strands from a predicted residue contact map

BACKGROUND: Despite the rapid progress of protein residue contact prediction, predicted residue contact maps frequently contain many errors. However, information of residue pairing in β strands could be extracted from a noisy contact map, due to the presence of characteristic contact patterns in β-β...

Descripción completa

Detalles Bibliográficos
Autores principales: Mao, Wenzhi, Wang, Tong, Zhang, Wenxuan, Gong, Haipeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5907701/
https://www.ncbi.nlm.nih.gov/pubmed/29673311
http://dx.doi.org/10.1186/s12859-018-2150-1
_version_ 1783315587797090304
author Mao, Wenzhi
Wang, Tong
Zhang, Wenxuan
Gong, Haipeng
author_facet Mao, Wenzhi
Wang, Tong
Zhang, Wenxuan
Gong, Haipeng
author_sort Mao, Wenzhi
collection PubMed
description BACKGROUND: Despite the rapid progress of protein residue contact prediction, predicted residue contact maps frequently contain many errors. However, information of residue pairing in β strands could be extracted from a noisy contact map, due to the presence of characteristic contact patterns in β-β interactions. This information may benefit the tertiary structure prediction of mainly β proteins. In this work, we propose a novel ridge-detection-based β-β contact predictor to identify residue pairing in β strands from any predicted residue contact map. RESULTS: Our algorithm RDb(2)C adopts ridge detection, a well-developed technique in computer image processing, to capture consecutive residue contacts, and then utilizes a novel multi-stage random forest framework to integrate the ridge information and additional features for prediction. Starting from the predicted contact map of CCMpred, RDb(2)C remarkably outperforms all state-of-the-art methods on two conventional test sets of β proteins (BetaSheet916 and BetaSheet1452), and achieves F1-scores of ~ 62% and ~ 76% at the residue level and strand level, respectively. Taking the prediction of the more advanced RaptorX-Contact as input, RDb(2)C achieves impressively higher performance, with F1-scores reaching ~ 76% and ~ 86% at the residue level and strand level, respectively. In a test of structural modeling using the top 1 L predicted contacts as constraints, for 61 mainly β proteins, the average TM-score achieves 0.442 when using the raw RaptorX-Contact prediction, but increases to 0.506 when using the improved prediction by RDb(2)C. CONCLUSION: Our method can significantly improve the prediction of β-β contacts from any predicted residue contact maps. Prediction results of our algorithm could be directly applied to effectively facilitate the practical structure prediction of mainly β proteins. AVAILABILITY: All source data and codes are available at http://166.111.152.91/Downloads.html or the GitHub address of https://github.com/wzmao/RDb2C. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2150-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5907701
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59077012018-04-30 Identification of residue pairing in interacting β-strands from a predicted residue contact map Mao, Wenzhi Wang, Tong Zhang, Wenxuan Gong, Haipeng BMC Bioinformatics Methodology Article BACKGROUND: Despite the rapid progress of protein residue contact prediction, predicted residue contact maps frequently contain many errors. However, information of residue pairing in β strands could be extracted from a noisy contact map, due to the presence of characteristic contact patterns in β-β interactions. This information may benefit the tertiary structure prediction of mainly β proteins. In this work, we propose a novel ridge-detection-based β-β contact predictor to identify residue pairing in β strands from any predicted residue contact map. RESULTS: Our algorithm RDb(2)C adopts ridge detection, a well-developed technique in computer image processing, to capture consecutive residue contacts, and then utilizes a novel multi-stage random forest framework to integrate the ridge information and additional features for prediction. Starting from the predicted contact map of CCMpred, RDb(2)C remarkably outperforms all state-of-the-art methods on two conventional test sets of β proteins (BetaSheet916 and BetaSheet1452), and achieves F1-scores of ~ 62% and ~ 76% at the residue level and strand level, respectively. Taking the prediction of the more advanced RaptorX-Contact as input, RDb(2)C achieves impressively higher performance, with F1-scores reaching ~ 76% and ~ 86% at the residue level and strand level, respectively. In a test of structural modeling using the top 1 L predicted contacts as constraints, for 61 mainly β proteins, the average TM-score achieves 0.442 when using the raw RaptorX-Contact prediction, but increases to 0.506 when using the improved prediction by RDb(2)C. CONCLUSION: Our method can significantly improve the prediction of β-β contacts from any predicted residue contact maps. Prediction results of our algorithm could be directly applied to effectively facilitate the practical structure prediction of mainly β proteins. AVAILABILITY: All source data and codes are available at http://166.111.152.91/Downloads.html or the GitHub address of https://github.com/wzmao/RDb2C. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2150-1) contains supplementary material, which is available to authorized users. BioMed Central 2018-04-19 /pmc/articles/PMC5907701/ /pubmed/29673311 http://dx.doi.org/10.1186/s12859-018-2150-1 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Mao, Wenzhi
Wang, Tong
Zhang, Wenxuan
Gong, Haipeng
Identification of residue pairing in interacting β-strands from a predicted residue contact map
title Identification of residue pairing in interacting β-strands from a predicted residue contact map
title_full Identification of residue pairing in interacting β-strands from a predicted residue contact map
title_fullStr Identification of residue pairing in interacting β-strands from a predicted residue contact map
title_full_unstemmed Identification of residue pairing in interacting β-strands from a predicted residue contact map
title_short Identification of residue pairing in interacting β-strands from a predicted residue contact map
title_sort identification of residue pairing in interacting β-strands from a predicted residue contact map
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5907701/
https://www.ncbi.nlm.nih.gov/pubmed/29673311
http://dx.doi.org/10.1186/s12859-018-2150-1
work_keys_str_mv AT maowenzhi identificationofresiduepairingininteractingbstrandsfromapredictedresiduecontactmap
AT wangtong identificationofresiduepairingininteractingbstrandsfromapredictedresiduecontactmap
AT zhangwenxuan identificationofresiduepairingininteractingbstrandsfromapredictedresiduecontactmap
AT gonghaipeng identificationofresiduepairingininteractingbstrandsfromapredictedresiduecontactmap