Cargando…

SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences

MOTIVATION: Accurate predictions of protein-binding residues (PBRs) enhances understanding of molecular-level rules governing protein–protein interactions, helps protein–protein docking and facilitates annotation of protein functions. Recent studies show that current sequence-based predictors of PBR...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Jian, Kurgan, Lukasz
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2019
Materias:	Ismb/Eccb 2019 Conference Proceedings
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612887/ https://www.ncbi.nlm.nih.gov/pubmed/31510679 http://dx.doi.org/10.1093/bioinformatics/btz324

_version_	1783432958668963840
author	Zhang, Jian Kurgan, Lukasz
author_facet	Zhang, Jian Kurgan, Lukasz
author_sort	Zhang, Jian
collection	PubMed
description	MOTIVATION: Accurate predictions of protein-binding residues (PBRs) enhances understanding of molecular-level rules governing protein–protein interactions, helps protein–protein docking and facilitates annotation of protein functions. Recent studies show that current sequence-based predictors of PBRs severely cross-predict residues that interact with other types of protein partners (e.g. RNA and DNA) as PBRs. Moreover, these methods are relatively slow, prohibiting genome-scale use. RESULTS: We propose a novel, accurate and fast sequence-based predictor of PBRs that minimizes the cross-predictions. Our SCRIBER (SeleCtive pRoteIn-Binding rEsidue pRedictor) method takes advantage of three innovations: comprehensive dataset that covers multiple types of binding residues, novel types of inputs that are relevant to the prediction of PBRs, and an architecture that is tailored to reduce the cross-predictions. The dataset includes complete protein chains and offers improved coverage of binding annotations that are transferred from multiple protein–protein complexes. We utilize innovative two-layer architecture where the first layer generates a prediction of protein-binding, RNA-binding, DNA-binding and small ligand-binding residues. The second layer re-predicts PBRs by reducing overlap between PBRs and the other types of binding residues produced in the first layer. Empirical tests on an independent test dataset reveal that SCRIBER significantly outperforms current predictors and that all three innovations contribute to its high predictive performance. SCRIBER reduces cross-predictions by between 41% and 69% and our conservative estimates show that it is at least 3 times faster. We provide putative PBRs produced by SCRIBER for the entire human proteome and use these results to hypothesize that about 14% of currently known human protein domains bind proteins. AVAILABILITY AND IMPLEMENTATION: SCRIBER webserver is available at http://biomine.cs.vcu.edu/servers/SCRIBER/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format	Online Article Text
id	pubmed-6612887
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-66128872019-07-12 SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences Zhang, Jian Kurgan, Lukasz Bioinformatics Ismb/Eccb 2019 Conference Proceedings MOTIVATION: Accurate predictions of protein-binding residues (PBRs) enhances understanding of molecular-level rules governing protein–protein interactions, helps protein–protein docking and facilitates annotation of protein functions. Recent studies show that current sequence-based predictors of PBRs severely cross-predict residues that interact with other types of protein partners (e.g. RNA and DNA) as PBRs. Moreover, these methods are relatively slow, prohibiting genome-scale use. RESULTS: We propose a novel, accurate and fast sequence-based predictor of PBRs that minimizes the cross-predictions. Our SCRIBER (SeleCtive pRoteIn-Binding rEsidue pRedictor) method takes advantage of three innovations: comprehensive dataset that covers multiple types of binding residues, novel types of inputs that are relevant to the prediction of PBRs, and an architecture that is tailored to reduce the cross-predictions. The dataset includes complete protein chains and offers improved coverage of binding annotations that are transferred from multiple protein–protein complexes. We utilize innovative two-layer architecture where the first layer generates a prediction of protein-binding, RNA-binding, DNA-binding and small ligand-binding residues. The second layer re-predicts PBRs by reducing overlap between PBRs and the other types of binding residues produced in the first layer. Empirical tests on an independent test dataset reveal that SCRIBER significantly outperforms current predictors and that all three innovations contribute to its high predictive performance. SCRIBER reduces cross-predictions by between 41% and 69% and our conservative estimates show that it is at least 3 times faster. We provide putative PBRs produced by SCRIBER for the entire human proteome and use these results to hypothesize that about 14% of currently known human protein domains bind proteins. AVAILABILITY AND IMPLEMENTATION: SCRIBER webserver is available at http://biomine.cs.vcu.edu/servers/SCRIBER/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-07 2019-07-05 /pmc/articles/PMC6612887/ /pubmed/31510679 http://dx.doi.org/10.1093/bioinformatics/btz324 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle	Ismb/Eccb 2019 Conference Proceedings Zhang, Jian Kurgan, Lukasz SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences
title	SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences
title_full	SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences
title_fullStr	SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences
title_full_unstemmed	SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences
title_short	SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences
title_sort	scriber: accurate and partner type-specific prediction of protein-binding residues from proteins sequences
topic	Ismb/Eccb 2019 Conference Proceedings
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612887/ https://www.ncbi.nlm.nih.gov/pubmed/31510679 http://dx.doi.org/10.1093/bioinformatics/btz324
work_keys_str_mv	AT zhangjian scriberaccurateandpartnertypespecificpredictionofproteinbindingresiduesfromproteinssequences AT kurganlukasz scriberaccurateandpartnertypespecificpredictionofproteinbindingresiduesfromproteinssequences

SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences

Ejemplares similares