Cargando…

Predicting proteolytic sites in extracellular proteins: only halfway there

Motivation: Many secretory proteins are synthesized as inactive precursors that must undergo post-translational proteolysis in order to mature and become active. In the current study, we address the challenge of sequence-based discovery of proteolytic sites in secreted proteins using machine learnin...

Descripción completa

Detalles Bibliográficos
Autores principales: Kliger, Yossef, Gofer, Eyal, Wool, Assaf, Toporik, Amir, Apatoff, Avihay, Olshansky, Moshe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7109841/
https://www.ncbi.nlm.nih.gov/pubmed/18321887
http://dx.doi.org/10.1093/bioinformatics/btn084
_version_ 1783513003883233280
author Kliger, Yossef
Gofer, Eyal
Wool, Assaf
Toporik, Amir
Apatoff, Avihay
Olshansky, Moshe
author_facet Kliger, Yossef
Gofer, Eyal
Wool, Assaf
Toporik, Amir
Apatoff, Avihay
Olshansky, Moshe
author_sort Kliger, Yossef
collection PubMed
description Motivation: Many secretory proteins are synthesized as inactive precursors that must undergo post-translational proteolysis in order to mature and become active. In the current study, we address the challenge of sequence-based discovery of proteolytic sites in secreted proteins using machine learning. Results: The results revealed that only half of the extracellular proteolytic sites are currently annotated, leaving over 3600 unannotated ones. Furthermore, we have found that only 6% of the unannotated sites are similar to known proteolytic sites, whereas the remaining 94% do not share significant similarity with any annotated proteolytic site. The computational challenges in these two cases are very different. While the precision in detecting the former group is close to perfect, only a mere 22% of the latter group were detected with a precision of 80%. The applicability of the classifier is demonstrated through members of the FGF family, in which we verified the conservation of physiologically-relevant proteolytic sites in homologous proteins. Contact: kliger@compugen.co.il; yossef.kliger@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7109841
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-71098412020-04-02 Predicting proteolytic sites in extracellular proteins: only halfway there Kliger, Yossef Gofer, Eyal Wool, Assaf Toporik, Amir Apatoff, Avihay Olshansky, Moshe Bioinformatics Original Papers Motivation: Many secretory proteins are synthesized as inactive precursors that must undergo post-translational proteolysis in order to mature and become active. In the current study, we address the challenge of sequence-based discovery of proteolytic sites in secreted proteins using machine learning. Results: The results revealed that only half of the extracellular proteolytic sites are currently annotated, leaving over 3600 unannotated ones. Furthermore, we have found that only 6% of the unannotated sites are similar to known proteolytic sites, whereas the remaining 94% do not share significant similarity with any annotated proteolytic site. The computational challenges in these two cases are very different. While the precision in detecting the former group is close to perfect, only a mere 22% of the latter group were detected with a precision of 80%. The applicability of the classifier is demonstrated through members of the FGF family, in which we verified the conservation of physiologically-relevant proteolytic sites in homologous proteins. Contact: kliger@compugen.co.il; yossef.kliger@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2008-04-15 2008-03-04 /pmc/articles/PMC7109841/ /pubmed/18321887 http://dx.doi.org/10.1093/bioinformatics/btn084 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Kliger, Yossef
Gofer, Eyal
Wool, Assaf
Toporik, Amir
Apatoff, Avihay
Olshansky, Moshe
Predicting proteolytic sites in extracellular proteins: only halfway there
title Predicting proteolytic sites in extracellular proteins: only halfway there
title_full Predicting proteolytic sites in extracellular proteins: only halfway there
title_fullStr Predicting proteolytic sites in extracellular proteins: only halfway there
title_full_unstemmed Predicting proteolytic sites in extracellular proteins: only halfway there
title_short Predicting proteolytic sites in extracellular proteins: only halfway there
title_sort predicting proteolytic sites in extracellular proteins: only halfway there
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7109841/
https://www.ncbi.nlm.nih.gov/pubmed/18321887
http://dx.doi.org/10.1093/bioinformatics/btn084
work_keys_str_mv AT kligeryossef predictingproteolyticsitesinextracellularproteinsonlyhalfwaythere
AT gofereyal predictingproteolyticsitesinextracellularproteinsonlyhalfwaythere
AT woolassaf predictingproteolyticsitesinextracellularproteinsonlyhalfwaythere
AT toporikamir predictingproteolyticsitesinextracellularproteinsonlyhalfwaythere
AT apatoffavihay predictingproteolyticsitesinextracellularproteinsonlyhalfwaythere
AT olshanskymoshe predictingproteolyticsitesinextracellularproteinsonlyhalfwaythere