Cargando…
FragAnchor: A Large-Scale Predictor of Glycosylphosphatidylinositol Anchors in Eukaryote Protein Sequences by Qualitative Scoring
A glycosylphosphatidylinositol (GPI) anchor is a common but complex C-terminal post-translational modification of extracellular proteins in eukaryotes. Here we investigate the problem of correctly annotating GPI-anchored proteins for the growing number of sequences in public databases. We developed...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054108/ https://www.ncbi.nlm.nih.gov/pubmed/17893077 http://dx.doi.org/10.1016/S1672-0229(07)60022-9 |
_version_ | 1782458528097632256 |
---|---|
author | Poisson, Guylaine Chauve, Cedric Chen, Xin Bergeron, Anne |
author_facet | Poisson, Guylaine Chauve, Cedric Chen, Xin Bergeron, Anne |
author_sort | Poisson, Guylaine |
collection | PubMed |
description | A glycosylphosphatidylinositol (GPI) anchor is a common but complex C-terminal post-translational modification of extracellular proteins in eukaryotes. Here we investigate the problem of correctly annotating GPI-anchored proteins for the growing number of sequences in public databases. We developed a computational system, called FragAnchor, based on the tandem use of a neural network (NN) and a hidden Markov model (HMM). Firstly, NN selects potential GPI-anchored proteins in a dataset, then HMM parses these potential GPI signals and refines the prediction by qualitative scoring. FragAnchor correctly predicted 91% of all the GPI-anchored proteins annotated in the Swiss-Prot database. In a large-scale analysis of 29 eukaryote proteomes, FragAnchor predicted that the percentage of highly probable GPI-anchored proteins is between 0.21% and 2.01%. The distinctive feature of FragAnchor, compared with other systems, is that it targets only the C-terminus of a protein, making it less sensitive to the background noise found in databases and possible incomplete protein sequences. Moreover, FragAnchor can be used to predict GPI-anchored proteins in all eukaryotes. Finally, by using qualitative scoring, the predictions combine both sensitivity and information content. The predictor is publicly available at http://navet.ics.hawaii.edu/~fraganchor/NNHMM/NNHMM.html. |
format | Online Article Text |
id | pubmed-5054108 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-50541082016-10-14 FragAnchor: A Large-Scale Predictor of Glycosylphosphatidylinositol Anchors in Eukaryote Protein Sequences by Qualitative Scoring Poisson, Guylaine Chauve, Cedric Chen, Xin Bergeron, Anne Genomics Proteomics Bioinformatics Method A glycosylphosphatidylinositol (GPI) anchor is a common but complex C-terminal post-translational modification of extracellular proteins in eukaryotes. Here we investigate the problem of correctly annotating GPI-anchored proteins for the growing number of sequences in public databases. We developed a computational system, called FragAnchor, based on the tandem use of a neural network (NN) and a hidden Markov model (HMM). Firstly, NN selects potential GPI-anchored proteins in a dataset, then HMM parses these potential GPI signals and refines the prediction by qualitative scoring. FragAnchor correctly predicted 91% of all the GPI-anchored proteins annotated in the Swiss-Prot database. In a large-scale analysis of 29 eukaryote proteomes, FragAnchor predicted that the percentage of highly probable GPI-anchored proteins is between 0.21% and 2.01%. The distinctive feature of FragAnchor, compared with other systems, is that it targets only the C-terminus of a protein, making it less sensitive to the background noise found in databases and possible incomplete protein sequences. Moreover, FragAnchor can be used to predict GPI-anchored proteins in all eukaryotes. Finally, by using qualitative scoring, the predictions combine both sensitivity and information content. The predictor is publicly available at http://navet.ics.hawaii.edu/~fraganchor/NNHMM/NNHMM.html. Elsevier 2007 2007-09-22 /pmc/articles/PMC5054108/ /pubmed/17893077 http://dx.doi.org/10.1016/S1672-0229(07)60022-9 Text en © 2007 Beijing Institute of Genomics http://creativecommons.org/licenses/by-nc-sa/3.0/ This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/). |
spellingShingle | Method Poisson, Guylaine Chauve, Cedric Chen, Xin Bergeron, Anne FragAnchor: A Large-Scale Predictor of Glycosylphosphatidylinositol Anchors in Eukaryote Protein Sequences by Qualitative Scoring |
title | FragAnchor: A Large-Scale Predictor of Glycosylphosphatidylinositol Anchors in Eukaryote Protein Sequences by Qualitative Scoring |
title_full | FragAnchor: A Large-Scale Predictor of Glycosylphosphatidylinositol Anchors in Eukaryote Protein Sequences by Qualitative Scoring |
title_fullStr | FragAnchor: A Large-Scale Predictor of Glycosylphosphatidylinositol Anchors in Eukaryote Protein Sequences by Qualitative Scoring |
title_full_unstemmed | FragAnchor: A Large-Scale Predictor of Glycosylphosphatidylinositol Anchors in Eukaryote Protein Sequences by Qualitative Scoring |
title_short | FragAnchor: A Large-Scale Predictor of Glycosylphosphatidylinositol Anchors in Eukaryote Protein Sequences by Qualitative Scoring |
title_sort | fraganchor: a large-scale predictor of glycosylphosphatidylinositol anchors in eukaryote protein sequences by qualitative scoring |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054108/ https://www.ncbi.nlm.nih.gov/pubmed/17893077 http://dx.doi.org/10.1016/S1672-0229(07)60022-9 |
work_keys_str_mv | AT poissonguylaine fraganchoralargescalepredictorofglycosylphosphatidylinositolanchorsineukaryoteproteinsequencesbyqualitativescoring AT chauvecedric fraganchoralargescalepredictorofglycosylphosphatidylinositolanchorsineukaryoteproteinsequencesbyqualitativescoring AT chenxin fraganchoralargescalepredictorofglycosylphosphatidylinositolanchorsineukaryoteproteinsequencesbyqualitativescoring AT bergeronanne fraganchoralargescalepredictorofglycosylphosphatidylinositolanchorsineukaryoteproteinsequencesbyqualitativescoring |