Cargando…

A novel approach for protein subcellular location prediction using amino acid exposure

BACKGROUND: Proteins perform their functions in associated cellular locations. Therefore, the study of protein function can be facilitated by predictions of protein location. Protein location can be predicted either from the sequence of a protein alone by identification of targeting peptide sequence...

Descripción completa

Detalles Bibliográficos
Autores principales: Mer, Arvind Singh, Andrade-Navarro, Miguel A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4219330/
https://www.ncbi.nlm.nih.gov/pubmed/24283794
http://dx.doi.org/10.1186/1471-2105-14-342
_version_ 1782342566337839104
author Mer, Arvind Singh
Andrade-Navarro, Miguel A
author_facet Mer, Arvind Singh
Andrade-Navarro, Miguel A
author_sort Mer, Arvind Singh
collection PubMed
description BACKGROUND: Proteins perform their functions in associated cellular locations. Therefore, the study of protein function can be facilitated by predictions of protein location. Protein location can be predicted either from the sequence of a protein alone by identification of targeting peptide sequences and motifs, or by homology to proteins of known location. A third approach, which is complementary, exploits the differences in amino acid composition of proteins associated to different cellular locations, and can be useful if motif and homology information are missing. Here we expand this approach taking into account amino acid composition at different levels of amino acid exposure. RESULTS: Our method has two stages. For stage one, we trained multiple Support Vector Machines (SVMs) to score eukaryotic protein sequences for membership to each of three categories: nuclear, cytoplasmic and extracellular, plus extra category nucleocytoplasmic, accounting for the fact that a large number of proteins shuttles between those two locations. In stage two we use an artificial neural network (ANN) to propose a category from the scores given to the four locations in stage one. The method reaches an accuracy of 68% when using as input 3D-derived values of amino acid exposure. Calibration of the method using predicted values of amino acid exposure allows classifying proteins without 3D-information with an accuracy of 62% and discerning proteins in different locations even if they shared high levels of identity. CONCLUSIONS: In this study we explored the relationship between residue exposure and protein subcellular location. We developed a new algorithm for subcellular location prediction that uses residue exposure signatures. Our algorithm uses a novel approach to address the multiclass classification problem. The algorithm is implemented as web server 'NYCE’ and can be accessed at http://cbdm.mdc-berlin.de/~amer/nyce.
format Online
Article
Text
id pubmed-4219330
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42193302014-11-05 A novel approach for protein subcellular location prediction using amino acid exposure Mer, Arvind Singh Andrade-Navarro, Miguel A BMC Bioinformatics Methodology Article BACKGROUND: Proteins perform their functions in associated cellular locations. Therefore, the study of protein function can be facilitated by predictions of protein location. Protein location can be predicted either from the sequence of a protein alone by identification of targeting peptide sequences and motifs, or by homology to proteins of known location. A third approach, which is complementary, exploits the differences in amino acid composition of proteins associated to different cellular locations, and can be useful if motif and homology information are missing. Here we expand this approach taking into account amino acid composition at different levels of amino acid exposure. RESULTS: Our method has two stages. For stage one, we trained multiple Support Vector Machines (SVMs) to score eukaryotic protein sequences for membership to each of three categories: nuclear, cytoplasmic and extracellular, plus extra category nucleocytoplasmic, accounting for the fact that a large number of proteins shuttles between those two locations. In stage two we use an artificial neural network (ANN) to propose a category from the scores given to the four locations in stage one. The method reaches an accuracy of 68% when using as input 3D-derived values of amino acid exposure. Calibration of the method using predicted values of amino acid exposure allows classifying proteins without 3D-information with an accuracy of 62% and discerning proteins in different locations even if they shared high levels of identity. CONCLUSIONS: In this study we explored the relationship between residue exposure and protein subcellular location. We developed a new algorithm for subcellular location prediction that uses residue exposure signatures. Our algorithm uses a novel approach to address the multiclass classification problem. The algorithm is implemented as web server 'NYCE’ and can be accessed at http://cbdm.mdc-berlin.de/~amer/nyce. BioMed Central 2013-11-28 /pmc/articles/PMC4219330/ /pubmed/24283794 http://dx.doi.org/10.1186/1471-2105-14-342 Text en Copyright © 2013 Mer and Andrade-Navarro; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Mer, Arvind Singh
Andrade-Navarro, Miguel A
A novel approach for protein subcellular location prediction using amino acid exposure
title A novel approach for protein subcellular location prediction using amino acid exposure
title_full A novel approach for protein subcellular location prediction using amino acid exposure
title_fullStr A novel approach for protein subcellular location prediction using amino acid exposure
title_full_unstemmed A novel approach for protein subcellular location prediction using amino acid exposure
title_short A novel approach for protein subcellular location prediction using amino acid exposure
title_sort novel approach for protein subcellular location prediction using amino acid exposure
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4219330/
https://www.ncbi.nlm.nih.gov/pubmed/24283794
http://dx.doi.org/10.1186/1471-2105-14-342
work_keys_str_mv AT merarvindsingh anovelapproachforproteinsubcellularlocationpredictionusingaminoacidexposure
AT andradenavarromiguela anovelapproachforproteinsubcellularlocationpredictionusingaminoacidexposure
AT merarvindsingh novelapproachforproteinsubcellularlocationpredictionusingaminoacidexposure
AT andradenavarromiguela novelapproachforproteinsubcellularlocationpredictionusingaminoacidexposure