Cargando…
ngLOC: software and web server for predicting protein subcellular localization in prokaryotes and eukaryotes
BACKGROUND: Understanding protein subcellular localization is a necessary component toward understanding the overall function of a protein. Numerous computational methods have been published over the past decade, with varying degrees of success. Despite the large number of published methods in this...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3532370/ https://www.ncbi.nlm.nih.gov/pubmed/22780965 http://dx.doi.org/10.1186/1756-0500-5-351 |
_version_ | 1782254299715207168 |
---|---|
author | King, Brian R Vural, Suleyman Pandey, Sanjit Barteau, Alex Guda, Chittibabu |
author_facet | King, Brian R Vural, Suleyman Pandey, Sanjit Barteau, Alex Guda, Chittibabu |
author_sort | King, Brian R |
collection | PubMed |
description | BACKGROUND: Understanding protein subcellular localization is a necessary component toward understanding the overall function of a protein. Numerous computational methods have been published over the past decade, with varying degrees of success. Despite the large number of published methods in this area, only a small fraction of them are available for researchers to use in their own studies. Of those that are available, many are limited by predicting only a small number of organelles in the cell. Additionally, the majority of methods predict only a single location for a sequence, even though it is known that a large fraction of the proteins in eukaryotic species shuttle between locations to carry out their function. FINDINGS: We present a software package and a web server for predicting the subcellular localization of protein sequences based on the ngLOC method. ngLOC is an n-gram-based Bayesian classifier that predicts subcellular localization of proteins both in prokaryotes and eukaryotes(.) The overall prediction accuracy varies from 89.8% to 91.4% across species. This program can predict 11 distinct locations each in plant and animal species. ngLOC also predicts 4 and 5 distinct locations on gram-positive and gram-negative bacterial datasets, respectively. CONCLUSIONS: ngLOC is a generic method that can be trained by data from a variety of species or classes for predicting protein subcellular localization. The standalone software is freely available for academic use under GNU GPL, and the ngLOC web server is also accessible at http://ngloc.unmc.edu. |
format | Online Article Text |
id | pubmed-3532370 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35323702013-01-03 ngLOC: software and web server for predicting protein subcellular localization in prokaryotes and eukaryotes King, Brian R Vural, Suleyman Pandey, Sanjit Barteau, Alex Guda, Chittibabu BMC Res Notes Technical Note BACKGROUND: Understanding protein subcellular localization is a necessary component toward understanding the overall function of a protein. Numerous computational methods have been published over the past decade, with varying degrees of success. Despite the large number of published methods in this area, only a small fraction of them are available for researchers to use in their own studies. Of those that are available, many are limited by predicting only a small number of organelles in the cell. Additionally, the majority of methods predict only a single location for a sequence, even though it is known that a large fraction of the proteins in eukaryotic species shuttle between locations to carry out their function. FINDINGS: We present a software package and a web server for predicting the subcellular localization of protein sequences based on the ngLOC method. ngLOC is an n-gram-based Bayesian classifier that predicts subcellular localization of proteins both in prokaryotes and eukaryotes(.) The overall prediction accuracy varies from 89.8% to 91.4% across species. This program can predict 11 distinct locations each in plant and animal species. ngLOC also predicts 4 and 5 distinct locations on gram-positive and gram-negative bacterial datasets, respectively. CONCLUSIONS: ngLOC is a generic method that can be trained by data from a variety of species or classes for predicting protein subcellular localization. The standalone software is freely available for academic use under GNU GPL, and the ngLOC web server is also accessible at http://ngloc.unmc.edu. BioMed Central 2012-07-10 /pmc/articles/PMC3532370/ /pubmed/22780965 http://dx.doi.org/10.1186/1756-0500-5-351 Text en Copyright ©2012 King et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Technical Note King, Brian R Vural, Suleyman Pandey, Sanjit Barteau, Alex Guda, Chittibabu ngLOC: software and web server for predicting protein subcellular localization in prokaryotes and eukaryotes |
title | ngLOC: software and web server for predicting protein subcellular localization in prokaryotes and eukaryotes |
title_full | ngLOC: software and web server for predicting protein subcellular localization in prokaryotes and eukaryotes |
title_fullStr | ngLOC: software and web server for predicting protein subcellular localization in prokaryotes and eukaryotes |
title_full_unstemmed | ngLOC: software and web server for predicting protein subcellular localization in prokaryotes and eukaryotes |
title_short | ngLOC: software and web server for predicting protein subcellular localization in prokaryotes and eukaryotes |
title_sort | ngloc: software and web server for predicting protein subcellular localization in prokaryotes and eukaryotes |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3532370/ https://www.ncbi.nlm.nih.gov/pubmed/22780965 http://dx.doi.org/10.1186/1756-0500-5-351 |
work_keys_str_mv | AT kingbrianr nglocsoftwareandwebserverforpredictingproteinsubcellularlocalizationinprokaryotesandeukaryotes AT vuralsuleyman nglocsoftwareandwebserverforpredictingproteinsubcellularlocalizationinprokaryotesandeukaryotes AT pandeysanjit nglocsoftwareandwebserverforpredictingproteinsubcellularlocalizationinprokaryotesandeukaryotes AT barteaualex nglocsoftwareandwebserverforpredictingproteinsubcellularlocalizationinprokaryotesandeukaryotes AT gudachittibabu nglocsoftwareandwebserverforpredictingproteinsubcellularlocalizationinprokaryotesandeukaryotes |