Cargando…

ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes

We present a method called ngLOC, an n-gram-based Bayesian classifier that predicts the localization of a protein sequence over ten distinct subcellular organelles. A tenfold cross-validation result shows an accuracy of 89% for sequences localized to a single organelle, and 82% for those localized t...

Descripción completa

Detalles Bibliográficos
Autores principales: King, Brian R, Guda, Chittibabu
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1929137/
https://www.ncbi.nlm.nih.gov/pubmed/17472741
http://dx.doi.org/10.1186/gb-2007-8-5-r68
_version_ 1782134267920252928
author King, Brian R
Guda, Chittibabu
author_facet King, Brian R
Guda, Chittibabu
author_sort King, Brian R
collection PubMed
description We present a method called ngLOC, an n-gram-based Bayesian classifier that predicts the localization of a protein sequence over ten distinct subcellular organelles. A tenfold cross-validation result shows an accuracy of 89% for sequences localized to a single organelle, and 82% for those localized to multiple organelles. An enhanced version of ngLOC was developed to estimate the subcellular proteomes of eight eukaryotic organisms: yeast, nematode, fruitfly, mosquito, zebrafish, chicken, mouse, and human.
format Text
id pubmed-1929137
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19291372007-07-21 ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes King, Brian R Guda, Chittibabu Genome Biol Method We present a method called ngLOC, an n-gram-based Bayesian classifier that predicts the localization of a protein sequence over ten distinct subcellular organelles. A tenfold cross-validation result shows an accuracy of 89% for sequences localized to a single organelle, and 82% for those localized to multiple organelles. An enhanced version of ngLOC was developed to estimate the subcellular proteomes of eight eukaryotic organisms: yeast, nematode, fruitfly, mosquito, zebrafish, chicken, mouse, and human. BioMed Central 2007 2007-05-01 /pmc/articles/PMC1929137/ /pubmed/17472741 http://dx.doi.org/10.1186/gb-2007-8-5-r68 Text en Copyright © 2007 King and Guda; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method
King, Brian R
Guda, Chittibabu
ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes
title ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes
title_full ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes
title_fullStr ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes
title_full_unstemmed ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes
title_short ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes
title_sort ngloc: an n-gram-based bayesian method for estimating the subcellular proteomes of eukaryotes
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1929137/
https://www.ncbi.nlm.nih.gov/pubmed/17472741
http://dx.doi.org/10.1186/gb-2007-8-5-r68
work_keys_str_mv AT kingbrianr nglocanngrambasedbayesianmethodforestimatingthesubcellularproteomesofeukaryotes
AT gudachittibabu nglocanngrambasedbayesianmethodforestimatingthesubcellularproteomesofeukaryotes