Cargando…

TESTLoc: protein subcellular localization prediction from EST data

BACKGROUND: The eukaryotic cell has an intricate architecture with compartments and substructures dedicated to particular biological processes. Knowing the subcellular location of proteins not only indicates how bio-processes are organized in different cellular compartments, but also contributes to...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Yao-Qing, Burger, Gertraud
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3000424/
https://www.ncbi.nlm.nih.gov/pubmed/21078192
http://dx.doi.org/10.1186/1471-2105-11-563
_version_ 1782193543655194624
author Shen, Yao-Qing
Burger, Gertraud
author_facet Shen, Yao-Qing
Burger, Gertraud
author_sort Shen, Yao-Qing
collection PubMed
description BACKGROUND: The eukaryotic cell has an intricate architecture with compartments and substructures dedicated to particular biological processes. Knowing the subcellular location of proteins not only indicates how bio-processes are organized in different cellular compartments, but also contributes to unravelling the function of individual proteins. Computational localization prediction is possible based on sequence information alone, and has been successfully applied to proteins from virtually all subcellular compartments and all domains of life. However, we realized that current prediction tools do not perform well on partial protein sequences such as those inferred from Expressed Sequence Tag (EST) data, limiting the exploitation of the large and taxonomically most comprehensive body of sequence information from eukaryotes. RESULTS: We developed a new predictor, TESTLoc, suited for subcellular localization prediction of proteins based on their partial sequence conceptually translated from ESTs (EST-peptides). Support Vector Machine (SVM) is used as computational method and EST-peptides are represented by different features such as amino acid composition and physicochemical properties. When TESTLoc was applied to the most challenging test case (plant data), it yielded high accuracy (~85%). CONCLUSIONS: TESTLoc is a localization prediction tool tailored for EST data. It provides a variety of models for the users to choose from, and is available for download at http://megasun.bch.umontreal.ca/~shenyq/TESTLoc/TESTLoc.html
format Text
id pubmed-3000424
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30004242010-12-15 TESTLoc: protein subcellular localization prediction from EST data Shen, Yao-Qing Burger, Gertraud BMC Bioinformatics Research Article BACKGROUND: The eukaryotic cell has an intricate architecture with compartments and substructures dedicated to particular biological processes. Knowing the subcellular location of proteins not only indicates how bio-processes are organized in different cellular compartments, but also contributes to unravelling the function of individual proteins. Computational localization prediction is possible based on sequence information alone, and has been successfully applied to proteins from virtually all subcellular compartments and all domains of life. However, we realized that current prediction tools do not perform well on partial protein sequences such as those inferred from Expressed Sequence Tag (EST) data, limiting the exploitation of the large and taxonomically most comprehensive body of sequence information from eukaryotes. RESULTS: We developed a new predictor, TESTLoc, suited for subcellular localization prediction of proteins based on their partial sequence conceptually translated from ESTs (EST-peptides). Support Vector Machine (SVM) is used as computational method and EST-peptides are represented by different features such as amino acid composition and physicochemical properties. When TESTLoc was applied to the most challenging test case (plant data), it yielded high accuracy (~85%). CONCLUSIONS: TESTLoc is a localization prediction tool tailored for EST data. It provides a variety of models for the users to choose from, and is available for download at http://megasun.bch.umontreal.ca/~shenyq/TESTLoc/TESTLoc.html BioMed Central 2010-11-15 /pmc/articles/PMC3000424/ /pubmed/21078192 http://dx.doi.org/10.1186/1471-2105-11-563 Text en Copyright ©2010 Shen and Burger; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Shen, Yao-Qing
Burger, Gertraud
TESTLoc: protein subcellular localization prediction from EST data
title TESTLoc: protein subcellular localization prediction from EST data
title_full TESTLoc: protein subcellular localization prediction from EST data
title_fullStr TESTLoc: protein subcellular localization prediction from EST data
title_full_unstemmed TESTLoc: protein subcellular localization prediction from EST data
title_short TESTLoc: protein subcellular localization prediction from EST data
title_sort testloc: protein subcellular localization prediction from est data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3000424/
https://www.ncbi.nlm.nih.gov/pubmed/21078192
http://dx.doi.org/10.1186/1471-2105-11-563
work_keys_str_mv AT shenyaoqing testlocproteinsubcellularlocalizationpredictionfromestdata
AT burgergertraud testlocproteinsubcellularlocalizationpredictionfromestdata