Cargando…
TESTLoc: protein subcellular localization prediction from EST data
BACKGROUND: The eukaryotic cell has an intricate architecture with compartments and substructures dedicated to particular biological processes. Knowing the subcellular location of proteins not only indicates how bio-processes are organized in different cellular compartments, but also contributes to...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3000424/ https://www.ncbi.nlm.nih.gov/pubmed/21078192 http://dx.doi.org/10.1186/1471-2105-11-563 |
_version_ | 1782193543655194624 |
---|---|
author | Shen, Yao-Qing Burger, Gertraud |
author_facet | Shen, Yao-Qing Burger, Gertraud |
author_sort | Shen, Yao-Qing |
collection | PubMed |
description | BACKGROUND: The eukaryotic cell has an intricate architecture with compartments and substructures dedicated to particular biological processes. Knowing the subcellular location of proteins not only indicates how bio-processes are organized in different cellular compartments, but also contributes to unravelling the function of individual proteins. Computational localization prediction is possible based on sequence information alone, and has been successfully applied to proteins from virtually all subcellular compartments and all domains of life. However, we realized that current prediction tools do not perform well on partial protein sequences such as those inferred from Expressed Sequence Tag (EST) data, limiting the exploitation of the large and taxonomically most comprehensive body of sequence information from eukaryotes. RESULTS: We developed a new predictor, TESTLoc, suited for subcellular localization prediction of proteins based on their partial sequence conceptually translated from ESTs (EST-peptides). Support Vector Machine (SVM) is used as computational method and EST-peptides are represented by different features such as amino acid composition and physicochemical properties. When TESTLoc was applied to the most challenging test case (plant data), it yielded high accuracy (~85%). CONCLUSIONS: TESTLoc is a localization prediction tool tailored for EST data. It provides a variety of models for the users to choose from, and is available for download at http://megasun.bch.umontreal.ca/~shenyq/TESTLoc/TESTLoc.html |
format | Text |
id | pubmed-3000424 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-30004242010-12-15 TESTLoc: protein subcellular localization prediction from EST data Shen, Yao-Qing Burger, Gertraud BMC Bioinformatics Research Article BACKGROUND: The eukaryotic cell has an intricate architecture with compartments and substructures dedicated to particular biological processes. Knowing the subcellular location of proteins not only indicates how bio-processes are organized in different cellular compartments, but also contributes to unravelling the function of individual proteins. Computational localization prediction is possible based on sequence information alone, and has been successfully applied to proteins from virtually all subcellular compartments and all domains of life. However, we realized that current prediction tools do not perform well on partial protein sequences such as those inferred from Expressed Sequence Tag (EST) data, limiting the exploitation of the large and taxonomically most comprehensive body of sequence information from eukaryotes. RESULTS: We developed a new predictor, TESTLoc, suited for subcellular localization prediction of proteins based on their partial sequence conceptually translated from ESTs (EST-peptides). Support Vector Machine (SVM) is used as computational method and EST-peptides are represented by different features such as amino acid composition and physicochemical properties. When TESTLoc was applied to the most challenging test case (plant data), it yielded high accuracy (~85%). CONCLUSIONS: TESTLoc is a localization prediction tool tailored for EST data. It provides a variety of models for the users to choose from, and is available for download at http://megasun.bch.umontreal.ca/~shenyq/TESTLoc/TESTLoc.html BioMed Central 2010-11-15 /pmc/articles/PMC3000424/ /pubmed/21078192 http://dx.doi.org/10.1186/1471-2105-11-563 Text en Copyright ©2010 Shen and Burger; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Shen, Yao-Qing Burger, Gertraud TESTLoc: protein subcellular localization prediction from EST data |
title | TESTLoc: protein subcellular localization prediction from EST data |
title_full | TESTLoc: protein subcellular localization prediction from EST data |
title_fullStr | TESTLoc: protein subcellular localization prediction from EST data |
title_full_unstemmed | TESTLoc: protein subcellular localization prediction from EST data |
title_short | TESTLoc: protein subcellular localization prediction from EST data |
title_sort | testloc: protein subcellular localization prediction from est data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3000424/ https://www.ncbi.nlm.nih.gov/pubmed/21078192 http://dx.doi.org/10.1186/1471-2105-11-563 |
work_keys_str_mv | AT shenyaoqing testlocproteinsubcellularlocalizationpredictionfromestdata AT burgergertraud testlocproteinsubcellularlocalizationpredictionfromestdata |