Cargando…

'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools

BACKGROUND: Knowing the subcellular location of proteins provides clues to their function as well as the interconnectivity of biological processes. Dozens of tools are available for predicting protein location in the eukaryotic cell. Each tool performs well on certain data sets, but their prediction...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Yao Qing, Burger, Gertraud
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2176073/
https://www.ncbi.nlm.nih.gov/pubmed/17967180
http://dx.doi.org/10.1186/1471-2105-8-420
_version_ 1782145485308428288
author Shen, Yao Qing
Burger, Gertraud
author_facet Shen, Yao Qing
Burger, Gertraud
author_sort Shen, Yao Qing
collection PubMed
description BACKGROUND: Knowing the subcellular location of proteins provides clues to their function as well as the interconnectivity of biological processes. Dozens of tools are available for predicting protein location in the eukaryotic cell. Each tool performs well on certain data sets, but their predictions often disagree for a given protein. Since the individual tools each have particular strengths, we set out to integrate them in a way that optimally exploits their potential. The method we present here is applicable to various subcellular locations, but tailored for predicting whether or not a protein is localized in mitochondria. Knowledge of the mitochondrial proteome is relevant to understanding the role of this organelle in global cellular processes. RESULTS: In order to develop a method for enhanced prediction of subcellular localization, we integrated the outputs of available localization prediction tools by several strategies, and tested the performance of each strategy with known mitochondrial proteins. The accuracy obtained (up to 92%) surpasses by far the individual tools. The method of integration proved crucial to the performance. For the prediction of mitochondrion-located proteins, integration via a two-layer decision tree clearly outperforms simpler methods, as it allows emphasis of biologically relevant features such as the mitochondrial targeting peptide and transmembrane domains. CONCLUSION: We developed an approach that enhances the prediction accuracy of mitochondrial proteins by uniting the strength of specialized tools. The combination of machine-learning based integration with biological expert knowledge leads to improved performance. This approach also alleviates the conundrum of how to choose between conflicting predictions. Our approach is easy to implement, and applicable to predicting subcellular locations other than mitochondria, as well as other biological features. For a trial of our approach, we provide a webservice for mitochondrial protein prediction (named YimLOC), which can be accessed through the AnaBench suite at http://anabench.bcm.umontreal.ca/anabench/. The source code is provided in the Additional File 2.
format Text
id pubmed-2176073
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-21760732008-01-09 'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools Shen, Yao Qing Burger, Gertraud BMC Bioinformatics Methodology Article BACKGROUND: Knowing the subcellular location of proteins provides clues to their function as well as the interconnectivity of biological processes. Dozens of tools are available for predicting protein location in the eukaryotic cell. Each tool performs well on certain data sets, but their predictions often disagree for a given protein. Since the individual tools each have particular strengths, we set out to integrate them in a way that optimally exploits their potential. The method we present here is applicable to various subcellular locations, but tailored for predicting whether or not a protein is localized in mitochondria. Knowledge of the mitochondrial proteome is relevant to understanding the role of this organelle in global cellular processes. RESULTS: In order to develop a method for enhanced prediction of subcellular localization, we integrated the outputs of available localization prediction tools by several strategies, and tested the performance of each strategy with known mitochondrial proteins. The accuracy obtained (up to 92%) surpasses by far the individual tools. The method of integration proved crucial to the performance. For the prediction of mitochondrion-located proteins, integration via a two-layer decision tree clearly outperforms simpler methods, as it allows emphasis of biologically relevant features such as the mitochondrial targeting peptide and transmembrane domains. CONCLUSION: We developed an approach that enhances the prediction accuracy of mitochondrial proteins by uniting the strength of specialized tools. The combination of machine-learning based integration with biological expert knowledge leads to improved performance. This approach also alleviates the conundrum of how to choose between conflicting predictions. Our approach is easy to implement, and applicable to predicting subcellular locations other than mitochondria, as well as other biological features. For a trial of our approach, we provide a webservice for mitochondrial protein prediction (named YimLOC), which can be accessed through the AnaBench suite at http://anabench.bcm.umontreal.ca/anabench/. The source code is provided in the Additional File 2. BioMed Central 2007-10-29 /pmc/articles/PMC2176073/ /pubmed/17967180 http://dx.doi.org/10.1186/1471-2105-8-420 Text en Copyright © 2007 Shen and Burger; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Shen, Yao Qing
Burger, Gertraud
'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools
title 'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools
title_full 'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools
title_fullStr 'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools
title_full_unstemmed 'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools
title_short 'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools
title_sort 'unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2176073/
https://www.ncbi.nlm.nih.gov/pubmed/17967180
http://dx.doi.org/10.1186/1471-2105-8-420
work_keys_str_mv AT shenyaoqing uniteandconquerenhancedpredictionofproteinsubcellularlocalizationbyintegratingmultiplespecializedtools
AT burgergertraud uniteandconquerenhancedpredictionofproteinsubcellularlocalizationbyintegratingmultiplespecializedtools