Cargando…

Protein networks markedly improve prediction of subcellular localization in multiple eukaryotic species

The function of a protein is intimately tied to its subcellular localization. Although localizations have been measured for many yeast proteins through systematic GFP fusions, similar studies in other branches of life are still forthcoming. In the interim, various machine-learning methods have been...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, KiYoung, Chuang, Han-Yu, Beyer, Andreas, Sung, Min-Kyung, Huh, Won-Ki, Lee, Bonghee, Ideker, Trey
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2582614/
https://www.ncbi.nlm.nih.gov/pubmed/18836191
http://dx.doi.org/10.1093/nar/gkn619
_version_ 1782160683728633856
author Lee, KiYoung
Chuang, Han-Yu
Beyer, Andreas
Sung, Min-Kyung
Huh, Won-Ki
Lee, Bonghee
Ideker, Trey
author_facet Lee, KiYoung
Chuang, Han-Yu
Beyer, Andreas
Sung, Min-Kyung
Huh, Won-Ki
Lee, Bonghee
Ideker, Trey
author_sort Lee, KiYoung
collection PubMed
description The function of a protein is intimately tied to its subcellular localization. Although localizations have been measured for many yeast proteins through systematic GFP fusions, similar studies in other branches of life are still forthcoming. In the interim, various machine-learning methods have been proposed to predict localization using physical characteristics of a protein, such as amino acid content, hydrophobicity, side-chain mass and domain composition. However, there has been comparatively little work on predicting localization using protein networks. Here, we predict protein localizations by integrating an extensive set of protein physical characteristics over a protein's extended protein–protein interaction neighborhood, using a classification framework called ‘Divide and Conquer k-Nearest Neighbors’ (DC-kNN). These predictions achieve significantly higher accuracy than two well-known methods for predicting protein localization in yeast. Using new GFP imaging experiments, we show that the network-based approach can extend and revise previous annotations made from high-throughput studies. Finally, we show that our approach remains highly predictive in higher eukaryotes such as fly and human, in which most localizations are unknown and the protein network coverage is less substantial.
format Text
id pubmed-2582614
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-25826142009-01-22 Protein networks markedly improve prediction of subcellular localization in multiple eukaryotic species Lee, KiYoung Chuang, Han-Yu Beyer, Andreas Sung, Min-Kyung Huh, Won-Ki Lee, Bonghee Ideker, Trey Nucleic Acids Res Methods Online The function of a protein is intimately tied to its subcellular localization. Although localizations have been measured for many yeast proteins through systematic GFP fusions, similar studies in other branches of life are still forthcoming. In the interim, various machine-learning methods have been proposed to predict localization using physical characteristics of a protein, such as amino acid content, hydrophobicity, side-chain mass and domain composition. However, there has been comparatively little work on predicting localization using protein networks. Here, we predict protein localizations by integrating an extensive set of protein physical characteristics over a protein's extended protein–protein interaction neighborhood, using a classification framework called ‘Divide and Conquer k-Nearest Neighbors’ (DC-kNN). These predictions achieve significantly higher accuracy than two well-known methods for predicting protein localization in yeast. Using new GFP imaging experiments, we show that the network-based approach can extend and revise previous annotations made from high-throughput studies. Finally, we show that our approach remains highly predictive in higher eukaryotes such as fly and human, in which most localizations are unknown and the protein network coverage is less substantial. Oxford University Press 2008-11 2008-10-04 /pmc/articles/PMC2582614/ /pubmed/18836191 http://dx.doi.org/10.1093/nar/gkn619 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Lee, KiYoung
Chuang, Han-Yu
Beyer, Andreas
Sung, Min-Kyung
Huh, Won-Ki
Lee, Bonghee
Ideker, Trey
Protein networks markedly improve prediction of subcellular localization in multiple eukaryotic species
title Protein networks markedly improve prediction of subcellular localization in multiple eukaryotic species
title_full Protein networks markedly improve prediction of subcellular localization in multiple eukaryotic species
title_fullStr Protein networks markedly improve prediction of subcellular localization in multiple eukaryotic species
title_full_unstemmed Protein networks markedly improve prediction of subcellular localization in multiple eukaryotic species
title_short Protein networks markedly improve prediction of subcellular localization in multiple eukaryotic species
title_sort protein networks markedly improve prediction of subcellular localization in multiple eukaryotic species
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2582614/
https://www.ncbi.nlm.nih.gov/pubmed/18836191
http://dx.doi.org/10.1093/nar/gkn619
work_keys_str_mv AT leekiyoung proteinnetworksmarkedlyimprovepredictionofsubcellularlocalizationinmultipleeukaryoticspecies
AT chuanghanyu proteinnetworksmarkedlyimprovepredictionofsubcellularlocalizationinmultipleeukaryoticspecies
AT beyerandreas proteinnetworksmarkedlyimprovepredictionofsubcellularlocalizationinmultipleeukaryoticspecies
AT sungminkyung proteinnetworksmarkedlyimprovepredictionofsubcellularlocalizationinmultipleeukaryoticspecies
AT huhwonki proteinnetworksmarkedlyimprovepredictionofsubcellularlocalizationinmultipleeukaryoticspecies
AT leebonghee proteinnetworksmarkedlyimprovepredictionofsubcellularlocalizationinmultipleeukaryoticspecies
AT idekertrey proteinnetworksmarkedlyimprovepredictionofsubcellularlocalizationinmultipleeukaryoticspecies