Cargando…
Predicting human protein subcellular localization by heterogeneous and comprehensive approaches
Drug development and investigation of protein function both require an understanding of protein subcellular localization. We developed a system, REALoc, that can predict the subcellular localization of singleplex and multiplex proteins in humans. This system, based on comprehensive strategy, consist...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5489166/ https://www.ncbi.nlm.nih.gov/pubmed/28658305 http://dx.doi.org/10.1371/journal.pone.0178832 |
_version_ | 1783246756580950016 |
---|---|
author | Tung, Chi-Hua Chen, Chi-Wei Sun, Han-Hao Chu, Yen-Wei |
author_facet | Tung, Chi-Hua Chen, Chi-Wei Sun, Han-Hao Chu, Yen-Wei |
author_sort | Tung, Chi-Hua |
collection | PubMed |
description | Drug development and investigation of protein function both require an understanding of protein subcellular localization. We developed a system, REALoc, that can predict the subcellular localization of singleplex and multiplex proteins in humans. This system, based on comprehensive strategy, consists of two heterogeneous systematic frameworks that integrate one-to-one and many-to-many machine learning methods and use sequence-based features, including amino acid composition, surface accessibility, weighted sign aa index, and sequence similarity profile, as well as gene ontology function-based features. REALoc can be used to predict localization to six subcellular compartments (cell membrane, cytoplasm, endoplasmic reticulum/Golgi, mitochondrion, nucleus, and extracellular). REALoc yielded a 75.3% absolute true success rate during five-fold cross-validation and a 57.1% absolute true success rate in an independent database test, which was >10% higher than six other prediction systems. Lastly, we analyzed the effects of Vote and GANN models on singleplex and multiplex localization prediction efficacy. REALoc is freely available at http://predictor.nchu.edu.tw/REALoc. |
format | Online Article Text |
id | pubmed-5489166 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-54891662017-07-11 Predicting human protein subcellular localization by heterogeneous and comprehensive approaches Tung, Chi-Hua Chen, Chi-Wei Sun, Han-Hao Chu, Yen-Wei PLoS One Research Article Drug development and investigation of protein function both require an understanding of protein subcellular localization. We developed a system, REALoc, that can predict the subcellular localization of singleplex and multiplex proteins in humans. This system, based on comprehensive strategy, consists of two heterogeneous systematic frameworks that integrate one-to-one and many-to-many machine learning methods and use sequence-based features, including amino acid composition, surface accessibility, weighted sign aa index, and sequence similarity profile, as well as gene ontology function-based features. REALoc can be used to predict localization to six subcellular compartments (cell membrane, cytoplasm, endoplasmic reticulum/Golgi, mitochondrion, nucleus, and extracellular). REALoc yielded a 75.3% absolute true success rate during five-fold cross-validation and a 57.1% absolute true success rate in an independent database test, which was >10% higher than six other prediction systems. Lastly, we analyzed the effects of Vote and GANN models on singleplex and multiplex localization prediction efficacy. REALoc is freely available at http://predictor.nchu.edu.tw/REALoc. Public Library of Science 2017-06-28 /pmc/articles/PMC5489166/ /pubmed/28658305 http://dx.doi.org/10.1371/journal.pone.0178832 Text en © 2017 Tung et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Tung, Chi-Hua Chen, Chi-Wei Sun, Han-Hao Chu, Yen-Wei Predicting human protein subcellular localization by heterogeneous and comprehensive approaches |
title | Predicting human protein subcellular localization by heterogeneous and comprehensive approaches |
title_full | Predicting human protein subcellular localization by heterogeneous and comprehensive approaches |
title_fullStr | Predicting human protein subcellular localization by heterogeneous and comprehensive approaches |
title_full_unstemmed | Predicting human protein subcellular localization by heterogeneous and comprehensive approaches |
title_short | Predicting human protein subcellular localization by heterogeneous and comprehensive approaches |
title_sort | predicting human protein subcellular localization by heterogeneous and comprehensive approaches |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5489166/ https://www.ncbi.nlm.nih.gov/pubmed/28658305 http://dx.doi.org/10.1371/journal.pone.0178832 |
work_keys_str_mv | AT tungchihua predictinghumanproteinsubcellularlocalizationbyheterogeneousandcomprehensiveapproaches AT chenchiwei predictinghumanproteinsubcellularlocalizationbyheterogeneousandcomprehensiveapproaches AT sunhanhao predictinghumanproteinsubcellularlocalizationbyheterogeneousandcomprehensiveapproaches AT chuyenwei predictinghumanproteinsubcellularlocalizationbyheterogeneousandcomprehensiveapproaches |