Cargando…

Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier

Predicting protein subcellular location is necessary for understanding cell function. Several machine learning methods have been developed for computational prediction of primary protein sequences because wet experiments are costly and time consuming. However, two problems still exist in state-of-th...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Xiaotong, Liu, Fulin, Ju, Ying, Wang, Zhen, Wang, Chunyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4914962/
https://www.ncbi.nlm.nih.gov/pubmed/27323846
http://dx.doi.org/10.1038/srep28087
_version_ 1782438616029462528
author Guo, Xiaotong
Liu, Fulin
Ju, Ying
Wang, Zhen
Wang, Chunyu
author_facet Guo, Xiaotong
Liu, Fulin
Ju, Ying
Wang, Zhen
Wang, Chunyu
author_sort Guo, Xiaotong
collection PubMed
description Predicting protein subcellular location is necessary for understanding cell function. Several machine learning methods have been developed for computational prediction of primary protein sequences because wet experiments are costly and time consuming. However, two problems still exist in state-of-the-art methods. First, several proteins appear in different subcellular structures simultaneously, whereas current methods only predict one protein sequence in one subcellular structure. Second, most software tools are trained with obsolete data and the latest new databases are missed. We proposed a novel multi-label classification algorithm to solve the first problem and integrated several latest databases to improve prediction performance. Experiments proved the effectiveness of the proposed method. The present study would facilitate research on cellular proteomics.
format Online
Article
Text
id pubmed-4914962
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-49149622016-06-27 Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier Guo, Xiaotong Liu, Fulin Ju, Ying Wang, Zhen Wang, Chunyu Sci Rep Article Predicting protein subcellular location is necessary for understanding cell function. Several machine learning methods have been developed for computational prediction of primary protein sequences because wet experiments are costly and time consuming. However, two problems still exist in state-of-the-art methods. First, several proteins appear in different subcellular structures simultaneously, whereas current methods only predict one protein sequence in one subcellular structure. Second, most software tools are trained with obsolete data and the latest new databases are missed. We proposed a novel multi-label classification algorithm to solve the first problem and integrated several latest databases to improve prediction performance. Experiments proved the effectiveness of the proposed method. The present study would facilitate research on cellular proteomics. Nature Publishing Group 2016-06-21 /pmc/articles/PMC4914962/ /pubmed/27323846 http://dx.doi.org/10.1038/srep28087 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Guo, Xiaotong
Liu, Fulin
Ju, Ying
Wang, Zhen
Wang, Chunyu
Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier
title Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier
title_full Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier
title_fullStr Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier
title_full_unstemmed Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier
title_short Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier
title_sort human protein subcellular localization with integrated source and multi-label ensemble classifier
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4914962/
https://www.ncbi.nlm.nih.gov/pubmed/27323846
http://dx.doi.org/10.1038/srep28087
work_keys_str_mv AT guoxiaotong humanproteinsubcellularlocalizationwithintegratedsourceandmultilabelensembleclassifier
AT liufulin humanproteinsubcellularlocalizationwithintegratedsourceandmultilabelensembleclassifier
AT juying humanproteinsubcellularlocalizationwithintegratedsourceandmultilabelensembleclassifier
AT wangzhen humanproteinsubcellularlocalizationwithintegratedsourceandmultilabelensembleclassifier
AT wangchunyu humanproteinsubcellularlocalizationwithintegratedsourceandmultilabelensembleclassifier