Cargando…
Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier
Predicting protein subcellular location is necessary for understanding cell function. Several machine learning methods have been developed for computational prediction of primary protein sequences because wet experiments are costly and time consuming. However, two problems still exist in state-of-th...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4914962/ https://www.ncbi.nlm.nih.gov/pubmed/27323846 http://dx.doi.org/10.1038/srep28087 |
_version_ | 1782438616029462528 |
---|---|
author | Guo, Xiaotong Liu, Fulin Ju, Ying Wang, Zhen Wang, Chunyu |
author_facet | Guo, Xiaotong Liu, Fulin Ju, Ying Wang, Zhen Wang, Chunyu |
author_sort | Guo, Xiaotong |
collection | PubMed |
description | Predicting protein subcellular location is necessary for understanding cell function. Several machine learning methods have been developed for computational prediction of primary protein sequences because wet experiments are costly and time consuming. However, two problems still exist in state-of-the-art methods. First, several proteins appear in different subcellular structures simultaneously, whereas current methods only predict one protein sequence in one subcellular structure. Second, most software tools are trained with obsolete data and the latest new databases are missed. We proposed a novel multi-label classification algorithm to solve the first problem and integrated several latest databases to improve prediction performance. Experiments proved the effectiveness of the proposed method. The present study would facilitate research on cellular proteomics. |
format | Online Article Text |
id | pubmed-4914962 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-49149622016-06-27 Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier Guo, Xiaotong Liu, Fulin Ju, Ying Wang, Zhen Wang, Chunyu Sci Rep Article Predicting protein subcellular location is necessary for understanding cell function. Several machine learning methods have been developed for computational prediction of primary protein sequences because wet experiments are costly and time consuming. However, two problems still exist in state-of-the-art methods. First, several proteins appear in different subcellular structures simultaneously, whereas current methods only predict one protein sequence in one subcellular structure. Second, most software tools are trained with obsolete data and the latest new databases are missed. We proposed a novel multi-label classification algorithm to solve the first problem and integrated several latest databases to improve prediction performance. Experiments proved the effectiveness of the proposed method. The present study would facilitate research on cellular proteomics. Nature Publishing Group 2016-06-21 /pmc/articles/PMC4914962/ /pubmed/27323846 http://dx.doi.org/10.1038/srep28087 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Guo, Xiaotong Liu, Fulin Ju, Ying Wang, Zhen Wang, Chunyu Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier |
title | Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier |
title_full | Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier |
title_fullStr | Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier |
title_full_unstemmed | Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier |
title_short | Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier |
title_sort | human protein subcellular localization with integrated source and multi-label ensemble classifier |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4914962/ https://www.ncbi.nlm.nih.gov/pubmed/27323846 http://dx.doi.org/10.1038/srep28087 |
work_keys_str_mv | AT guoxiaotong humanproteinsubcellularlocalizationwithintegratedsourceandmultilabelensembleclassifier AT liufulin humanproteinsubcellularlocalizationwithintegratedsourceandmultilabelensembleclassifier AT juying humanproteinsubcellularlocalizationwithintegratedsourceandmultilabelensembleclassifier AT wangzhen humanproteinsubcellularlocalizationwithintegratedsourceandmultilabelensembleclassifier AT wangchunyu humanproteinsubcellularlocalizationwithintegratedsourceandmultilabelensembleclassifier |