Cargando…

Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier

Predicting protein subcellular location is necessary for understanding cell function. Several machine learning methods have been developed for computational prediction of primary protein sequences because wet experiments are costly and time consuming. However, two problems still exist in state-of-th...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Xiaotong, Liu, Fulin, Ju, Ying, Wang, Zhen, Wang, Chunyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4914962/
https://www.ncbi.nlm.nih.gov/pubmed/27323846
http://dx.doi.org/10.1038/srep28087
Descripción
Sumario:Predicting protein subcellular location is necessary for understanding cell function. Several machine learning methods have been developed for computational prediction of primary protein sequences because wet experiments are costly and time consuming. However, two problems still exist in state-of-the-art methods. First, several proteins appear in different subcellular structures simultaneously, whereas current methods only predict one protein sequence in one subcellular structure. Second, most software tools are trained with obsolete data and the latest new databases are missed. We proposed a novel multi-label classification algorithm to solve the first problem and integrated several latest databases to improve prediction performance. Experiments proved the effectiveness of the proposed method. The present study would facilitate research on cellular proteomics.