Cargando…

A Novel Protein Subcellular Localization Method With CNN-XGBoost Model for Alzheimer's Disease

The disorder distribution of protein in the compartment or organelle leads to many human diseases, including neurodegenerative diseases such as Alzheimer's disease. The prediction of protein subcellular localization play important roles in the understanding of the mechanism of protein function,...

Descripción completa

Detalles Bibliográficos
Autores principales: Pang, Long, Wang, Junjie, Zhao, Lingling, Wang, Chunyu, Zhan, Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6345701/
https://www.ncbi.nlm.nih.gov/pubmed/30713552
http://dx.doi.org/10.3389/fgene.2018.00751
_version_ 1783389604306485248
author Pang, Long
Wang, Junjie
Zhao, Lingling
Wang, Chunyu
Zhan, Hui
author_facet Pang, Long
Wang, Junjie
Zhao, Lingling
Wang, Chunyu
Zhan, Hui
author_sort Pang, Long
collection PubMed
description The disorder distribution of protein in the compartment or organelle leads to many human diseases, including neurodegenerative diseases such as Alzheimer's disease. The prediction of protein subcellular localization play important roles in the understanding of the mechanism of protein function, pathogenes and disease therapy. This paper proposes a novel subcellular localization method by integrating the Convolutional Neural Network (CNN) and eXtreme Gradient Boosting (XGBoost), where CNN acts as a feature extractor to automatically obtain features from the original sequence information and a XGBoost classifier as a recognizer to identify the protein subcellular localization based on the output of the CNN. Experiments are implemented on three protein datasets. The results prove that the CNN-XGBoost method performs better than the general protein subcellular localization methods.
format Online
Article
Text
id pubmed-6345701
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-63457012019-02-01 A Novel Protein Subcellular Localization Method With CNN-XGBoost Model for Alzheimer's Disease Pang, Long Wang, Junjie Zhao, Lingling Wang, Chunyu Zhan, Hui Front Genet Genetics The disorder distribution of protein in the compartment or organelle leads to many human diseases, including neurodegenerative diseases such as Alzheimer's disease. The prediction of protein subcellular localization play important roles in the understanding of the mechanism of protein function, pathogenes and disease therapy. This paper proposes a novel subcellular localization method by integrating the Convolutional Neural Network (CNN) and eXtreme Gradient Boosting (XGBoost), where CNN acts as a feature extractor to automatically obtain features from the original sequence information and a XGBoost classifier as a recognizer to identify the protein subcellular localization based on the output of the CNN. Experiments are implemented on three protein datasets. The results prove that the CNN-XGBoost method performs better than the general protein subcellular localization methods. Frontiers Media S.A. 2019-01-18 /pmc/articles/PMC6345701/ /pubmed/30713552 http://dx.doi.org/10.3389/fgene.2018.00751 Text en Copyright © 2019 Pang, Wang, Zhao, Wang and Zhan. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Pang, Long
Wang, Junjie
Zhao, Lingling
Wang, Chunyu
Zhan, Hui
A Novel Protein Subcellular Localization Method With CNN-XGBoost Model for Alzheimer's Disease
title A Novel Protein Subcellular Localization Method With CNN-XGBoost Model for Alzheimer's Disease
title_full A Novel Protein Subcellular Localization Method With CNN-XGBoost Model for Alzheimer's Disease
title_fullStr A Novel Protein Subcellular Localization Method With CNN-XGBoost Model for Alzheimer's Disease
title_full_unstemmed A Novel Protein Subcellular Localization Method With CNN-XGBoost Model for Alzheimer's Disease
title_short A Novel Protein Subcellular Localization Method With CNN-XGBoost Model for Alzheimer's Disease
title_sort novel protein subcellular localization method with cnn-xgboost model for alzheimer's disease
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6345701/
https://www.ncbi.nlm.nih.gov/pubmed/30713552
http://dx.doi.org/10.3389/fgene.2018.00751
work_keys_str_mv AT panglong anovelproteinsubcellularlocalizationmethodwithcnnxgboostmodelforalzheimersdisease
AT wangjunjie anovelproteinsubcellularlocalizationmethodwithcnnxgboostmodelforalzheimersdisease
AT zhaolingling anovelproteinsubcellularlocalizationmethodwithcnnxgboostmodelforalzheimersdisease
AT wangchunyu anovelproteinsubcellularlocalizationmethodwithcnnxgboostmodelforalzheimersdisease
AT zhanhui anovelproteinsubcellularlocalizationmethodwithcnnxgboostmodelforalzheimersdisease
AT panglong novelproteinsubcellularlocalizationmethodwithcnnxgboostmodelforalzheimersdisease
AT wangjunjie novelproteinsubcellularlocalizationmethodwithcnnxgboostmodelforalzheimersdisease
AT zhaolingling novelproteinsubcellularlocalizationmethodwithcnnxgboostmodelforalzheimersdisease
AT wangchunyu novelproteinsubcellularlocalizationmethodwithcnnxgboostmodelforalzheimersdisease
AT zhanhui novelproteinsubcellularlocalizationmethodwithcnnxgboostmodelforalzheimersdisease