Cargando…

Identifying Alzheimer’s disease-related proteins by LRRGD

BACKGROUND: Alzheimer’s disease (AD) imposes a heavy burden on society and every family. Therefore, diagnosing AD in advance and discovering new drug targets are crucial, while these could be achieved by identifying AD-related proteins. The time-consuming and money-costing biological experiment make...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Tianyi, Hu, Yang, Zang, Tianyi, Cheng, Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6876080/
https://www.ncbi.nlm.nih.gov/pubmed/31760934
http://dx.doi.org/10.1186/s12859-019-3124-7
_version_ 1783473150661492736
author Zhao, Tianyi
Hu, Yang
Zang, Tianyi
Cheng, Liang
author_facet Zhao, Tianyi
Hu, Yang
Zang, Tianyi
Cheng, Liang
author_sort Zhao, Tianyi
collection PubMed
description BACKGROUND: Alzheimer’s disease (AD) imposes a heavy burden on society and every family. Therefore, diagnosing AD in advance and discovering new drug targets are crucial, while these could be achieved by identifying AD-related proteins. The time-consuming and money-costing biological experiment makes researchers turn to develop more advanced algorithms to identify AD-related proteins. RESULTS: Firstly, we proposed a hypothesis “similar diseases share similar related proteins”. Therefore, five similarity calculation methods are introduced to find out others diseases which are similar to AD. Then, these diseases’ related proteins could be obtained by public data set. Finally, these proteins are features of each disease and could be used to map their similarity to AD. We developed a novel method ‘LRRGD’ which combines Logistic Regression (LR) and Gradient Descent (GD) and borrows the idea of Random Forest (RF). LR is introduced to regress features to similarities. Borrowing the idea of RF, hundreds of LR models have been built by randomly selecting 40 features (proteins) each time. Here, GD is introduced to find out the optimal result. To avoid the drawback of local optimal solution, a good initial value is selected by some known AD-related proteins. Finally, 376 proteins are found to be related to AD. CONCLUSION: Three hundred eight of three hundred seventy-six proteins are the novel proteins. Three case studies are done to prove our method’s effectiveness. These 308 proteins could give researchers a basis to do biological experiments to help treatment and diagnostic AD.
format Online
Article
Text
id pubmed-6876080
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68760802019-11-29 Identifying Alzheimer’s disease-related proteins by LRRGD Zhao, Tianyi Hu, Yang Zang, Tianyi Cheng, Liang BMC Bioinformatics Research BACKGROUND: Alzheimer’s disease (AD) imposes a heavy burden on society and every family. Therefore, diagnosing AD in advance and discovering new drug targets are crucial, while these could be achieved by identifying AD-related proteins. The time-consuming and money-costing biological experiment makes researchers turn to develop more advanced algorithms to identify AD-related proteins. RESULTS: Firstly, we proposed a hypothesis “similar diseases share similar related proteins”. Therefore, five similarity calculation methods are introduced to find out others diseases which are similar to AD. Then, these diseases’ related proteins could be obtained by public data set. Finally, these proteins are features of each disease and could be used to map their similarity to AD. We developed a novel method ‘LRRGD’ which combines Logistic Regression (LR) and Gradient Descent (GD) and borrows the idea of Random Forest (RF). LR is introduced to regress features to similarities. Borrowing the idea of RF, hundreds of LR models have been built by randomly selecting 40 features (proteins) each time. Here, GD is introduced to find out the optimal result. To avoid the drawback of local optimal solution, a good initial value is selected by some known AD-related proteins. Finally, 376 proteins are found to be related to AD. CONCLUSION: Three hundred eight of three hundred seventy-six proteins are the novel proteins. Three case studies are done to prove our method’s effectiveness. These 308 proteins could give researchers a basis to do biological experiments to help treatment and diagnostic AD. BioMed Central 2019-11-25 /pmc/articles/PMC6876080/ /pubmed/31760934 http://dx.doi.org/10.1186/s12859-019-3124-7 Text en © The Author(s). 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zhao, Tianyi
Hu, Yang
Zang, Tianyi
Cheng, Liang
Identifying Alzheimer’s disease-related proteins by LRRGD
title Identifying Alzheimer’s disease-related proteins by LRRGD
title_full Identifying Alzheimer’s disease-related proteins by LRRGD
title_fullStr Identifying Alzheimer’s disease-related proteins by LRRGD
title_full_unstemmed Identifying Alzheimer’s disease-related proteins by LRRGD
title_short Identifying Alzheimer’s disease-related proteins by LRRGD
title_sort identifying alzheimer’s disease-related proteins by lrrgd
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6876080/
https://www.ncbi.nlm.nih.gov/pubmed/31760934
http://dx.doi.org/10.1186/s12859-019-3124-7
work_keys_str_mv AT zhaotianyi identifyingalzheimersdiseaserelatedproteinsbylrrgd
AT huyang identifyingalzheimersdiseaserelatedproteinsbylrrgd
AT zangtianyi identifyingalzheimersdiseaserelatedproteinsbylrrgd
AT chengliang identifyingalzheimersdiseaserelatedproteinsbylrrgd