Cargando…

c-Diadem: a constrained dual-input deep learning model to identify novel biomarkers in Alzheimer’s disease

BACKGROUND: Alzheimer’s disease (AD) is an incurable, debilitating neurodegenerative disorder. Current biomarkers for AD diagnosis require expensive neuroimaging or invasive cerebrospinal fluid sampling, thus precluding early detection. Blood-based biomarker discovery in Alzheimer’s can facilitate l...

Descripción completa

Detalles Bibliográficos
Autores principales: Jemimah, Sherlyn, AlShehhi, Aamna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10571239/
https://www.ncbi.nlm.nih.gov/pubmed/37833700
http://dx.doi.org/10.1186/s12920-023-01675-9
_version_ 1785119943794098176
author Jemimah, Sherlyn
AlShehhi, Aamna
author_facet Jemimah, Sherlyn
AlShehhi, Aamna
author_sort Jemimah, Sherlyn
collection PubMed
description BACKGROUND: Alzheimer’s disease (AD) is an incurable, debilitating neurodegenerative disorder. Current biomarkers for AD diagnosis require expensive neuroimaging or invasive cerebrospinal fluid sampling, thus precluding early detection. Blood-based biomarker discovery in Alzheimer’s can facilitate less-invasive, routine diagnostic tests to aid early intervention. Therefore, we propose “c-Diadem” (constrained dual-input Alzheimer’s disease model), a novel deep learning classifier which incorporates KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway constraints on the input genotyping data to predict disease, i.e., mild cognitive impairment (MCI)/AD or cognitively normal (CN). SHAP (SHapley Additive exPlanations) was used to explain the model and identify novel, potential blood-based genetic markers of MCI/AD. METHODS: We developed a novel constrained deep learning neural network which utilizes SNPs (single nucleotide polymorphisms) and microarray data from ADNI (Alzheimer’s Disease Neuroimaging Initiative) to predict the disease status of participants, i.e., CN or with disease (MCI/AD), and identify potential blood-based biomarkers for diagnosis and intervention. The dataset contains samples from 626 participants, of which 212 are CN (average age 74.6 ± 5.4 years) and 414 patients have MCI/AD (average age 72.7 ± 7.6 years). KEGG pathway information was used to generate constraints applied to the input tensors, thus enhancing the interpretability of the model. SHAP scores were used to identify genes which could potentially serve as biomarkers for diagnosis and targets for drug development. RESULTS: Our model’s performance, with accuracy of 69% and AUC of 70% in the test dataset, is superior to previous models. The SHAP scores show that SNPs in PRKCZ, PLCB1 and ITPR2 as well as expression of HLA-DQB1, EIF1AY, HLA-DQA1, and ZFP57 have more impact on model predictions. CONCLUSIONS: In addition to predicting MCI/AD, our model has been interrogated for potential genetic biomarkers using SHAP. From our analysis, we have identified blood-based genetic markers related to Ca(2+) ion release in affected regions of the brain, as well as depression. The findings from our study provides insights into disease mechanisms, and can facilitate innovation in less-invasive, cost-effective diagnostics. To the best of our knowledge, our model is the first to use pathway constraints in a multimodal neural network to identify potential genetic markers for AD.
format Online
Article
Text
id pubmed-10571239
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-105712392023-10-14 c-Diadem: a constrained dual-input deep learning model to identify novel biomarkers in Alzheimer’s disease Jemimah, Sherlyn AlShehhi, Aamna BMC Med Genomics Research BACKGROUND: Alzheimer’s disease (AD) is an incurable, debilitating neurodegenerative disorder. Current biomarkers for AD diagnosis require expensive neuroimaging or invasive cerebrospinal fluid sampling, thus precluding early detection. Blood-based biomarker discovery in Alzheimer’s can facilitate less-invasive, routine diagnostic tests to aid early intervention. Therefore, we propose “c-Diadem” (constrained dual-input Alzheimer’s disease model), a novel deep learning classifier which incorporates KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway constraints on the input genotyping data to predict disease, i.e., mild cognitive impairment (MCI)/AD or cognitively normal (CN). SHAP (SHapley Additive exPlanations) was used to explain the model and identify novel, potential blood-based genetic markers of MCI/AD. METHODS: We developed a novel constrained deep learning neural network which utilizes SNPs (single nucleotide polymorphisms) and microarray data from ADNI (Alzheimer’s Disease Neuroimaging Initiative) to predict the disease status of participants, i.e., CN or with disease (MCI/AD), and identify potential blood-based biomarkers for diagnosis and intervention. The dataset contains samples from 626 participants, of which 212 are CN (average age 74.6 ± 5.4 years) and 414 patients have MCI/AD (average age 72.7 ± 7.6 years). KEGG pathway information was used to generate constraints applied to the input tensors, thus enhancing the interpretability of the model. SHAP scores were used to identify genes which could potentially serve as biomarkers for diagnosis and targets for drug development. RESULTS: Our model’s performance, with accuracy of 69% and AUC of 70% in the test dataset, is superior to previous models. The SHAP scores show that SNPs in PRKCZ, PLCB1 and ITPR2 as well as expression of HLA-DQB1, EIF1AY, HLA-DQA1, and ZFP57 have more impact on model predictions. CONCLUSIONS: In addition to predicting MCI/AD, our model has been interrogated for potential genetic biomarkers using SHAP. From our analysis, we have identified blood-based genetic markers related to Ca(2+) ion release in affected regions of the brain, as well as depression. The findings from our study provides insights into disease mechanisms, and can facilitate innovation in less-invasive, cost-effective diagnostics. To the best of our knowledge, our model is the first to use pathway constraints in a multimodal neural network to identify potential genetic markers for AD. BioMed Central 2023-10-13 /pmc/articles/PMC10571239/ /pubmed/37833700 http://dx.doi.org/10.1186/s12920-023-01675-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Jemimah, Sherlyn
AlShehhi, Aamna
c-Diadem: a constrained dual-input deep learning model to identify novel biomarkers in Alzheimer’s disease
title c-Diadem: a constrained dual-input deep learning model to identify novel biomarkers in Alzheimer’s disease
title_full c-Diadem: a constrained dual-input deep learning model to identify novel biomarkers in Alzheimer’s disease
title_fullStr c-Diadem: a constrained dual-input deep learning model to identify novel biomarkers in Alzheimer’s disease
title_full_unstemmed c-Diadem: a constrained dual-input deep learning model to identify novel biomarkers in Alzheimer’s disease
title_short c-Diadem: a constrained dual-input deep learning model to identify novel biomarkers in Alzheimer’s disease
title_sort c-diadem: a constrained dual-input deep learning model to identify novel biomarkers in alzheimer’s disease
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10571239/
https://www.ncbi.nlm.nih.gov/pubmed/37833700
http://dx.doi.org/10.1186/s12920-023-01675-9
work_keys_str_mv AT jemimahsherlyn cdiademaconstraineddualinputdeeplearningmodeltoidentifynovelbiomarkersinalzheimersdisease
AT alshehhiaamna cdiademaconstraineddualinputdeeplearningmodeltoidentifynovelbiomarkersinalzheimersdisease
AT cdiademaconstraineddualinputdeeplearningmodeltoidentifynovelbiomarkersinalzheimersdisease