Cargando…

Machine learning for predicting neurodegenerative diseases in the general older population: a cohort study

BACKGROUND: In the older general population, neurodegenerative diseases (NDs) are associated with increased disability, decreased physical and cognitive function. Detecting risk factors can help implement prevention measures. Using deep neural networks (DNNs), a machine-learning algorithm could be a...

Descripción completa

Detalles Bibliográficos
Autores principales: Aguayo, Gloria A., Zhang, Lu, Vaillant, Michel, Ngari, Moses, Perquin, Magali, Moran, Valerie, Huiart, Laetitia, Krüger, Rejko, Azuaje, Francisco, Ferdynus, Cyril, Fagherazzi, Guy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9832793/
https://www.ncbi.nlm.nih.gov/pubmed/36631766
http://dx.doi.org/10.1186/s12874-023-01837-4
_version_ 1784868127198150656
author Aguayo, Gloria A.
Zhang, Lu
Vaillant, Michel
Ngari, Moses
Perquin, Magali
Moran, Valerie
Huiart, Laetitia
Krüger, Rejko
Azuaje, Francisco
Ferdynus, Cyril
Fagherazzi, Guy
author_facet Aguayo, Gloria A.
Zhang, Lu
Vaillant, Michel
Ngari, Moses
Perquin, Magali
Moran, Valerie
Huiart, Laetitia
Krüger, Rejko
Azuaje, Francisco
Ferdynus, Cyril
Fagherazzi, Guy
author_sort Aguayo, Gloria A.
collection PubMed
description BACKGROUND: In the older general population, neurodegenerative diseases (NDs) are associated with increased disability, decreased physical and cognitive function. Detecting risk factors can help implement prevention measures. Using deep neural networks (DNNs), a machine-learning algorithm could be an alternative to Cox regression in tabular datasets with many predictive features. We aimed to compare the performance of different types of DNNs with regularized Cox proportional hazards models to predict NDs in the older general population. METHODS: We performed a longitudinal analysis with participants of the English Longitudinal Study of Ageing. We included men and women with no NDs at baseline, aged 60 years and older, assessed every 2 years from 2004 to 2005 (wave2) to 2016–2017 (wave 8). The features were a set of 91 epidemiological and clinical baseline variables. The outcome was new events of Parkinson’s, Alzheimer or dementia. After applying multiple imputations, we trained three DNN algorithms: Feedforward, TabTransformer, and Dense Convolutional (Densenet). In addition, we trained two algorithms based on Cox models: Elastic Net regularization (CoxEn) and selected features (CoxSf). RESULTS: 5433 participants were included in wave 2. During follow-up, 12.7% participants developed NDs. Although the five models predicted NDs events, the discriminative ability was superior using TabTransformer (Uno’s C-statistic (coefficient (95% confidence intervals)) 0.757 (0.702, 0.805). TabTransformer showed superior time-dependent balanced accuracy (0.834 (0.779, 0.889)) and specificity (0.855 (0.0.773, 0.909)) than the other models. With the CoxSf (hazard ratio (95% confidence intervals)), age (10.0 (6.9, 14.7)), poor hearing (1.3 (1.1, 1.5)) and weight loss 1.3 (1.1, 1.6)) were associated with a higher DNN risk. In contrast, executive function (0.3 (0.2, 0.6)), memory (0, 0, 0.1)), increased gait speed (0.2, (0.1, 0.4)), vigorous physical activity (0.7, 0.6, 0.9)) and higher BMI (0.4 (0.2, 0.8)) were associated with a lower DNN risk. CONCLUSION: TabTransformer is promising for prediction of NDs with heterogeneous tabular datasets with numerous features. Moreover, it can handle censored data. However, Cox models perform well and are easier to interpret than DNNs. Therefore, they are still a good choice for NDs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-023-01837-4.
format Online
Article
Text
id pubmed-9832793
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-98327932023-01-12 Machine learning for predicting neurodegenerative diseases in the general older population: a cohort study Aguayo, Gloria A. Zhang, Lu Vaillant, Michel Ngari, Moses Perquin, Magali Moran, Valerie Huiart, Laetitia Krüger, Rejko Azuaje, Francisco Ferdynus, Cyril Fagherazzi, Guy BMC Med Res Methodol Research BACKGROUND: In the older general population, neurodegenerative diseases (NDs) are associated with increased disability, decreased physical and cognitive function. Detecting risk factors can help implement prevention measures. Using deep neural networks (DNNs), a machine-learning algorithm could be an alternative to Cox regression in tabular datasets with many predictive features. We aimed to compare the performance of different types of DNNs with regularized Cox proportional hazards models to predict NDs in the older general population. METHODS: We performed a longitudinal analysis with participants of the English Longitudinal Study of Ageing. We included men and women with no NDs at baseline, aged 60 years and older, assessed every 2 years from 2004 to 2005 (wave2) to 2016–2017 (wave 8). The features were a set of 91 epidemiological and clinical baseline variables. The outcome was new events of Parkinson’s, Alzheimer or dementia. After applying multiple imputations, we trained three DNN algorithms: Feedforward, TabTransformer, and Dense Convolutional (Densenet). In addition, we trained two algorithms based on Cox models: Elastic Net regularization (CoxEn) and selected features (CoxSf). RESULTS: 5433 participants were included in wave 2. During follow-up, 12.7% participants developed NDs. Although the five models predicted NDs events, the discriminative ability was superior using TabTransformer (Uno’s C-statistic (coefficient (95% confidence intervals)) 0.757 (0.702, 0.805). TabTransformer showed superior time-dependent balanced accuracy (0.834 (0.779, 0.889)) and specificity (0.855 (0.0.773, 0.909)) than the other models. With the CoxSf (hazard ratio (95% confidence intervals)), age (10.0 (6.9, 14.7)), poor hearing (1.3 (1.1, 1.5)) and weight loss 1.3 (1.1, 1.6)) were associated with a higher DNN risk. In contrast, executive function (0.3 (0.2, 0.6)), memory (0, 0, 0.1)), increased gait speed (0.2, (0.1, 0.4)), vigorous physical activity (0.7, 0.6, 0.9)) and higher BMI (0.4 (0.2, 0.8)) were associated with a lower DNN risk. CONCLUSION: TabTransformer is promising for prediction of NDs with heterogeneous tabular datasets with numerous features. Moreover, it can handle censored data. However, Cox models perform well and are easier to interpret than DNNs. Therefore, they are still a good choice for NDs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-023-01837-4. BioMed Central 2023-01-11 /pmc/articles/PMC9832793/ /pubmed/36631766 http://dx.doi.org/10.1186/s12874-023-01837-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Aguayo, Gloria A.
Zhang, Lu
Vaillant, Michel
Ngari, Moses
Perquin, Magali
Moran, Valerie
Huiart, Laetitia
Krüger, Rejko
Azuaje, Francisco
Ferdynus, Cyril
Fagherazzi, Guy
Machine learning for predicting neurodegenerative diseases in the general older population: a cohort study
title Machine learning for predicting neurodegenerative diseases in the general older population: a cohort study
title_full Machine learning for predicting neurodegenerative diseases in the general older population: a cohort study
title_fullStr Machine learning for predicting neurodegenerative diseases in the general older population: a cohort study
title_full_unstemmed Machine learning for predicting neurodegenerative diseases in the general older population: a cohort study
title_short Machine learning for predicting neurodegenerative diseases in the general older population: a cohort study
title_sort machine learning for predicting neurodegenerative diseases in the general older population: a cohort study
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9832793/
https://www.ncbi.nlm.nih.gov/pubmed/36631766
http://dx.doi.org/10.1186/s12874-023-01837-4
work_keys_str_mv AT aguayogloriaa machinelearningforpredictingneurodegenerativediseasesinthegeneralolderpopulationacohortstudy
AT zhanglu machinelearningforpredictingneurodegenerativediseasesinthegeneralolderpopulationacohortstudy
AT vaillantmichel machinelearningforpredictingneurodegenerativediseasesinthegeneralolderpopulationacohortstudy
AT ngarimoses machinelearningforpredictingneurodegenerativediseasesinthegeneralolderpopulationacohortstudy
AT perquinmagali machinelearningforpredictingneurodegenerativediseasesinthegeneralolderpopulationacohortstudy
AT moranvalerie machinelearningforpredictingneurodegenerativediseasesinthegeneralolderpopulationacohortstudy
AT huiartlaetitia machinelearningforpredictingneurodegenerativediseasesinthegeneralolderpopulationacohortstudy
AT krugerrejko machinelearningforpredictingneurodegenerativediseasesinthegeneralolderpopulationacohortstudy
AT azuajefrancisco machinelearningforpredictingneurodegenerativediseasesinthegeneralolderpopulationacohortstudy
AT ferdynuscyril machinelearningforpredictingneurodegenerativediseasesinthegeneralolderpopulationacohortstudy
AT fagherazziguy machinelearningforpredictingneurodegenerativediseasesinthegeneralolderpopulationacohortstudy