Cargando…

Dementia prediction in the general population using clinically accessible variables: a proof-of-concept study using machine learning. The AGES-Reykjavik study

BACKGROUND: Early identification of dementia is crucial for prompt intervention for high-risk individuals in the general population. External validation studies on prognostic models for dementia have highlighted the need for updated models. The use of machine learning in dementia prediction is in it...

Descripción completa

Detalles Bibliográficos
Autores principales: Twait, Emma L., Andaur Navarro, Constanza L., Gudnason, Vilmunur, Hu, Yi-Han, Launer, Lenore J., Geerlings, Mirjam I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10463542/
https://www.ncbi.nlm.nih.gov/pubmed/37641038
http://dx.doi.org/10.1186/s12911-023-02244-x
_version_ 1785098257156800512
author Twait, Emma L.
Andaur Navarro, Constanza L.
Gudnason, Vilmunur
Hu, Yi-Han
Launer, Lenore J.
Geerlings, Mirjam I.
author_facet Twait, Emma L.
Andaur Navarro, Constanza L.
Gudnason, Vilmunur
Hu, Yi-Han
Launer, Lenore J.
Geerlings, Mirjam I.
author_sort Twait, Emma L.
collection PubMed
description BACKGROUND: Early identification of dementia is crucial for prompt intervention for high-risk individuals in the general population. External validation studies on prognostic models for dementia have highlighted the need for updated models. The use of machine learning in dementia prediction is in its infancy and may improve predictive performance. The current study aimed to explore the difference in performance of machine learning algorithms compared to traditional statistical techniques, such as logistic and Cox regression, for prediction of all-cause dementia. Our secondary aim was to assess the feasibility of only using clinically accessible predictors rather than MRI predictors. METHODS: Data are from 4,793 participants in the population-based AGES-Reykjavik Study without dementia or mild cognitive impairment at baseline (mean age: 76 years, % female: 59%). Cognitive, biometric, and MRI assessments (total: 59 variables) were collected at baseline, with follow-up of incident dementia diagnoses for a maximum of 12 years. Machine learning algorithms included elastic net regression, random forest, support vector machine, and elastic net Cox regression. Traditional statistical methods for comparison were logistic and Cox regression. Model 1 was fit using all variables and model 2 was after feature selection using the Boruta package. A third model explored performance when leaving out neuroimaging markers (clinically accessible model). Ten-fold cross-validation, repeated ten times, was implemented during training. Upsampling was used to account for imbalanced data. Tuning parameters were optimized for recalibration automatically using the caret package in R. RESULTS: 19% of participants developed all-cause dementia. Machine learning algorithms were comparable in performance to logistic regression in all three models. However, a slight added performance was observed in the elastic net Cox regression in the third model (c = 0.78, 95% CI: 0.78–0.78) compared to the traditional Cox regression (c = 0.75, 95% CI: 0.74–0.77). CONCLUSIONS: Supervised machine learning only showed added benefit when using survival techniques. Removing MRI markers did not significantly worsen our model’s performance. Further, we presented the use of a nomogram using machine learning methods, showing transportability for the use of machine learning models in clinical practice. External validation is needed to assess the use of this model in other populations. Identifying high-risk individuals will amplify prevention efforts and selection for clinical trials. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02244-x.
format Online
Article
Text
id pubmed-10463542
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-104635422023-08-30 Dementia prediction in the general population using clinically accessible variables: a proof-of-concept study using machine learning. The AGES-Reykjavik study Twait, Emma L. Andaur Navarro, Constanza L. Gudnason, Vilmunur Hu, Yi-Han Launer, Lenore J. Geerlings, Mirjam I. BMC Med Inform Decis Mak Research BACKGROUND: Early identification of dementia is crucial for prompt intervention for high-risk individuals in the general population. External validation studies on prognostic models for dementia have highlighted the need for updated models. The use of machine learning in dementia prediction is in its infancy and may improve predictive performance. The current study aimed to explore the difference in performance of machine learning algorithms compared to traditional statistical techniques, such as logistic and Cox regression, for prediction of all-cause dementia. Our secondary aim was to assess the feasibility of only using clinically accessible predictors rather than MRI predictors. METHODS: Data are from 4,793 participants in the population-based AGES-Reykjavik Study without dementia or mild cognitive impairment at baseline (mean age: 76 years, % female: 59%). Cognitive, biometric, and MRI assessments (total: 59 variables) were collected at baseline, with follow-up of incident dementia diagnoses for a maximum of 12 years. Machine learning algorithms included elastic net regression, random forest, support vector machine, and elastic net Cox regression. Traditional statistical methods for comparison were logistic and Cox regression. Model 1 was fit using all variables and model 2 was after feature selection using the Boruta package. A third model explored performance when leaving out neuroimaging markers (clinically accessible model). Ten-fold cross-validation, repeated ten times, was implemented during training. Upsampling was used to account for imbalanced data. Tuning parameters were optimized for recalibration automatically using the caret package in R. RESULTS: 19% of participants developed all-cause dementia. Machine learning algorithms were comparable in performance to logistic regression in all three models. However, a slight added performance was observed in the elastic net Cox regression in the third model (c = 0.78, 95% CI: 0.78–0.78) compared to the traditional Cox regression (c = 0.75, 95% CI: 0.74–0.77). CONCLUSIONS: Supervised machine learning only showed added benefit when using survival techniques. Removing MRI markers did not significantly worsen our model’s performance. Further, we presented the use of a nomogram using machine learning methods, showing transportability for the use of machine learning models in clinical practice. External validation is needed to assess the use of this model in other populations. Identifying high-risk individuals will amplify prevention efforts and selection for clinical trials. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02244-x. BioMed Central 2023-08-28 /pmc/articles/PMC10463542/ /pubmed/37641038 http://dx.doi.org/10.1186/s12911-023-02244-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Twait, Emma L.
Andaur Navarro, Constanza L.
Gudnason, Vilmunur
Hu, Yi-Han
Launer, Lenore J.
Geerlings, Mirjam I.
Dementia prediction in the general population using clinically accessible variables: a proof-of-concept study using machine learning. The AGES-Reykjavik study
title Dementia prediction in the general population using clinically accessible variables: a proof-of-concept study using machine learning. The AGES-Reykjavik study
title_full Dementia prediction in the general population using clinically accessible variables: a proof-of-concept study using machine learning. The AGES-Reykjavik study
title_fullStr Dementia prediction in the general population using clinically accessible variables: a proof-of-concept study using machine learning. The AGES-Reykjavik study
title_full_unstemmed Dementia prediction in the general population using clinically accessible variables: a proof-of-concept study using machine learning. The AGES-Reykjavik study
title_short Dementia prediction in the general population using clinically accessible variables: a proof-of-concept study using machine learning. The AGES-Reykjavik study
title_sort dementia prediction in the general population using clinically accessible variables: a proof-of-concept study using machine learning. the ages-reykjavik study
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10463542/
https://www.ncbi.nlm.nih.gov/pubmed/37641038
http://dx.doi.org/10.1186/s12911-023-02244-x
work_keys_str_mv AT twaitemmal dementiapredictioninthegeneralpopulationusingclinicallyaccessiblevariablesaproofofconceptstudyusingmachinelearningtheagesreykjavikstudy
AT andaurnavarroconstanzal dementiapredictioninthegeneralpopulationusingclinicallyaccessiblevariablesaproofofconceptstudyusingmachinelearningtheagesreykjavikstudy
AT gudnasonvilmunur dementiapredictioninthegeneralpopulationusingclinicallyaccessiblevariablesaproofofconceptstudyusingmachinelearningtheagesreykjavikstudy
AT huyihan dementiapredictioninthegeneralpopulationusingclinicallyaccessiblevariablesaproofofconceptstudyusingmachinelearningtheagesreykjavikstudy
AT launerlenorej dementiapredictioninthegeneralpopulationusingclinicallyaccessiblevariablesaproofofconceptstudyusingmachinelearningtheagesreykjavikstudy
AT geerlingsmirjami dementiapredictioninthegeneralpopulationusingclinicallyaccessiblevariablesaproofofconceptstudyusingmachinelearningtheagesreykjavikstudy