Cargando…

Identification of high likelihood of dementia in population-based surveys using unsupervised clustering: a longitudinal analysis

BACKGROUND: Dementia is defined as a cognitive decline that affects functional status. Longitudinal ageing surveys often lack a clinical diagnosis of dementia though measure cognition and daily function over time. We used unsupervised machine learning and longitudinal data to identify transition to...

Descripción completa

Detalles Bibliográficos
Autores principales: Gharbi-Meliani, Amin, Husson, François, Vandendriessche, Henri, Bayen, Eleonore, Yaffe, Kristine, Bachoud-Lévi, Anne-Catherine, Cleret de Langavant, Laurent
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10688099/
https://www.ncbi.nlm.nih.gov/pubmed/38031083
http://dx.doi.org/10.1186/s13195-023-01357-9
_version_ 1785152111711879168
author Gharbi-Meliani, Amin
Husson, François
Vandendriessche, Henri
Bayen, Eleonore
Yaffe, Kristine
Bachoud-Lévi, Anne-Catherine
Cleret de Langavant, Laurent
author_facet Gharbi-Meliani, Amin
Husson, François
Vandendriessche, Henri
Bayen, Eleonore
Yaffe, Kristine
Bachoud-Lévi, Anne-Catherine
Cleret de Langavant, Laurent
author_sort Gharbi-Meliani, Amin
collection PubMed
description BACKGROUND: Dementia is defined as a cognitive decline that affects functional status. Longitudinal ageing surveys often lack a clinical diagnosis of dementia though measure cognition and daily function over time. We used unsupervised machine learning and longitudinal data to identify transition to probable dementia. METHODS: Multiple Factor Analysis was applied to longitudinal function and cognitive data of 15,278 baseline participants (aged 50 years and more) from the Survey of Health, Ageing, and Retirement in Europe (SHARE) (waves 1, 2 and 4–7, between 2004 and 2017). Hierarchical Clustering on Principal Components discriminated three clusters at each wave. We estimated probable or “Likely Dementia” prevalence by sex and age, and assessed whether dementia risk factors increased the risk of being assigned probable dementia status using multistate models. Next, we compared the “Likely Dementia” cluster with self-reported dementia status and replicated our findings in the English Longitudinal Study of Ageing (ELSA) cohort (waves 1–9, between 2002 and 2019, 7840 participants at baseline). RESULTS: Our algorithm identified a higher number of probable dementia cases compared with self-reported cases and showed good discriminative power across all waves (AUC ranged from 0.754 [0.722–0.787] to 0.830 [0.800–0.861]). “Likely Dementia” status was more prevalent in older people, displayed a 2:1 female/male ratio, and was associated with nine factors that increased risk of transition to dementia: low education, hearing loss, hypertension, drinking, smoking, depression, social isolation, physical inactivity, diabetes, and obesity. Results were replicated in ELSA cohort with good accuracy. CONCLUSIONS: Machine learning clustering can be used to study dementia determinants and outcomes in longitudinal population ageing surveys in which dementia clinical diagnosis is lacking. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13195-023-01357-9.
format Online
Article
Text
id pubmed-10688099
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-106880992023-11-30 Identification of high likelihood of dementia in population-based surveys using unsupervised clustering: a longitudinal analysis Gharbi-Meliani, Amin Husson, François Vandendriessche, Henri Bayen, Eleonore Yaffe, Kristine Bachoud-Lévi, Anne-Catherine Cleret de Langavant, Laurent Alzheimers Res Ther Research BACKGROUND: Dementia is defined as a cognitive decline that affects functional status. Longitudinal ageing surveys often lack a clinical diagnosis of dementia though measure cognition and daily function over time. We used unsupervised machine learning and longitudinal data to identify transition to probable dementia. METHODS: Multiple Factor Analysis was applied to longitudinal function and cognitive data of 15,278 baseline participants (aged 50 years and more) from the Survey of Health, Ageing, and Retirement in Europe (SHARE) (waves 1, 2 and 4–7, between 2004 and 2017). Hierarchical Clustering on Principal Components discriminated three clusters at each wave. We estimated probable or “Likely Dementia” prevalence by sex and age, and assessed whether dementia risk factors increased the risk of being assigned probable dementia status using multistate models. Next, we compared the “Likely Dementia” cluster with self-reported dementia status and replicated our findings in the English Longitudinal Study of Ageing (ELSA) cohort (waves 1–9, between 2002 and 2019, 7840 participants at baseline). RESULTS: Our algorithm identified a higher number of probable dementia cases compared with self-reported cases and showed good discriminative power across all waves (AUC ranged from 0.754 [0.722–0.787] to 0.830 [0.800–0.861]). “Likely Dementia” status was more prevalent in older people, displayed a 2:1 female/male ratio, and was associated with nine factors that increased risk of transition to dementia: low education, hearing loss, hypertension, drinking, smoking, depression, social isolation, physical inactivity, diabetes, and obesity. Results were replicated in ELSA cohort with good accuracy. CONCLUSIONS: Machine learning clustering can be used to study dementia determinants and outcomes in longitudinal population ageing surveys in which dementia clinical diagnosis is lacking. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13195-023-01357-9. BioMed Central 2023-11-29 /pmc/articles/PMC10688099/ /pubmed/38031083 http://dx.doi.org/10.1186/s13195-023-01357-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Gharbi-Meliani, Amin
Husson, François
Vandendriessche, Henri
Bayen, Eleonore
Yaffe, Kristine
Bachoud-Lévi, Anne-Catherine
Cleret de Langavant, Laurent
Identification of high likelihood of dementia in population-based surveys using unsupervised clustering: a longitudinal analysis
title Identification of high likelihood of dementia in population-based surveys using unsupervised clustering: a longitudinal analysis
title_full Identification of high likelihood of dementia in population-based surveys using unsupervised clustering: a longitudinal analysis
title_fullStr Identification of high likelihood of dementia in population-based surveys using unsupervised clustering: a longitudinal analysis
title_full_unstemmed Identification of high likelihood of dementia in population-based surveys using unsupervised clustering: a longitudinal analysis
title_short Identification of high likelihood of dementia in population-based surveys using unsupervised clustering: a longitudinal analysis
title_sort identification of high likelihood of dementia in population-based surveys using unsupervised clustering: a longitudinal analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10688099/
https://www.ncbi.nlm.nih.gov/pubmed/38031083
http://dx.doi.org/10.1186/s13195-023-01357-9
work_keys_str_mv AT gharbimelianiamin identificationofhighlikelihoodofdementiainpopulationbasedsurveysusingunsupervisedclusteringalongitudinalanalysis
AT hussonfrancois identificationofhighlikelihoodofdementiainpopulationbasedsurveysusingunsupervisedclusteringalongitudinalanalysis
AT vandendriesschehenri identificationofhighlikelihoodofdementiainpopulationbasedsurveysusingunsupervisedclusteringalongitudinalanalysis
AT bayeneleonore identificationofhighlikelihoodofdementiainpopulationbasedsurveysusingunsupervisedclusteringalongitudinalanalysis
AT yaffekristine identificationofhighlikelihoodofdementiainpopulationbasedsurveysusingunsupervisedclusteringalongitudinalanalysis
AT bachoudleviannecatherine identificationofhighlikelihoodofdementiainpopulationbasedsurveysusingunsupervisedclusteringalongitudinalanalysis
AT cleretdelangavantlaurent identificationofhighlikelihoodofdementiainpopulationbasedsurveysusingunsupervisedclusteringalongitudinalanalysis