Cargando…

Identifying key multi-modal predictors of incipient dementia in Parkinson’s disease: a machine learning analysis and Tree SHAP interpretation

BACKGROUND: Persons with Parkinson’s disease (PD) differentially progress to cognitive impairment and dementia. With a 3-year longitudinal sample of initially non-demented PD patients measured on multiple dementia risk factors, we demonstrate that machine learning classifier algorithms can be combin...

Descripción completa

Detalles Bibliográficos
Autores principales: McFall, G. Peggy, Bohn, Linzy, Gee, Myrlene, Drouin, Shannon M., Fah, Harrison, Han, Wei, Li, Liang, Camicioli, Richard, Dixon, Roger A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10347530/
https://www.ncbi.nlm.nih.gov/pubmed/37455938
http://dx.doi.org/10.3389/fnagi.2023.1124232
_version_ 1785073567555125248
author McFall, G. Peggy
Bohn, Linzy
Gee, Myrlene
Drouin, Shannon M.
Fah, Harrison
Han, Wei
Li, Liang
Camicioli, Richard
Dixon, Roger A.
author_facet McFall, G. Peggy
Bohn, Linzy
Gee, Myrlene
Drouin, Shannon M.
Fah, Harrison
Han, Wei
Li, Liang
Camicioli, Richard
Dixon, Roger A.
author_sort McFall, G. Peggy
collection PubMed
description BACKGROUND: Persons with Parkinson’s disease (PD) differentially progress to cognitive impairment and dementia. With a 3-year longitudinal sample of initially non-demented PD patients measured on multiple dementia risk factors, we demonstrate that machine learning classifier algorithms can be combined with explainable artificial intelligence methods to identify and interpret leading predictors that discriminate those who later converted to dementia from those who did not. METHOD: Participants were 48 well-characterized PD patients (M(baseline age) = 71.6; SD = 4.8; 44% female). We tested 38 multi-modal predictors from 10 domains (e.g., motor, cognitive) in a computationally competitive context to identify those that best discriminated two unobserved baseline groups, PD No Dementia (PDND), and PD Incipient Dementia (PDID). We used Random Forest (RF) classifier models for the discrimination goal and Tree SHapley Additive exPlanation (Tree SHAP) values for deep interpretation. RESULTS: An excellent RF model discriminated baseline PDID from PDND (AUC = 0.84; normalized Matthews Correlation Coefficient = 0.76). Tree SHAP showed that ten leading predictors of PDID accounted for 62.5% of the model, as well as their relative importance, direction, and magnitude (risk threshold). These predictors represented the motor (e.g., poorer gait), cognitive (e.g., slower Trail A), molecular (up-regulated metabolite panel), demographic (age), imaging (ventricular volume), and lifestyle (activities of daily living) domains. CONCLUSION: Our data-driven protocol integrated RF classifier models and Tree SHAP applications to selectively identify and interpret early dementia risk factors in a well-characterized sample of initially non-demented persons with PD. Results indicate that leading dementia predictors derive from multiple complementary risk domains.
format Online
Article
Text
id pubmed-10347530
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-103475302023-07-15 Identifying key multi-modal predictors of incipient dementia in Parkinson’s disease: a machine learning analysis and Tree SHAP interpretation McFall, G. Peggy Bohn, Linzy Gee, Myrlene Drouin, Shannon M. Fah, Harrison Han, Wei Li, Liang Camicioli, Richard Dixon, Roger A. Front Aging Neurosci Neuroscience BACKGROUND: Persons with Parkinson’s disease (PD) differentially progress to cognitive impairment and dementia. With a 3-year longitudinal sample of initially non-demented PD patients measured on multiple dementia risk factors, we demonstrate that machine learning classifier algorithms can be combined with explainable artificial intelligence methods to identify and interpret leading predictors that discriminate those who later converted to dementia from those who did not. METHOD: Participants were 48 well-characterized PD patients (M(baseline age) = 71.6; SD = 4.8; 44% female). We tested 38 multi-modal predictors from 10 domains (e.g., motor, cognitive) in a computationally competitive context to identify those that best discriminated two unobserved baseline groups, PD No Dementia (PDND), and PD Incipient Dementia (PDID). We used Random Forest (RF) classifier models for the discrimination goal and Tree SHapley Additive exPlanation (Tree SHAP) values for deep interpretation. RESULTS: An excellent RF model discriminated baseline PDID from PDND (AUC = 0.84; normalized Matthews Correlation Coefficient = 0.76). Tree SHAP showed that ten leading predictors of PDID accounted for 62.5% of the model, as well as their relative importance, direction, and magnitude (risk threshold). These predictors represented the motor (e.g., poorer gait), cognitive (e.g., slower Trail A), molecular (up-regulated metabolite panel), demographic (age), imaging (ventricular volume), and lifestyle (activities of daily living) domains. CONCLUSION: Our data-driven protocol integrated RF classifier models and Tree SHAP applications to selectively identify and interpret early dementia risk factors in a well-characterized sample of initially non-demented persons with PD. Results indicate that leading dementia predictors derive from multiple complementary risk domains. Frontiers Media S.A. 2023-06-30 /pmc/articles/PMC10347530/ /pubmed/37455938 http://dx.doi.org/10.3389/fnagi.2023.1124232 Text en Copyright © 2023 McFall, Bohn, Gee, Drouin, Fah, Han, Li, Camicioli and Dixon. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
McFall, G. Peggy
Bohn, Linzy
Gee, Myrlene
Drouin, Shannon M.
Fah, Harrison
Han, Wei
Li, Liang
Camicioli, Richard
Dixon, Roger A.
Identifying key multi-modal predictors of incipient dementia in Parkinson’s disease: a machine learning analysis and Tree SHAP interpretation
title Identifying key multi-modal predictors of incipient dementia in Parkinson’s disease: a machine learning analysis and Tree SHAP interpretation
title_full Identifying key multi-modal predictors of incipient dementia in Parkinson’s disease: a machine learning analysis and Tree SHAP interpretation
title_fullStr Identifying key multi-modal predictors of incipient dementia in Parkinson’s disease: a machine learning analysis and Tree SHAP interpretation
title_full_unstemmed Identifying key multi-modal predictors of incipient dementia in Parkinson’s disease: a machine learning analysis and Tree SHAP interpretation
title_short Identifying key multi-modal predictors of incipient dementia in Parkinson’s disease: a machine learning analysis and Tree SHAP interpretation
title_sort identifying key multi-modal predictors of incipient dementia in parkinson’s disease: a machine learning analysis and tree shap interpretation
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10347530/
https://www.ncbi.nlm.nih.gov/pubmed/37455938
http://dx.doi.org/10.3389/fnagi.2023.1124232
work_keys_str_mv AT mcfallgpeggy identifyingkeymultimodalpredictorsofincipientdementiainparkinsonsdiseaseamachinelearninganalysisandtreeshapinterpretation
AT bohnlinzy identifyingkeymultimodalpredictorsofincipientdementiainparkinsonsdiseaseamachinelearninganalysisandtreeshapinterpretation
AT geemyrlene identifyingkeymultimodalpredictorsofincipientdementiainparkinsonsdiseaseamachinelearninganalysisandtreeshapinterpretation
AT drouinshannonm identifyingkeymultimodalpredictorsofincipientdementiainparkinsonsdiseaseamachinelearninganalysisandtreeshapinterpretation
AT fahharrison identifyingkeymultimodalpredictorsofincipientdementiainparkinsonsdiseaseamachinelearninganalysisandtreeshapinterpretation
AT hanwei identifyingkeymultimodalpredictorsofincipientdementiainparkinsonsdiseaseamachinelearninganalysisandtreeshapinterpretation
AT liliang identifyingkeymultimodalpredictorsofincipientdementiainparkinsonsdiseaseamachinelearninganalysisandtreeshapinterpretation
AT camiciolirichard identifyingkeymultimodalpredictorsofincipientdementiainparkinsonsdiseaseamachinelearninganalysisandtreeshapinterpretation
AT dixonrogera identifyingkeymultimodalpredictorsofincipientdementiainparkinsonsdiseaseamachinelearninganalysisandtreeshapinterpretation