Algorithmic Fairness of Machine Learning Models for Alzheimer Disease Progression

Bibliographic Details
Main Authors: Yuan, Chenxi; Linn, Kristin A.; Hubbard, Rebecca A.
Format: Online Article Text
Language: English
Published: American Medical Association 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10630899/
https://www.ncbi.nlm.nih.gov/pubmed/37934495
http://dx.doi.org/10.1001/jamanetworkopen.2023.42203
collection PubMed
description IMPORTANCE: Predictive models using machine learning techniques have potential to improve early detection and management of Alzheimer disease (AD). However, these models potentially have biases and may perpetuate or exacerbate existing disparities.
OBJECTIVE: To characterize the algorithmic fairness of longitudinal prediction models for AD progression.
DESIGN, SETTING, AND PARTICIPANTS: This prognostic study investigated the algorithmic fairness of logistic regression, support vector machines, and recurrent neural networks for predicting progression to mild cognitive impairment (MCI) and AD using data from participants in the Alzheimer Disease Neuroimaging Initiative evaluated at 57 sites in the US and Canada. Participants aged 54 to 91 years who contributed data on at least 2 visits between September 2005 and May 2017 were included. Data were analyzed in October 2022.
EXPOSURES: Fairness was quantified across sex, ethnicity, and race groups. Neuropsychological test scores, anatomical features from T1 magnetic resonance imaging, measures extracted from positron emission tomography, and cerebrospinal fluid biomarkers were included as predictors.
MAIN OUTCOMES AND MEASURES: Outcome measures quantified fairness of prediction models (logistic regression [LR], support vector machine [SVM], and recurrent neural network [RNN] models), including equal opportunity, equalized odds, and demographic parity. Specifically, if the model exhibited equal sensitivity for all groups, it aligned with the principle of equal opportunity, indicating fairness in predictive performance.
RESULTS: A total of 1730 participants in the cohort (mean [SD] age, 73.81 [6.92] years; 776 females [44.9%]; 69 Hispanic [4.0%] and 1661 non-Hispanic [96.0%]; 29 Asian [1.7%], 77 Black [4.5%], 1599 White [92.4%], and 25 other race [1.4%]) were included. Sensitivity for predicting progression to MCI and AD was lower for Hispanic participants compared with non-Hispanic participants; the difference (SD) in true positive rate ranged from 20.9% (5.5%) for the RNN model to 27.8% (9.8%) for the SVM model in MCI and 24.1% (5.4%) for the RNN model to 48.2% (17.3%) for the LR model in AD. Sensitivity was similarly lower for Black and Asian participants compared with non-Hispanic White participants; for example, the difference (SD) in AD true positive rate was 14.5% (51.6%) in the LR model, 12.3% (35.1%) in the SVM model, and 28.4% (16.8%) in the RNN model for Black vs White participants, and the difference (SD) in MCI true positive rate was 25.6% (13.1%) in the LR model, 24.3% (13.1%) in the SVM model, and 6.8% (18.7%) in the RNN model for Asian vs White participants. Models generally satisfied metrics of fairness with respect to sex, with no significant differences by group, except for cognitively normal (CN)–MCI and MCI-AD transitions (eg, an absolute increase [SD] in the true positive rate of CN-MCI transitions of 10.3% [27.8%] for the LR model).
CONCLUSIONS AND RELEVANCE: In this study, models were accurate in aggregate but failed to satisfy fairness metrics. These findings suggest that fairness should be considered in the development and use of machine learning models for AD progression.
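The three fairness criteria named in the abstract can be illustrated with a small sketch. This is not the authors' code: it assumes a simplified binary outcome (progressed vs not progressed), whereas the study models transitions among CN, MCI, and AD. The function names `group_rates` and `fairness_gaps`, the group labels "A" and "B", and the toy data are all hypothetical. Summarizing equalized odds as the larger of the absolute true positive rate and false positive rate gaps is one common convention and is likewise an assumption here.

```python
from collections import defaultdict


def group_rates(y_true, y_pred, groups):
    """Per-group true positive rate, false positive rate, and
    positive-prediction rate for binary labels (1 = progressed)."""
    counts = defaultdict(lambda: {"tp": 0, "fn": 0, "fp": 0, "tn": 0})
    for t, p, g in zip(y_true, y_pred, groups):
        if t:
            counts[g]["tp" if p else "fn"] += 1
        else:
            counts[g]["fp" if p else "tn"] += 1
    rates = {}
    for g, c in counts.items():
        pos = c["tp"] + c["fn"]  # participants who truly progressed
        neg = c["fp"] + c["tn"]  # participants who did not
        rates[g] = {
            "tpr": c["tp"] / pos if pos else float("nan"),
            "fpr": c["fp"] / neg if neg else float("nan"),
            "ppr": (c["tp"] + c["fp"]) / (pos + neg),
        }
    return rates


def fairness_gaps(rates, reference, comparison):
    """Gaps between a reference group and a comparison group.

    equal_opportunity:  difference in sensitivity (TPR)
    equalized_odds:     larger of the absolute TPR and FPR gaps (assumed summary)
    demographic_parity: difference in the rate of positive predictions
    """
    ref, cmp_ = rates[reference], rates[comparison]
    tpr_gap = ref["tpr"] - cmp_["tpr"]
    fpr_gap = ref["fpr"] - cmp_["fpr"]
    return {
        "equal_opportunity": tpr_gap,
        "equalized_odds": max(abs(tpr_gap), abs(fpr_gap)),
        "demographic_parity": ref["ppr"] - cmp_["ppr"],
    }


# Toy data (hypothetical): 8 participants in each of two groups.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0, 1, 1, 0, 0, 0, 0, 0, 0]
groups = ["A"] * 8 + ["B"] * 8

rates = group_rates(y_true, y_pred, groups)
gaps = fairness_gaps(rates, reference="A", comparison="B")
print(gaps)
```

A gap of zero on a given metric would mean the model satisfies that criterion for the two groups being compared. With very small subgroups, such as the 69 Hispanic or 29 Asian participants in the cohort, these rate estimates are noisy, which is consistent with the large SDs reported in the results.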
id pubmed-10630899
institution National Center for Biotechnology Information
record_format MEDLINE/PubMed
spelling pubmed-10630899 2023-11-15. JAMA Netw Open. Original Investigation. American Medical Association 2023-11-07. Text en. Copyright 2023 Yuan C et al. JAMA Network Open. This is an open access article distributed under the terms of the CC-BY-NC-ND License (https://creativecommons.org/licenses/by-nc-nd/4.0/).
topic Original Investigation