Cargando…

Characterizing subgroup performance of probabilistic phenotype algorithms within older adults: a case study for dementia, mild cognitive impairment, and Alzheimer’s and Parkinson’s diseases

OBJECTIVE: Biases within probabilistic electronic phenotyping algorithms are largely unexplored. In this work, we characterize differences in subgroup performance of phenotyping algorithms for Alzheimer’s disease and related dementias (ADRD) in older adults. MATERIALS AND METHODS: We created an expe...

Descripción completa

Detalles Bibliográficos
Autores principales: Banda, Juan M, Shah, Nigam H, Periyakoil, Vyjeyanthi S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10307941/
https://www.ncbi.nlm.nih.gov/pubmed/37397506
http://dx.doi.org/10.1093/jamiaopen/ooad043
_version_ 1785066139158577152
author Banda, Juan M
Shah, Nigam H
Periyakoil, Vyjeyanthi S
author_facet Banda, Juan M
Shah, Nigam H
Periyakoil, Vyjeyanthi S
author_sort Banda, Juan M
collection PubMed
description OBJECTIVE: Biases within probabilistic electronic phenotyping algorithms are largely unexplored. In this work, we characterize differences in subgroup performance of phenotyping algorithms for Alzheimer’s disease and related dementias (ADRD) in older adults. MATERIALS AND METHODS: We created an experimental framework to characterize the performance of probabilistic phenotyping algorithms under different racial distributions allowing us to identify which algorithms may have differential performance, by how much, and under what conditions. We relied on rule-based phenotype definitions as reference to evaluate probabilistic phenotype algorithms created using the Automated PHenotype Routine for Observational Definition, Identification, Training and Evaluation framework. RESULTS: We demonstrate that some algorithms have performance variations anywhere from 3% to 30% for different populations, even when not using race as an input variable. We show that while performance differences in subgroups are not present for all phenotypes, they do affect some phenotypes and groups more disproportionately than others. DISCUSSION: Our analysis establishes the need for a robust evaluation framework for subgroup differences. The underlying patient populations for the algorithms showing subgroup performance differences have great variance between model features when compared with the phenotypes with little to no differences. CONCLUSION: We have created a framework to identify systematic differences in the performance of probabilistic phenotyping algorithms specifically in the context of ADRD as a use case. Differences in subgroup performance of probabilistic phenotyping algorithms are not widespread nor do they occur consistently. This highlights the great need for careful ongoing monitoring to evaluate, measure, and try to mitigate such differences.
format Online
Article
Text
id pubmed-10307941
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103079412023-06-30 Characterizing subgroup performance of probabilistic phenotype algorithms within older adults: a case study for dementia, mild cognitive impairment, and Alzheimer’s and Parkinson’s diseases Banda, Juan M Shah, Nigam H Periyakoil, Vyjeyanthi S JAMIA Open Research and Applications OBJECTIVE: Biases within probabilistic electronic phenotyping algorithms are largely unexplored. In this work, we characterize differences in subgroup performance of phenotyping algorithms for Alzheimer’s disease and related dementias (ADRD) in older adults. MATERIALS AND METHODS: We created an experimental framework to characterize the performance of probabilistic phenotyping algorithms under different racial distributions allowing us to identify which algorithms may have differential performance, by how much, and under what conditions. We relied on rule-based phenotype definitions as reference to evaluate probabilistic phenotype algorithms created using the Automated PHenotype Routine for Observational Definition, Identification, Training and Evaluation framework. RESULTS: We demonstrate that some algorithms have performance variations anywhere from 3% to 30% for different populations, even when not using race as an input variable. We show that while performance differences in subgroups are not present for all phenotypes, they do affect some phenotypes and groups more disproportionately than others. DISCUSSION: Our analysis establishes the need for a robust evaluation framework for subgroup differences. The underlying patient populations for the algorithms showing subgroup performance differences have great variance between model features when compared with the phenotypes with little to no differences. CONCLUSION: We have created a framework to identify systematic differences in the performance of probabilistic phenotyping algorithms specifically in the context of ADRD as a use case. Differences in subgroup performance of probabilistic phenotyping algorithms are not widespread nor do they occur consistently. This highlights the great need for careful ongoing monitoring to evaluate, measure, and try to mitigate such differences. Oxford University Press 2023-06-28 /pmc/articles/PMC10307941/ /pubmed/37397506 http://dx.doi.org/10.1093/jamiaopen/ooad043 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research and Applications
Banda, Juan M
Shah, Nigam H
Periyakoil, Vyjeyanthi S
Characterizing subgroup performance of probabilistic phenotype algorithms within older adults: a case study for dementia, mild cognitive impairment, and Alzheimer’s and Parkinson’s diseases
title Characterizing subgroup performance of probabilistic phenotype algorithms within older adults: a case study for dementia, mild cognitive impairment, and Alzheimer’s and Parkinson’s diseases
title_full Characterizing subgroup performance of probabilistic phenotype algorithms within older adults: a case study for dementia, mild cognitive impairment, and Alzheimer’s and Parkinson’s diseases
title_fullStr Characterizing subgroup performance of probabilistic phenotype algorithms within older adults: a case study for dementia, mild cognitive impairment, and Alzheimer’s and Parkinson’s diseases
title_full_unstemmed Characterizing subgroup performance of probabilistic phenotype algorithms within older adults: a case study for dementia, mild cognitive impairment, and Alzheimer’s and Parkinson’s diseases
title_short Characterizing subgroup performance of probabilistic phenotype algorithms within older adults: a case study for dementia, mild cognitive impairment, and Alzheimer’s and Parkinson’s diseases
title_sort characterizing subgroup performance of probabilistic phenotype algorithms within older adults: a case study for dementia, mild cognitive impairment, and alzheimer’s and parkinson’s diseases
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10307941/
https://www.ncbi.nlm.nih.gov/pubmed/37397506
http://dx.doi.org/10.1093/jamiaopen/ooad043
work_keys_str_mv AT bandajuanm characterizingsubgroupperformanceofprobabilisticphenotypealgorithmswithinolderadultsacasestudyfordementiamildcognitiveimpairmentandalzheimersandparkinsonsdiseases
AT shahnigamh characterizingsubgroupperformanceofprobabilisticphenotypealgorithmswithinolderadultsacasestudyfordementiamildcognitiveimpairmentandalzheimersandparkinsonsdiseases
AT periyakoilvyjeyanthis characterizingsubgroupperformanceofprobabilisticphenotypealgorithmswithinolderadultsacasestudyfordementiamildcognitiveimpairmentandalzheimersandparkinsonsdiseases