Cargando…

Imputation Strategies Under Clinical Presence: Impact on Algorithmic Fairness

Biases have marked medical history, leading to unequal care affecting marginalised groups. The patterns of missingness in observational data often reflect these group discrepancies, but the algorithmic fairness implications of group-specific missingness are not well understood. Despite its potential...

Descripción completa

Detalles Bibliográficos
Autores principales: Jeanselme, Vincent, De-Arteaga, Maria, Zhang, Zhe, Barrett, Jessica, Tom, Brian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7614014/
https://www.ncbi.nlm.nih.gov/pubmed/36601036
_version_ 1783605553283465216
author Jeanselme, Vincent
De-Arteaga, Maria
Zhang, Zhe
Barrett, Jessica
Tom, Brian
author_facet Jeanselme, Vincent
De-Arteaga, Maria
Zhang, Zhe
Barrett, Jessica
Tom, Brian
author_sort Jeanselme, Vincent
collection PubMed
description Biases have marked medical history, leading to unequal care affecting marginalised groups. The patterns of missingness in observational data often reflect these group discrepancies, but the algorithmic fairness implications of group-specific missingness are not well understood. Despite its potential impact, imputation is too often an overlooked preprocessing step. When explicitly considered, attention is placed on overall performance, ignoring how this preprocessing can reinforce groupspecific inequities. Our work questions this choice by studying how imputation affects downstream algorithmic fairness. First, we provide a structured view of the relationship between clinical presence mechanisms and groupspecific missingness patterns. Then, through simulations and real-world experiments, we demonstrate that the imputation choice influences marginalised group performance and that no imputation strategy consistently reduces disparities. Importantly, our results show that current practices may endanger health equity as similarly performing imputation strategies at the population level can affect marginalised groups differently. Finally, we propose recommendations for mitigating inequities that may stem from a neglected step of the machine learning pipeline.
format Online
Article
Text
id pubmed-7614014
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-76140142023-01-03 Imputation Strategies Under Clinical Presence: Impact on Algorithmic Fairness Jeanselme, Vincent De-Arteaga, Maria Zhang, Zhe Barrett, Jessica Tom, Brian Proc Mach Learn Res Article Biases have marked medical history, leading to unequal care affecting marginalised groups. The patterns of missingness in observational data often reflect these group discrepancies, but the algorithmic fairness implications of group-specific missingness are not well understood. Despite its potential impact, imputation is too often an overlooked preprocessing step. When explicitly considered, attention is placed on overall performance, ignoring how this preprocessing can reinforce groupspecific inequities. Our work questions this choice by studying how imputation affects downstream algorithmic fairness. First, we provide a structured view of the relationship between clinical presence mechanisms and groupspecific missingness patterns. Then, through simulations and real-world experiments, we demonstrate that the imputation choice influences marginalised group performance and that no imputation strategy consistently reduces disparities. Importantly, our results show that current practices may endanger health equity as similarly performing imputation strategies at the population level can affect marginalised groups differently. Finally, we propose recommendations for mitigating inequities that may stem from a neglected step of the machine learning pipeline. 2022 /pmc/articles/PMC7614014/ /pubmed/36601036 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/) International license.
spellingShingle Article
Jeanselme, Vincent
De-Arteaga, Maria
Zhang, Zhe
Barrett, Jessica
Tom, Brian
Imputation Strategies Under Clinical Presence: Impact on Algorithmic Fairness
title Imputation Strategies Under Clinical Presence: Impact on Algorithmic Fairness
title_full Imputation Strategies Under Clinical Presence: Impact on Algorithmic Fairness
title_fullStr Imputation Strategies Under Clinical Presence: Impact on Algorithmic Fairness
title_full_unstemmed Imputation Strategies Under Clinical Presence: Impact on Algorithmic Fairness
title_short Imputation Strategies Under Clinical Presence: Impact on Algorithmic Fairness
title_sort imputation strategies under clinical presence: impact on algorithmic fairness
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7614014/
https://www.ncbi.nlm.nih.gov/pubmed/36601036
work_keys_str_mv AT jeanselmevincent imputationstrategiesunderclinicalpresenceimpactonalgorithmicfairness
AT dearteagamaria imputationstrategiesunderclinicalpresenceimpactonalgorithmicfairness
AT zhangzhe imputationstrategiesunderclinicalpresenceimpactonalgorithmicfairness
AT barrettjessica imputationstrategiesunderclinicalpresenceimpactonalgorithmicfairness
AT tombrian imputationstrategiesunderclinicalpresenceimpactonalgorithmicfairness