
The path toward equal performance in medical machine learning

To ensure equitable quality of care, differences in machine learning model performance between patient groups must be addressed. Here, we argue that two separate mechanisms can cause performance differences between groups. First, model performance may be worse than theoretically achievable in a give...


Bibliographic Details
Main Authors: Petersen, Eike; Holm, Sune; Ganz, Melanie; Feragen, Aasa
Format: Online Article Text
Language: English
Published: Elsevier 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10382979/
https://www.ncbi.nlm.nih.gov/pubmed/37521051
http://dx.doi.org/10.1016/j.patter.2023.100790
_version_ 1785080794087161856
author Petersen, Eike
Holm, Sune
Ganz, Melanie
Feragen, Aasa
author_facet Petersen, Eike
Holm, Sune
Ganz, Melanie
Feragen, Aasa
author_sort Petersen, Eike
collection PubMed
description To ensure equitable quality of care, differences in machine learning model performance between patient groups must be addressed. Here, we argue that two separate mechanisms can cause performance differences between groups. First, model performance may be worse than theoretically achievable in a given group. This can occur due to a combination of group underrepresentation, modeling choices, and the characteristics of the prediction task at hand. We examine scenarios in which underrepresentation leads to underperformance, scenarios in which it does not, and the differences between them. Second, the optimal achievable performance may also differ between groups due to differences in the intrinsic difficulty of the prediction task. We discuss several possible causes of such differences in task difficulty. In addition, challenges such as label biases and selection biases may confound both learning and performance evaluation. We highlight consequences for the path toward equal performance, and we emphasize that leveling up model performance may require gathering not only more data from underperforming groups but also better data. Throughout, we ground our discussion in real-world medical phenomena and case studies while also referencing relevant statistical theory.
format Online
Article
Text
id pubmed-10382979
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-10382979 2023-07-30 The path toward equal performance in medical machine learning Petersen, Eike Holm, Sune Ganz, Melanie Feragen, Aasa Patterns (N Y) Perspective Elsevier 2023-07-14 /pmc/articles/PMC10382979/ /pubmed/37521051 http://dx.doi.org/10.1016/j.patter.2023.100790 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Perspective
Petersen, Eike
Holm, Sune
Ganz, Melanie
Feragen, Aasa
The path toward equal performance in medical machine learning
title The path toward equal performance in medical machine learning
title_full The path toward equal performance in medical machine learning
title_fullStr The path toward equal performance in medical machine learning
title_full_unstemmed The path toward equal performance in medical machine learning
title_short The path toward equal performance in medical machine learning
title_sort path toward equal performance in medical machine learning
topic Perspective
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10382979/
https://www.ncbi.nlm.nih.gov/pubmed/37521051
http://dx.doi.org/10.1016/j.patter.2023.100790