Cargando…

A comparison of approaches to improve worst-case predictive model performance over patient subpopulations

Predictive models for clinical outcomes that are accurate on average in a patient population may underperform drastically for some subpopulations, potentially introducing or reinforcing inequities in care access and quality. Model training approaches that aim to maximize worst-case model performance...

Descripción completa

Detalles Bibliográficos
Autores principales:	Pfohl, Stephen R., Zhang, Haoran, Xu, Yizhe, Foryciarz, Agata, Ghassemi, Marzyeh, Shah, Nigam H.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8885701/ https://www.ncbi.nlm.nih.gov/pubmed/35228563 http://dx.doi.org/10.1038/s41598-022-07167-7

_version_	1784660497186947072
author	Pfohl, Stephen R. Zhang, Haoran Xu, Yizhe Foryciarz, Agata Ghassemi, Marzyeh Shah, Nigam H.
author_facet	Pfohl, Stephen R. Zhang, Haoran Xu, Yizhe Foryciarz, Agata Ghassemi, Marzyeh Shah, Nigam H.
author_sort	Pfohl, Stephen R.
collection	PubMed
description	Predictive models for clinical outcomes that are accurate on average in a patient population may underperform drastically for some subpopulations, potentially introducing or reinforcing inequities in care access and quality. Model training approaches that aim to maximize worst-case model performance across subpopulations, such as distributionally robust optimization (DRO), attempt to address this problem without introducing additional harms. We conduct a large-scale empirical study of DRO and several variations of standard learning procedures to identify approaches for model development and selection that consistently improve disaggregated and worst-case performance over subpopulations compared to standard approaches for learning predictive models from electronic health records data. In the course of our evaluation, we introduce an extension to DRO approaches that allows for specification of the metric used to assess worst-case performance. We conduct the analysis for models that predict in-hospital mortality, prolonged length of stay, and 30-day readmission for inpatient admissions, and predict in-hospital mortality using intensive care data. We find that, with relatively few exceptions, no approach performs better, for each patient subpopulation examined, than standard learning procedures using the entire training dataset. These results imply that when it is of interest to improve model performance for patient subpopulations beyond what can be achieved with standard practices, it may be necessary to do so via data collection techniques that increase the effective sample size or reduce the level of noise in the prediction problem.
format	Online Article Text
id	pubmed-8885701
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-88857012022-03-01 A comparison of approaches to improve worst-case predictive model performance over patient subpopulations Pfohl, Stephen R. Zhang, Haoran Xu, Yizhe Foryciarz, Agata Ghassemi, Marzyeh Shah, Nigam H. Sci Rep Article Predictive models for clinical outcomes that are accurate on average in a patient population may underperform drastically for some subpopulations, potentially introducing or reinforcing inequities in care access and quality. Model training approaches that aim to maximize worst-case model performance across subpopulations, such as distributionally robust optimization (DRO), attempt to address this problem without introducing additional harms. We conduct a large-scale empirical study of DRO and several variations of standard learning procedures to identify approaches for model development and selection that consistently improve disaggregated and worst-case performance over subpopulations compared to standard approaches for learning predictive models from electronic health records data. In the course of our evaluation, we introduce an extension to DRO approaches that allows for specification of the metric used to assess worst-case performance. We conduct the analysis for models that predict in-hospital mortality, prolonged length of stay, and 30-day readmission for inpatient admissions, and predict in-hospital mortality using intensive care data. We find that, with relatively few exceptions, no approach performs better, for each patient subpopulation examined, than standard learning procedures using the entire training dataset. These results imply that when it is of interest to improve model performance for patient subpopulations beyond what can be achieved with standard practices, it may be necessary to do so via data collection techniques that increase the effective sample size or reduce the level of noise in the prediction problem. Nature Publishing Group UK 2022-02-28 /pmc/articles/PMC8885701/ /pubmed/35228563 http://dx.doi.org/10.1038/s41598-022-07167-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Pfohl, Stephen R. Zhang, Haoran Xu, Yizhe Foryciarz, Agata Ghassemi, Marzyeh Shah, Nigam H. A comparison of approaches to improve worst-case predictive model performance over patient subpopulations
title	A comparison of approaches to improve worst-case predictive model performance over patient subpopulations
title_full	A comparison of approaches to improve worst-case predictive model performance over patient subpopulations
title_fullStr	A comparison of approaches to improve worst-case predictive model performance over patient subpopulations
title_full_unstemmed	A comparison of approaches to improve worst-case predictive model performance over patient subpopulations
title_short	A comparison of approaches to improve worst-case predictive model performance over patient subpopulations
title_sort	comparison of approaches to improve worst-case predictive model performance over patient subpopulations
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8885701/ https://www.ncbi.nlm.nih.gov/pubmed/35228563 http://dx.doi.org/10.1038/s41598-022-07167-7
work_keys_str_mv	AT pfohlstephenr acomparisonofapproachestoimproveworstcasepredictivemodelperformanceoverpatientsubpopulations AT zhanghaoran acomparisonofapproachestoimproveworstcasepredictivemodelperformanceoverpatientsubpopulations AT xuyizhe acomparisonofapproachestoimproveworstcasepredictivemodelperformanceoverpatientsubpopulations AT foryciarzagata acomparisonofapproachestoimproveworstcasepredictivemodelperformanceoverpatientsubpopulations AT ghassemimarzyeh acomparisonofapproachestoimproveworstcasepredictivemodelperformanceoverpatientsubpopulations AT shahnigamh acomparisonofapproachestoimproveworstcasepredictivemodelperformanceoverpatientsubpopulations AT pfohlstephenr comparisonofapproachestoimproveworstcasepredictivemodelperformanceoverpatientsubpopulations AT zhanghaoran comparisonofapproachestoimproveworstcasepredictivemodelperformanceoverpatientsubpopulations AT xuyizhe comparisonofapproachestoimproveworstcasepredictivemodelperformanceoverpatientsubpopulations AT foryciarzagata comparisonofapproachestoimproveworstcasepredictivemodelperformanceoverpatientsubpopulations AT ghassemimarzyeh comparisonofapproachestoimproveworstcasepredictivemodelperformanceoverpatientsubpopulations AT shahnigamh comparisonofapproachestoimproveworstcasepredictivemodelperformanceoverpatientsubpopulations

A comparison of approaches to improve worst-case predictive model performance over patient subpopulations

Ejemplares similares