Cargando…

Performance Drift in a Mortality Prediction Algorithm during the SARS-CoV-2 Pandemic

RESEARCH OBJECTIVE: Health systems use clinical predictive algorithms to allocate resources to high-risk patients. Such algorithms are trained using historical data and are later implemented in clinical settings. During this implementation period, predictive algorithms are prone to performance chang...

Descripción completa

Detalles Bibliográficos
Autores principales: Parikh, Ravi B., Zhang, Yichen, Chivers, Corey, Courtright, Katherine R., Zhu, Jingsan, Hearn, Caleb M., Navathe, Amol S., Chen, Jinbo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8902871/
https://www.ncbi.nlm.nih.gov/pubmed/35262088
http://dx.doi.org/10.1101/2022.02.28.22270996
_version_ 1784664682911498240
author Parikh, Ravi B.
Zhang, Yichen
Chivers, Corey
Courtright, Katherine R.
Zhu, Jingsan
Hearn, Caleb M.
Navathe, Amol S.
Chen, Jinbo
author_facet Parikh, Ravi B.
Zhang, Yichen
Chivers, Corey
Courtright, Katherine R.
Zhu, Jingsan
Hearn, Caleb M.
Navathe, Amol S.
Chen, Jinbo
author_sort Parikh, Ravi B.
collection PubMed
description RESEARCH OBJECTIVE: Health systems use clinical predictive algorithms to allocate resources to high-risk patients. Such algorithms are trained using historical data and are later implemented in clinical settings. During this implementation period, predictive algorithms are prone to performance changes (“drift”) due to exogenous shocks in utilization or shifts in patient characteristics. Our objective was to examine the impact of sudden utilization shifts during the SARS-CoV-2 pandemic on the performance of an electronic health record (EHR)-based prognostic algorithm. STUDY DESIGN: We studied changes in the performance of Conversation Connect, a validated machine learning algorithm that predicts 180-day mortality among outpatients with cancer receiving care at medical oncology practices within a large academic cancer center. Conversation Connect generates mortality risk predictions before each encounter using data from 159 EHR variables collected in the six months before the encounter. Since January 2019, Conversation Connect has been used as part of a behavioral intervention to prompt clinicians to consider early advance care planning conversations among patients with ≥ 10% mortality risk. First, we descriptively compared encounter-level characteristics in the following periods: January 2019–February 2020 (“pre-pandemic”), March-May 2020 (“early-pandemic”), and June–December 2020 (“later-pandemic”). Second, we quantified changes in high-risk patient encounters using interrupted time series analyses that controlled for pre-pandemic trends and demographic, clinical, and practice covariates. Our primary metric of performance drift was false negative rate (FNR). Third, we assessed contributors to performance drift by comparing distributions of key EHR inputs across periods and predicting later pandemic utilization using pre-pandemic inputs. POPULATION STUDIED: 237,336 in-person and telemedicine medical oncology encounters. PRINCIPAL FINDINGS: Age, race, average patient encounters per month, insurance type, comorbidity counts, laboratory values, and overall mortality were similar among encounters in the pre-, early-, and later-pandemic periods. Relative to the pre-pandemic period, the later-pandemic period was characterized by a 6.5-percentage-point decrease (28.2% vs. 34.7%) in high-risk encounters (p<0.001). FNR increased from 41.0% (95% CI 38.0–44.1%) in the pre-pandemic period to 57.5% (95% CI 51.9–63.0%) in the later pandemic period. Compared to the pre-pandemic period, the early and later pandemic periods had higher proportions of telemedicine encounters (0.01% pre-pandemic vs. 20.0% early-pandemic vs. 26.4% later-pandemic) and encounters with no preceding laboratory draws (17.7% pre-pandemic vs. 19.8% early-pandemic vs. 24.1% later-pandemic). In the later pandemic period, observed laboratory utilization was lower than predicted (76.0% vs 81.2%, p<0.001). In the later-pandemic period, mean 180-day mortality risk scores were lower for telemedicine encounters vs. in-person encounters (10.3% vs 11.2%, p<0.001) and encounters with no vs. any preceding laboratory draws (1.5% vs. 14.0%, p<0.001). CONCLUSIONS: During the SARS-CoV-2 pandemic period, the performance of a machine learning prognostic algorithm used to prompt advance care planning declined substantially. Increases in telemedicine and declines in laboratory utilization contributed to lower performance. IMPLICATIONS FOR POLICY OR PRACTICE: This is the first study to show algorithm performance drift due to SARS-CoV-2 pandemic-related shifts in telemedicine and laboratory utilization. These mechanisms of performance drift could apply to other EHR clinical predictive algorithms. Pandemic-related decreases in care utilization may negatively impact the performance of clinical predictive algorithms and warrant assessment and possible retraining of such algorithms.
format Online
Article
Text
id pubmed-8902871
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-89028712022-03-09 Performance Drift in a Mortality Prediction Algorithm during the SARS-CoV-2 Pandemic Parikh, Ravi B. Zhang, Yichen Chivers, Corey Courtright, Katherine R. Zhu, Jingsan Hearn, Caleb M. Navathe, Amol S. Chen, Jinbo medRxiv Article RESEARCH OBJECTIVE: Health systems use clinical predictive algorithms to allocate resources to high-risk patients. Such algorithms are trained using historical data and are later implemented in clinical settings. During this implementation period, predictive algorithms are prone to performance changes (“drift”) due to exogenous shocks in utilization or shifts in patient characteristics. Our objective was to examine the impact of sudden utilization shifts during the SARS-CoV-2 pandemic on the performance of an electronic health record (EHR)-based prognostic algorithm. STUDY DESIGN: We studied changes in the performance of Conversation Connect, a validated machine learning algorithm that predicts 180-day mortality among outpatients with cancer receiving care at medical oncology practices within a large academic cancer center. Conversation Connect generates mortality risk predictions before each encounter using data from 159 EHR variables collected in the six months before the encounter. Since January 2019, Conversation Connect has been used as part of a behavioral intervention to prompt clinicians to consider early advance care planning conversations among patients with ≥ 10% mortality risk. First, we descriptively compared encounter-level characteristics in the following periods: January 2019–February 2020 (“pre-pandemic”), March-May 2020 (“early-pandemic”), and June–December 2020 (“later-pandemic”). Second, we quantified changes in high-risk patient encounters using interrupted time series analyses that controlled for pre-pandemic trends and demographic, clinical, and practice covariates. Our primary metric of performance drift was false negative rate (FNR). Third, we assessed contributors to performance drift by comparing distributions of key EHR inputs across periods and predicting later pandemic utilization using pre-pandemic inputs. POPULATION STUDIED: 237,336 in-person and telemedicine medical oncology encounters. PRINCIPAL FINDINGS: Age, race, average patient encounters per month, insurance type, comorbidity counts, laboratory values, and overall mortality were similar among encounters in the pre-, early-, and later-pandemic periods. Relative to the pre-pandemic period, the later-pandemic period was characterized by a 6.5-percentage-point decrease (28.2% vs. 34.7%) in high-risk encounters (p<0.001). FNR increased from 41.0% (95% CI 38.0–44.1%) in the pre-pandemic period to 57.5% (95% CI 51.9–63.0%) in the later pandemic period. Compared to the pre-pandemic period, the early and later pandemic periods had higher proportions of telemedicine encounters (0.01% pre-pandemic vs. 20.0% early-pandemic vs. 26.4% later-pandemic) and encounters with no preceding laboratory draws (17.7% pre-pandemic vs. 19.8% early-pandemic vs. 24.1% later-pandemic). In the later pandemic period, observed laboratory utilization was lower than predicted (76.0% vs 81.2%, p<0.001). In the later-pandemic period, mean 180-day mortality risk scores were lower for telemedicine encounters vs. in-person encounters (10.3% vs 11.2%, p<0.001) and encounters with no vs. any preceding laboratory draws (1.5% vs. 14.0%, p<0.001). CONCLUSIONS: During the SARS-CoV-2 pandemic period, the performance of a machine learning prognostic algorithm used to prompt advance care planning declined substantially. Increases in telemedicine and declines in laboratory utilization contributed to lower performance. IMPLICATIONS FOR POLICY OR PRACTICE: This is the first study to show algorithm performance drift due to SARS-CoV-2 pandemic-related shifts in telemedicine and laboratory utilization. These mechanisms of performance drift could apply to other EHR clinical predictive algorithms. Pandemic-related decreases in care utilization may negatively impact the performance of clinical predictive algorithms and warrant assessment and possible retraining of such algorithms. Cold Spring Harbor Laboratory 2022-03-01 /pmc/articles/PMC8902871/ /pubmed/35262088 http://dx.doi.org/10.1101/2022.02.28.22270996 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Parikh, Ravi B.
Zhang, Yichen
Chivers, Corey
Courtright, Katherine R.
Zhu, Jingsan
Hearn, Caleb M.
Navathe, Amol S.
Chen, Jinbo
Performance Drift in a Mortality Prediction Algorithm during the SARS-CoV-2 Pandemic
title Performance Drift in a Mortality Prediction Algorithm during the SARS-CoV-2 Pandemic
title_full Performance Drift in a Mortality Prediction Algorithm during the SARS-CoV-2 Pandemic
title_fullStr Performance Drift in a Mortality Prediction Algorithm during the SARS-CoV-2 Pandemic
title_full_unstemmed Performance Drift in a Mortality Prediction Algorithm during the SARS-CoV-2 Pandemic
title_short Performance Drift in a Mortality Prediction Algorithm during the SARS-CoV-2 Pandemic
title_sort performance drift in a mortality prediction algorithm during the sars-cov-2 pandemic
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8902871/
https://www.ncbi.nlm.nih.gov/pubmed/35262088
http://dx.doi.org/10.1101/2022.02.28.22270996
work_keys_str_mv AT parikhravib performancedriftinamortalitypredictionalgorithmduringthesarscov2pandemic
AT zhangyichen performancedriftinamortalitypredictionalgorithmduringthesarscov2pandemic
AT chiverscorey performancedriftinamortalitypredictionalgorithmduringthesarscov2pandemic
AT courtrightkatheriner performancedriftinamortalitypredictionalgorithmduringthesarscov2pandemic
AT zhujingsan performancedriftinamortalitypredictionalgorithmduringthesarscov2pandemic
AT hearncalebm performancedriftinamortalitypredictionalgorithmduringthesarscov2pandemic
AT navatheamols performancedriftinamortalitypredictionalgorithmduringthesarscov2pandemic
AT chenjinbo performancedriftinamortalitypredictionalgorithmduringthesarscov2pandemic