Cargando…

Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients

IMPORTANCE: Comparisons of antimicrobial use among hospitals are difficult to interpret owing to variations in patient case mix. Risk-adjustment strategies incorporating larger numbers of variables haves been proposed as a method to improve comparisons for antimicrobial stewardship assessments. OBJE...

Descripción completa

Detalles Bibliográficos
Autores principales: Moehring, Rebekah W., Phelan, Matthew, Lofgren, Eric, Nelson, Alicia, Dodds Ashley, Elizabeth, Anderson, Deverick J., Goldstein, Benjamin A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Medical Association 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008288/
https://www.ncbi.nlm.nih.gov/pubmed/33779743
http://dx.doi.org/10.1001/jamanetworkopen.2021.3460
_version_ 1783672664409243648
author Moehring, Rebekah W.
Phelan, Matthew
Lofgren, Eric
Nelson, Alicia
Dodds Ashley, Elizabeth
Anderson, Deverick J.
Goldstein, Benjamin A.
author_facet Moehring, Rebekah W.
Phelan, Matthew
Lofgren, Eric
Nelson, Alicia
Dodds Ashley, Elizabeth
Anderson, Deverick J.
Goldstein, Benjamin A.
author_sort Moehring, Rebekah W.
collection PubMed
description IMPORTANCE: Comparisons of antimicrobial use among hospitals are difficult to interpret owing to variations in patient case mix. Risk-adjustment strategies incorporating larger numbers of variables haves been proposed as a method to improve comparisons for antimicrobial stewardship assessments. OBJECTIVE: To evaluate whether variables of varying complexity and feasibility of measurement, derived retrospectively from the electronic health records, accurately identify inpatient antimicrobial use. DESIGN, SETTING, AND PARTICIPANTS: Retrospective cohort study, using a 2-stage random forests machine learning modeling analysis of electronic health record data. Data were split into training and testing sets to measure model performance using area under the curve and absolute error. All adult and pediatric inpatient encounters from October 1, 2015, to September 30, 2017, at 2 community hospitals and 1 academic medical center in the Duke University Health System were analyzed. A total of 204 candidate variables were categorized into 4 tiers based on feasibility of measurement from the electronic health records. MAIN OUTCOMES AND MEASURES: Antimicrobial exposure was measured at the encounter level in 2 ways: binary (ever or never) and number of days of therapy. Analyses were stratified by age (pediatric or adult), unit type, and antibiotic group. RESULTS: The data set included 170 294 encounters and 204 candidate variables from 3 hospitals during the 3-year study period. Antimicrobial exposure occurred in 80 190 encounters (47%); 64 998 (38%) received 1 to 6 days of therapy, and 15 192 (9%) received 7 or more days of therapy. Two-stage models identified antimicrobial use with high fidelity (mean area under the curve, 0.85; mean absolute error, 1.0 days of therapy). Addition of more complex variables increased accuracy, with largest improvements occurring with inclusion of diagnosis information. Accuracy varied based on location and antibiotic group. Models underestimated the number of days of therapy of encounters with long lengths of stay. CONCLUSIONS AND RELEVANCE: Models using variables derived from electronic health records identified antimicrobial exposure accurately. Future risk-adjustment strategies incorporating encounter-level information may make comparisons of antimicrobial use more meaningful for hospital antimicrobial stewardship assessments.
format Online
Article
Text
id pubmed-8008288
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Medical Association
record_format MEDLINE/PubMed
spelling pubmed-80082882021-04-16 Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients Moehring, Rebekah W. Phelan, Matthew Lofgren, Eric Nelson, Alicia Dodds Ashley, Elizabeth Anderson, Deverick J. Goldstein, Benjamin A. JAMA Netw Open Original Investigation IMPORTANCE: Comparisons of antimicrobial use among hospitals are difficult to interpret owing to variations in patient case mix. Risk-adjustment strategies incorporating larger numbers of variables haves been proposed as a method to improve comparisons for antimicrobial stewardship assessments. OBJECTIVE: To evaluate whether variables of varying complexity and feasibility of measurement, derived retrospectively from the electronic health records, accurately identify inpatient antimicrobial use. DESIGN, SETTING, AND PARTICIPANTS: Retrospective cohort study, using a 2-stage random forests machine learning modeling analysis of electronic health record data. Data were split into training and testing sets to measure model performance using area under the curve and absolute error. All adult and pediatric inpatient encounters from October 1, 2015, to September 30, 2017, at 2 community hospitals and 1 academic medical center in the Duke University Health System were analyzed. A total of 204 candidate variables were categorized into 4 tiers based on feasibility of measurement from the electronic health records. MAIN OUTCOMES AND MEASURES: Antimicrobial exposure was measured at the encounter level in 2 ways: binary (ever or never) and number of days of therapy. Analyses were stratified by age (pediatric or adult), unit type, and antibiotic group. RESULTS: The data set included 170 294 encounters and 204 candidate variables from 3 hospitals during the 3-year study period. Antimicrobial exposure occurred in 80 190 encounters (47%); 64 998 (38%) received 1 to 6 days of therapy, and 15 192 (9%) received 7 or more days of therapy. Two-stage models identified antimicrobial use with high fidelity (mean area under the curve, 0.85; mean absolute error, 1.0 days of therapy). Addition of more complex variables increased accuracy, with largest improvements occurring with inclusion of diagnosis information. Accuracy varied based on location and antibiotic group. Models underestimated the number of days of therapy of encounters with long lengths of stay. CONCLUSIONS AND RELEVANCE: Models using variables derived from electronic health records identified antimicrobial exposure accurately. Future risk-adjustment strategies incorporating encounter-level information may make comparisons of antimicrobial use more meaningful for hospital antimicrobial stewardship assessments. American Medical Association 2021-03-29 /pmc/articles/PMC8008288/ /pubmed/33779743 http://dx.doi.org/10.1001/jamanetworkopen.2021.3460 Text en Copyright 2021 Moehring RW et al. JAMA Network Open. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the CC-BY License.
spellingShingle Original Investigation
Moehring, Rebekah W.
Phelan, Matthew
Lofgren, Eric
Nelson, Alicia
Dodds Ashley, Elizabeth
Anderson, Deverick J.
Goldstein, Benjamin A.
Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients
title Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients
title_full Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients
title_fullStr Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients
title_full_unstemmed Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients
title_short Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients
title_sort development of a machine learning model using electronic health record data to identify antibiotic use among hospitalized patients
topic Original Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008288/
https://www.ncbi.nlm.nih.gov/pubmed/33779743
http://dx.doi.org/10.1001/jamanetworkopen.2021.3460
work_keys_str_mv AT moehringrebekahw developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients
AT phelanmatthew developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients
AT lofgreneric developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients
AT nelsonalicia developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients
AT doddsashleyelizabeth developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients
AT andersondeverickj developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients
AT goldsteinbenjamina developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients