Cargando…
Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients
IMPORTANCE: Comparisons of antimicrobial use among hospitals are difficult to interpret owing to variations in patient case mix. Risk-adjustment strategies incorporating larger numbers of variables haves been proposed as a method to improve comparisons for antimicrobial stewardship assessments. OBJE...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Medical Association
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008288/ https://www.ncbi.nlm.nih.gov/pubmed/33779743 http://dx.doi.org/10.1001/jamanetworkopen.2021.3460 |
_version_ | 1783672664409243648 |
---|---|
author | Moehring, Rebekah W. Phelan, Matthew Lofgren, Eric Nelson, Alicia Dodds Ashley, Elizabeth Anderson, Deverick J. Goldstein, Benjamin A. |
author_facet | Moehring, Rebekah W. Phelan, Matthew Lofgren, Eric Nelson, Alicia Dodds Ashley, Elizabeth Anderson, Deverick J. Goldstein, Benjamin A. |
author_sort | Moehring, Rebekah W. |
collection | PubMed |
description | IMPORTANCE: Comparisons of antimicrobial use among hospitals are difficult to interpret owing to variations in patient case mix. Risk-adjustment strategies incorporating larger numbers of variables haves been proposed as a method to improve comparisons for antimicrobial stewardship assessments. OBJECTIVE: To evaluate whether variables of varying complexity and feasibility of measurement, derived retrospectively from the electronic health records, accurately identify inpatient antimicrobial use. DESIGN, SETTING, AND PARTICIPANTS: Retrospective cohort study, using a 2-stage random forests machine learning modeling analysis of electronic health record data. Data were split into training and testing sets to measure model performance using area under the curve and absolute error. All adult and pediatric inpatient encounters from October 1, 2015, to September 30, 2017, at 2 community hospitals and 1 academic medical center in the Duke University Health System were analyzed. A total of 204 candidate variables were categorized into 4 tiers based on feasibility of measurement from the electronic health records. MAIN OUTCOMES AND MEASURES: Antimicrobial exposure was measured at the encounter level in 2 ways: binary (ever or never) and number of days of therapy. Analyses were stratified by age (pediatric or adult), unit type, and antibiotic group. RESULTS: The data set included 170 294 encounters and 204 candidate variables from 3 hospitals during the 3-year study period. Antimicrobial exposure occurred in 80 190 encounters (47%); 64 998 (38%) received 1 to 6 days of therapy, and 15 192 (9%) received 7 or more days of therapy. Two-stage models identified antimicrobial use with high fidelity (mean area under the curve, 0.85; mean absolute error, 1.0 days of therapy). Addition of more complex variables increased accuracy, with largest improvements occurring with inclusion of diagnosis information. Accuracy varied based on location and antibiotic group. Models underestimated the number of days of therapy of encounters with long lengths of stay. CONCLUSIONS AND RELEVANCE: Models using variables derived from electronic health records identified antimicrobial exposure accurately. Future risk-adjustment strategies incorporating encounter-level information may make comparisons of antimicrobial use more meaningful for hospital antimicrobial stewardship assessments. |
format | Online Article Text |
id | pubmed-8008288 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | American Medical Association |
record_format | MEDLINE/PubMed |
spelling | pubmed-80082882021-04-16 Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients Moehring, Rebekah W. Phelan, Matthew Lofgren, Eric Nelson, Alicia Dodds Ashley, Elizabeth Anderson, Deverick J. Goldstein, Benjamin A. JAMA Netw Open Original Investigation IMPORTANCE: Comparisons of antimicrobial use among hospitals are difficult to interpret owing to variations in patient case mix. Risk-adjustment strategies incorporating larger numbers of variables haves been proposed as a method to improve comparisons for antimicrobial stewardship assessments. OBJECTIVE: To evaluate whether variables of varying complexity and feasibility of measurement, derived retrospectively from the electronic health records, accurately identify inpatient antimicrobial use. DESIGN, SETTING, AND PARTICIPANTS: Retrospective cohort study, using a 2-stage random forests machine learning modeling analysis of electronic health record data. Data were split into training and testing sets to measure model performance using area under the curve and absolute error. All adult and pediatric inpatient encounters from October 1, 2015, to September 30, 2017, at 2 community hospitals and 1 academic medical center in the Duke University Health System were analyzed. A total of 204 candidate variables were categorized into 4 tiers based on feasibility of measurement from the electronic health records. MAIN OUTCOMES AND MEASURES: Antimicrobial exposure was measured at the encounter level in 2 ways: binary (ever or never) and number of days of therapy. Analyses were stratified by age (pediatric or adult), unit type, and antibiotic group. RESULTS: The data set included 170 294 encounters and 204 candidate variables from 3 hospitals during the 3-year study period. Antimicrobial exposure occurred in 80 190 encounters (47%); 64 998 (38%) received 1 to 6 days of therapy, and 15 192 (9%) received 7 or more days of therapy. Two-stage models identified antimicrobial use with high fidelity (mean area under the curve, 0.85; mean absolute error, 1.0 days of therapy). Addition of more complex variables increased accuracy, with largest improvements occurring with inclusion of diagnosis information. Accuracy varied based on location and antibiotic group. Models underestimated the number of days of therapy of encounters with long lengths of stay. CONCLUSIONS AND RELEVANCE: Models using variables derived from electronic health records identified antimicrobial exposure accurately. Future risk-adjustment strategies incorporating encounter-level information may make comparisons of antimicrobial use more meaningful for hospital antimicrobial stewardship assessments. American Medical Association 2021-03-29 /pmc/articles/PMC8008288/ /pubmed/33779743 http://dx.doi.org/10.1001/jamanetworkopen.2021.3460 Text en Copyright 2021 Moehring RW et al. JAMA Network Open. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the CC-BY License. |
spellingShingle | Original Investigation Moehring, Rebekah W. Phelan, Matthew Lofgren, Eric Nelson, Alicia Dodds Ashley, Elizabeth Anderson, Deverick J. Goldstein, Benjamin A. Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients |
title | Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients |
title_full | Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients |
title_fullStr | Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients |
title_full_unstemmed | Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients |
title_short | Development of a Machine Learning Model Using Electronic Health Record Data to Identify Antibiotic Use Among Hospitalized Patients |
title_sort | development of a machine learning model using electronic health record data to identify antibiotic use among hospitalized patients |
topic | Original Investigation |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008288/ https://www.ncbi.nlm.nih.gov/pubmed/33779743 http://dx.doi.org/10.1001/jamanetworkopen.2021.3460 |
work_keys_str_mv | AT moehringrebekahw developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients AT phelanmatthew developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients AT lofgreneric developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients AT nelsonalicia developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients AT doddsashleyelizabeth developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients AT andersondeverickj developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients AT goldsteinbenjamina developmentofamachinelearningmodelusingelectronichealthrecorddatatoidentifyantibioticuseamonghospitalizedpatients |