Cargando…
Prediction of American Society of Anesthesiologists Physical Status Classification from preoperative clinical text narratives using natural language processing
BACKGROUND: Electronic health records (EHR) contain large volumes of unstructured free-form text notes that richly describe a patient’s health and medical comorbidities. It is unclear if perioperative risk stratification can be performed directly from these notes without manual data extraction. We c...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10476287/ https://www.ncbi.nlm.nih.gov/pubmed/37667258 http://dx.doi.org/10.1186/s12871-023-02248-0 |
_version_ | 1785100895297470464 |
---|---|
author | Chung, Philip Fong, Christine T. Walters, Andrew M. Yetisgen, Meliha O’Reilly-Shah, Vikas N. |
author_facet | Chung, Philip Fong, Christine T. Walters, Andrew M. Yetisgen, Meliha O’Reilly-Shah, Vikas N. |
author_sort | Chung, Philip |
collection | PubMed |
description | BACKGROUND: Electronic health records (EHR) contain large volumes of unstructured free-form text notes that richly describe a patient’s health and medical comorbidities. It is unclear if perioperative risk stratification can be performed directly from these notes without manual data extraction. We conduct a feasibility study using natural language processing (NLP) to predict the American Society of Anesthesiologists Physical Status Classification (ASA-PS) as a surrogate measure for perioperative risk. We explore prediction performance using four different model types and compare the use of different note sections versus the whole note. We use Shapley values to explain model predictions and analyze disagreement between model and human anesthesiologist predictions. METHODS: Single-center retrospective cohort analysis of EHR notes from patients undergoing procedures with anesthesia care spanning all procedural specialties during a 5 year period who were not assigned ASA VI and also had a preoperative evaluation note filed within 90 days prior to the procedure. NLP models were trained for each combination of 4 models and 8 text snippets from notes. Model performance was compared using area under the receiver operating characteristic curve (AUROC) and area under the precision recall curve (AUPRC). Shapley values were used to explain model predictions. Error analysis and model explanation using Shapley values was conducted for the best performing model. RESULTS: Final dataset includes 38,566 patients undergoing 61,503 procedures with anesthesia care. Prevalence of ASA-PS was 8.81% for ASA I, 31.4% for ASA II, 43.25% for ASA III, and 16.54% for ASA IV-V. The best performing models were the BioClinicalBERT model on the truncated note task (macro-average AUROC 0.845) and the fastText model on the full note task (macro-average AUROC 0.865). Shapley values reveal human-interpretable model predictions. Error analysis reveals that some original ASA-PS assignments may be incorrect and the model is making a reasonable prediction in these cases. CONCLUSIONS: Text classification models can accurately predict a patient’s illness severity using only free-form text descriptions of patients without any manual data extraction. They can be an additional patient safety tool in the perioperative setting and reduce manual chart review for medical billing. Shapley feature attributions produce explanations that logically support model predictions and are understandable to clinicians. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12871-023-02248-0. |
format | Online Article Text |
id | pubmed-10476287 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-104762872023-09-05 Prediction of American Society of Anesthesiologists Physical Status Classification from preoperative clinical text narratives using natural language processing Chung, Philip Fong, Christine T. Walters, Andrew M. Yetisgen, Meliha O’Reilly-Shah, Vikas N. BMC Anesthesiol Research BACKGROUND: Electronic health records (EHR) contain large volumes of unstructured free-form text notes that richly describe a patient’s health and medical comorbidities. It is unclear if perioperative risk stratification can be performed directly from these notes without manual data extraction. We conduct a feasibility study using natural language processing (NLP) to predict the American Society of Anesthesiologists Physical Status Classification (ASA-PS) as a surrogate measure for perioperative risk. We explore prediction performance using four different model types and compare the use of different note sections versus the whole note. We use Shapley values to explain model predictions and analyze disagreement between model and human anesthesiologist predictions. METHODS: Single-center retrospective cohort analysis of EHR notes from patients undergoing procedures with anesthesia care spanning all procedural specialties during a 5 year period who were not assigned ASA VI and also had a preoperative evaluation note filed within 90 days prior to the procedure. NLP models were trained for each combination of 4 models and 8 text snippets from notes. Model performance was compared using area under the receiver operating characteristic curve (AUROC) and area under the precision recall curve (AUPRC). Shapley values were used to explain model predictions. Error analysis and model explanation using Shapley values was conducted for the best performing model. RESULTS: Final dataset includes 38,566 patients undergoing 61,503 procedures with anesthesia care. Prevalence of ASA-PS was 8.81% for ASA I, 31.4% for ASA II, 43.25% for ASA III, and 16.54% for ASA IV-V. The best performing models were the BioClinicalBERT model on the truncated note task (macro-average AUROC 0.845) and the fastText model on the full note task (macro-average AUROC 0.865). Shapley values reveal human-interpretable model predictions. Error analysis reveals that some original ASA-PS assignments may be incorrect and the model is making a reasonable prediction in these cases. CONCLUSIONS: Text classification models can accurately predict a patient’s illness severity using only free-form text descriptions of patients without any manual data extraction. They can be an additional patient safety tool in the perioperative setting and reduce manual chart review for medical billing. Shapley feature attributions produce explanations that logically support model predictions and are understandable to clinicians. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12871-023-02248-0. BioMed Central 2023-09-04 /pmc/articles/PMC10476287/ /pubmed/37667258 http://dx.doi.org/10.1186/s12871-023-02248-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Chung, Philip Fong, Christine T. Walters, Andrew M. Yetisgen, Meliha O’Reilly-Shah, Vikas N. Prediction of American Society of Anesthesiologists Physical Status Classification from preoperative clinical text narratives using natural language processing |
title | Prediction of American Society of Anesthesiologists Physical Status Classification from preoperative clinical text narratives using natural language processing |
title_full | Prediction of American Society of Anesthesiologists Physical Status Classification from preoperative clinical text narratives using natural language processing |
title_fullStr | Prediction of American Society of Anesthesiologists Physical Status Classification from preoperative clinical text narratives using natural language processing |
title_full_unstemmed | Prediction of American Society of Anesthesiologists Physical Status Classification from preoperative clinical text narratives using natural language processing |
title_short | Prediction of American Society of Anesthesiologists Physical Status Classification from preoperative clinical text narratives using natural language processing |
title_sort | prediction of american society of anesthesiologists physical status classification from preoperative clinical text narratives using natural language processing |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10476287/ https://www.ncbi.nlm.nih.gov/pubmed/37667258 http://dx.doi.org/10.1186/s12871-023-02248-0 |
work_keys_str_mv | AT chungphilip predictionofamericansocietyofanesthesiologistsphysicalstatusclassificationfrompreoperativeclinicaltextnarrativesusingnaturallanguageprocessing AT fongchristinet predictionofamericansocietyofanesthesiologistsphysicalstatusclassificationfrompreoperativeclinicaltextnarrativesusingnaturallanguageprocessing AT waltersandrewm predictionofamericansocietyofanesthesiologistsphysicalstatusclassificationfrompreoperativeclinicaltextnarrativesusingnaturallanguageprocessing AT yetisgenmeliha predictionofamericansocietyofanesthesiologistsphysicalstatusclassificationfrompreoperativeclinicaltextnarrativesusingnaturallanguageprocessing AT oreillyshahvikasn predictionofamericansocietyofanesthesiologistsphysicalstatusclassificationfrompreoperativeclinicaltextnarrativesusingnaturallanguageprocessing |