Cargando…

The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia

OBJECTIVE: Chest radiographs are frequently used to diagnose community-acquired pneumonia (CAP) for children in the acute care setting. Natural language processing (NLP)-based tools may be incorporated into the electronic health record and combined with other clinical data to develop meaningful clin...

Descripción completa

Detalles Bibliográficos
Autores principales: Rixe, Nancy, Frisch, Adam, Wang, Zhendong, Martin, Judith M., Suresh, Srinivasan, Florin, Todd A., Ramgopal, Sriram
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9992200/
https://www.ncbi.nlm.nih.gov/pubmed/36910570
http://dx.doi.org/10.3389/fdgth.2023.1104604
_version_ 1784902280741388288
author Rixe, Nancy
Frisch, Adam
Wang, Zhendong
Martin, Judith M.
Suresh, Srinivasan
Florin, Todd A.
Ramgopal, Sriram
author_facet Rixe, Nancy
Frisch, Adam
Wang, Zhendong
Martin, Judith M.
Suresh, Srinivasan
Florin, Todd A.
Ramgopal, Sriram
author_sort Rixe, Nancy
collection PubMed
description OBJECTIVE: Chest radiographs are frequently used to diagnose community-acquired pneumonia (CAP) for children in the acute care setting. Natural language processing (NLP)-based tools may be incorporated into the electronic health record and combined with other clinical data to develop meaningful clinical decision support tools for this common pediatric infection. We sought to develop and internally validate NLP algorithms to identify pediatric chest radiograph (CXR) reports with pneumonia. MATERIALS AND METHODS: We performed a retrospective study of encounters for patients from six pediatric hospitals over a 3-year period. We utilized six NLP techniques: word embedding, support vector machines, extreme gradient boosting (XGBoost), light gradient boosting machines Naïve Bayes and logistic regression. We evaluated their performance of each model from a validation sample of 1,350 chest radiographs developed as a stratified random sample of 35% admitted and 65% discharged patients when both using expert consensus and diagnosis codes. RESULTS: Of 172,662 encounters in the derivation sample, 15.6% had a discharge diagnosis of pneumonia in a primary or secondary position. The median patient age in the derivation sample was 3.7 years (interquartile range, 1.4–9.5 years). In the validation sample, 185/1350 (13.8%) and 205/1350 (15.3%) were classified as pneumonia by content experts and by diagnosis codes, respectively. Compared to content experts, Naïve Bayes had the highest sensitivity (93.5%) and XGBoost had the highest F1 score (72.4). Compared to a diagnosis code of pneumonia, the highest sensitivity was again with the Naïve Bayes (80.1%), and the highest F1 score was with the support vector machine (53.0%). CONCLUSION: NLP algorithms can accurately identify pediatric pneumonia from radiography reports. Following external validation and implementation into the electronic health record, these algorithms can facilitate clinical decision support and inform large database research.
format Online
Article
Text
id pubmed-9992200
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-99922002023-03-09 The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia Rixe, Nancy Frisch, Adam Wang, Zhendong Martin, Judith M. Suresh, Srinivasan Florin, Todd A. Ramgopal, Sriram Front Digit Health Digital Health OBJECTIVE: Chest radiographs are frequently used to diagnose community-acquired pneumonia (CAP) for children in the acute care setting. Natural language processing (NLP)-based tools may be incorporated into the electronic health record and combined with other clinical data to develop meaningful clinical decision support tools for this common pediatric infection. We sought to develop and internally validate NLP algorithms to identify pediatric chest radiograph (CXR) reports with pneumonia. MATERIALS AND METHODS: We performed a retrospective study of encounters for patients from six pediatric hospitals over a 3-year period. We utilized six NLP techniques: word embedding, support vector machines, extreme gradient boosting (XGBoost), light gradient boosting machines Naïve Bayes and logistic regression. We evaluated their performance of each model from a validation sample of 1,350 chest radiographs developed as a stratified random sample of 35% admitted and 65% discharged patients when both using expert consensus and diagnosis codes. RESULTS: Of 172,662 encounters in the derivation sample, 15.6% had a discharge diagnosis of pneumonia in a primary or secondary position. The median patient age in the derivation sample was 3.7 years (interquartile range, 1.4–9.5 years). In the validation sample, 185/1350 (13.8%) and 205/1350 (15.3%) were classified as pneumonia by content experts and by diagnosis codes, respectively. Compared to content experts, Naïve Bayes had the highest sensitivity (93.5%) and XGBoost had the highest F1 score (72.4). Compared to a diagnosis code of pneumonia, the highest sensitivity was again with the Naïve Bayes (80.1%), and the highest F1 score was with the support vector machine (53.0%). CONCLUSION: NLP algorithms can accurately identify pediatric pneumonia from radiography reports. Following external validation and implementation into the electronic health record, these algorithms can facilitate clinical decision support and inform large database research. Frontiers Media S.A. 2023-02-22 /pmc/articles/PMC9992200/ /pubmed/36910570 http://dx.doi.org/10.3389/fdgth.2023.1104604 Text en © 2023 Rixe, Frisch, Wang, Martin, Suresh, Florin and Ramgopal. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) (https://creativecommons.org/licenses/by/4.0/) . The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Digital Health
Rixe, Nancy
Frisch, Adam
Wang, Zhendong
Martin, Judith M.
Suresh, Srinivasan
Florin, Todd A.
Ramgopal, Sriram
The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia
title The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia
title_full The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia
title_fullStr The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia
title_full_unstemmed The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia
title_short The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia
title_sort development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia
topic Digital Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9992200/
https://www.ncbi.nlm.nih.gov/pubmed/36910570
http://dx.doi.org/10.3389/fdgth.2023.1104604
work_keys_str_mv AT rixenancy thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia
AT frischadam thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia
AT wangzhendong thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia
AT martinjudithm thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia
AT sureshsrinivasan thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia
AT florintodda thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia
AT ramgopalsriram thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia
AT rixenancy developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia
AT frischadam developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia
AT wangzhendong developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia
AT martinjudithm developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia
AT sureshsrinivasan developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia
AT florintodda developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia
AT ramgopalsriram developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia