Cargando…
The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia
OBJECTIVE: Chest radiographs are frequently used to diagnose community-acquired pneumonia (CAP) for children in the acute care setting. Natural language processing (NLP)-based tools may be incorporated into the electronic health record and combined with other clinical data to develop meaningful clin...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9992200/ https://www.ncbi.nlm.nih.gov/pubmed/36910570 http://dx.doi.org/10.3389/fdgth.2023.1104604 |
_version_ | 1784902280741388288 |
---|---|
author | Rixe, Nancy Frisch, Adam Wang, Zhendong Martin, Judith M. Suresh, Srinivasan Florin, Todd A. Ramgopal, Sriram |
author_facet | Rixe, Nancy Frisch, Adam Wang, Zhendong Martin, Judith M. Suresh, Srinivasan Florin, Todd A. Ramgopal, Sriram |
author_sort | Rixe, Nancy |
collection | PubMed |
description | OBJECTIVE: Chest radiographs are frequently used to diagnose community-acquired pneumonia (CAP) for children in the acute care setting. Natural language processing (NLP)-based tools may be incorporated into the electronic health record and combined with other clinical data to develop meaningful clinical decision support tools for this common pediatric infection. We sought to develop and internally validate NLP algorithms to identify pediatric chest radiograph (CXR) reports with pneumonia. MATERIALS AND METHODS: We performed a retrospective study of encounters for patients from six pediatric hospitals over a 3-year period. We utilized six NLP techniques: word embedding, support vector machines, extreme gradient boosting (XGBoost), light gradient boosting machines Naïve Bayes and logistic regression. We evaluated their performance of each model from a validation sample of 1,350 chest radiographs developed as a stratified random sample of 35% admitted and 65% discharged patients when both using expert consensus and diagnosis codes. RESULTS: Of 172,662 encounters in the derivation sample, 15.6% had a discharge diagnosis of pneumonia in a primary or secondary position. The median patient age in the derivation sample was 3.7 years (interquartile range, 1.4–9.5 years). In the validation sample, 185/1350 (13.8%) and 205/1350 (15.3%) were classified as pneumonia by content experts and by diagnosis codes, respectively. Compared to content experts, Naïve Bayes had the highest sensitivity (93.5%) and XGBoost had the highest F1 score (72.4). Compared to a diagnosis code of pneumonia, the highest sensitivity was again with the Naïve Bayes (80.1%), and the highest F1 score was with the support vector machine (53.0%). CONCLUSION: NLP algorithms can accurately identify pediatric pneumonia from radiography reports. Following external validation and implementation into the electronic health record, these algorithms can facilitate clinical decision support and inform large database research. |
format | Online Article Text |
id | pubmed-9992200 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-99922002023-03-09 The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia Rixe, Nancy Frisch, Adam Wang, Zhendong Martin, Judith M. Suresh, Srinivasan Florin, Todd A. Ramgopal, Sriram Front Digit Health Digital Health OBJECTIVE: Chest radiographs are frequently used to diagnose community-acquired pneumonia (CAP) for children in the acute care setting. Natural language processing (NLP)-based tools may be incorporated into the electronic health record and combined with other clinical data to develop meaningful clinical decision support tools for this common pediatric infection. We sought to develop and internally validate NLP algorithms to identify pediatric chest radiograph (CXR) reports with pneumonia. MATERIALS AND METHODS: We performed a retrospective study of encounters for patients from six pediatric hospitals over a 3-year period. We utilized six NLP techniques: word embedding, support vector machines, extreme gradient boosting (XGBoost), light gradient boosting machines Naïve Bayes and logistic regression. We evaluated their performance of each model from a validation sample of 1,350 chest radiographs developed as a stratified random sample of 35% admitted and 65% discharged patients when both using expert consensus and diagnosis codes. RESULTS: Of 172,662 encounters in the derivation sample, 15.6% had a discharge diagnosis of pneumonia in a primary or secondary position. The median patient age in the derivation sample was 3.7 years (interquartile range, 1.4–9.5 years). In the validation sample, 185/1350 (13.8%) and 205/1350 (15.3%) were classified as pneumonia by content experts and by diagnosis codes, respectively. Compared to content experts, Naïve Bayes had the highest sensitivity (93.5%) and XGBoost had the highest F1 score (72.4). Compared to a diagnosis code of pneumonia, the highest sensitivity was again with the Naïve Bayes (80.1%), and the highest F1 score was with the support vector machine (53.0%). CONCLUSION: NLP algorithms can accurately identify pediatric pneumonia from radiography reports. Following external validation and implementation into the electronic health record, these algorithms can facilitate clinical decision support and inform large database research. Frontiers Media S.A. 2023-02-22 /pmc/articles/PMC9992200/ /pubmed/36910570 http://dx.doi.org/10.3389/fdgth.2023.1104604 Text en © 2023 Rixe, Frisch, Wang, Martin, Suresh, Florin and Ramgopal. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) (https://creativecommons.org/licenses/by/4.0/) . The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Digital Health Rixe, Nancy Frisch, Adam Wang, Zhendong Martin, Judith M. Suresh, Srinivasan Florin, Todd A. Ramgopal, Sriram The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia |
title | The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia |
title_full | The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia |
title_fullStr | The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia |
title_full_unstemmed | The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia |
title_short | The development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia |
title_sort | development of a novel natural language processing tool to identify pediatric chest radiograph reports with pneumonia |
topic | Digital Health |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9992200/ https://www.ncbi.nlm.nih.gov/pubmed/36910570 http://dx.doi.org/10.3389/fdgth.2023.1104604 |
work_keys_str_mv | AT rixenancy thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia AT frischadam thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia AT wangzhendong thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia AT martinjudithm thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia AT sureshsrinivasan thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia AT florintodda thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia AT ramgopalsriram thedevelopmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia AT rixenancy developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia AT frischadam developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia AT wangzhendong developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia AT martinjudithm developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia AT sureshsrinivasan developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia AT florintodda developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia AT ramgopalsriram developmentofanovelnaturallanguageprocessingtooltoidentifypediatricchestradiographreportswithpneumonia |