Cargando…

Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US

BACKGROUND: Coronavirus disease 2019 (COVID-19) has spread quickly throughout the United States (US) causing significant disruption in healthcare and society. Tools to identify hot spots are important for public health planning. The goal of our study was to determine if natural language processing (...

Descripción completa

Detalles Bibliográficos
Autores principales: Cury, Ricardo C., Megyeri, Istvan, Lindsey, Tony, Macedo, Robson, Batlle, Juan, Kim, Shwan, Baker, Brian, Harris, Robert, Clark, Reese H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Radiological Society of North America 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7977750/
https://www.ncbi.nlm.nih.gov/pubmed/33778666
http://dx.doi.org/10.1148/ryct.2021200596
_version_ 1783667142894288896
author Cury, Ricardo C.
Megyeri, Istvan
Lindsey, Tony
Macedo, Robson
Batlle, Juan
Kim, Shwan
Baker, Brian
Harris, Robert
Clark, Reese H.
author_facet Cury, Ricardo C.
Megyeri, Istvan
Lindsey, Tony
Macedo, Robson
Batlle, Juan
Kim, Shwan
Baker, Brian
Harris, Robert
Clark, Reese H.
author_sort Cury, Ricardo C.
collection PubMed
description BACKGROUND: Coronavirus disease 2019 (COVID-19) has spread quickly throughout the United States (US) causing significant disruption in healthcare and society. Tools to identify hot spots are important for public health planning. The goal of our study was to determine if natural language processing (NLP) algorithm assessment of thoracic computed tomography (CT) imaging reports correlated with the incidence of official COVID-19 cases in the US. METHODS: Using de-identified HIPAA compliant patient data from our common imaging platform interconnected with over 2,100 facilities covering all 50 states, we developed three NLP algorithms to track positive CT imaging features of respiratory illness typical in SARS-CoV-2 viral infection. We compared our findings against the number of official COVID-19 daily, weekly and state-wide. RESULTS: The NLP algorithms were applied to 450,114 patient chest CT comprehensive reports gathered from January 1(st) to October 3(rd), 2020. The best performing NLP model exhibited strong correlation with daily official COVID-19 cases (r(2)=0.82, p<0.005). The NLP models demonstrated an early rise in cases followed by the increase of official cases, suggesting the possibility of an early predictive marker, with strong correlation to official cases on a weekly basis (r(2)=0.91, p<0.005). There was also substantial correlation between the NLP and official COVID-19 incidence by state (r(2)=0.92, p<0.005). CONCLUSION: Using big data, we developed a novel machine-learning based NLP algorithm that can track imaging findings of respiratory illness detected on chest CT imaging reports with strong correlation with the progression of the COVID-19 pandemic in the US.
format Online
Article
Text
id pubmed-7977750
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Radiological Society of North America
record_format MEDLINE/PubMed
spelling pubmed-79777502021-03-26 Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US Cury, Ricardo C. Megyeri, Istvan Lindsey, Tony Macedo, Robson Batlle, Juan Kim, Shwan Baker, Brian Harris, Robert Clark, Reese H. Radiol Cardiothorac Imaging Original Research BACKGROUND: Coronavirus disease 2019 (COVID-19) has spread quickly throughout the United States (US) causing significant disruption in healthcare and society. Tools to identify hot spots are important for public health planning. The goal of our study was to determine if natural language processing (NLP) algorithm assessment of thoracic computed tomography (CT) imaging reports correlated with the incidence of official COVID-19 cases in the US. METHODS: Using de-identified HIPAA compliant patient data from our common imaging platform interconnected with over 2,100 facilities covering all 50 states, we developed three NLP algorithms to track positive CT imaging features of respiratory illness typical in SARS-CoV-2 viral infection. We compared our findings against the number of official COVID-19 daily, weekly and state-wide. RESULTS: The NLP algorithms were applied to 450,114 patient chest CT comprehensive reports gathered from January 1(st) to October 3(rd), 2020. The best performing NLP model exhibited strong correlation with daily official COVID-19 cases (r(2)=0.82, p<0.005). The NLP models demonstrated an early rise in cases followed by the increase of official cases, suggesting the possibility of an early predictive marker, with strong correlation to official cases on a weekly basis (r(2)=0.91, p<0.005). There was also substantial correlation between the NLP and official COVID-19 incidence by state (r(2)=0.92, p<0.005). CONCLUSION: Using big data, we developed a novel machine-learning based NLP algorithm that can track imaging findings of respiratory illness detected on chest CT imaging reports with strong correlation with the progression of the COVID-19 pandemic in the US. Radiological Society of North America 2021-02-25 /pmc/articles/PMC7977750/ /pubmed/33778666 http://dx.doi.org/10.1148/ryct.2021200596 Text en 2021 by the Radiological Society of North America, Inc. This article is made available via the PMC Open Access Subset for unrestricted re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the COVID-19 pandemic or until permissions are revoked in writing. Upon expiration of these permissions, PMC is granted a perpetual license to make this article available via PMC and Europe PMC, consistent with existing copyright protections.
spellingShingle Original Research
Cury, Ricardo C.
Megyeri, Istvan
Lindsey, Tony
Macedo, Robson
Batlle, Juan
Kim, Shwan
Baker, Brian
Harris, Robert
Clark, Reese H.
Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US
title Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US
title_full Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US
title_fullStr Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US
title_full_unstemmed Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US
title_short Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US
title_sort natural language processing and machine learning for detection of respiratory illness by chest ct imaging and tracking of covid-19 pandemic in the us
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7977750/
https://www.ncbi.nlm.nih.gov/pubmed/33778666
http://dx.doi.org/10.1148/ryct.2021200596
work_keys_str_mv AT curyricardoc naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus
AT megyeriistvan naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus
AT lindseytony naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus
AT macedorobson naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus
AT batllejuan naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus
AT kimshwan naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus
AT bakerbrian naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus
AT harrisrobert naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus
AT clarkreeseh naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus