Cargando…
Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US
BACKGROUND: Coronavirus disease 2019 (COVID-19) has spread quickly throughout the United States (US) causing significant disruption in healthcare and society. Tools to identify hot spots are important for public health planning. The goal of our study was to determine if natural language processing (...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Radiological Society of North America
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7977750/ https://www.ncbi.nlm.nih.gov/pubmed/33778666 http://dx.doi.org/10.1148/ryct.2021200596 |
_version_ | 1783667142894288896 |
---|---|
author | Cury, Ricardo C. Megyeri, Istvan Lindsey, Tony Macedo, Robson Batlle, Juan Kim, Shwan Baker, Brian Harris, Robert Clark, Reese H. |
author_facet | Cury, Ricardo C. Megyeri, Istvan Lindsey, Tony Macedo, Robson Batlle, Juan Kim, Shwan Baker, Brian Harris, Robert Clark, Reese H. |
author_sort | Cury, Ricardo C. |
collection | PubMed |
description | BACKGROUND: Coronavirus disease 2019 (COVID-19) has spread quickly throughout the United States (US) causing significant disruption in healthcare and society. Tools to identify hot spots are important for public health planning. The goal of our study was to determine if natural language processing (NLP) algorithm assessment of thoracic computed tomography (CT) imaging reports correlated with the incidence of official COVID-19 cases in the US. METHODS: Using de-identified HIPAA compliant patient data from our common imaging platform interconnected with over 2,100 facilities covering all 50 states, we developed three NLP algorithms to track positive CT imaging features of respiratory illness typical in SARS-CoV-2 viral infection. We compared our findings against the number of official COVID-19 daily, weekly and state-wide. RESULTS: The NLP algorithms were applied to 450,114 patient chest CT comprehensive reports gathered from January 1(st) to October 3(rd), 2020. The best performing NLP model exhibited strong correlation with daily official COVID-19 cases (r(2)=0.82, p<0.005). The NLP models demonstrated an early rise in cases followed by the increase of official cases, suggesting the possibility of an early predictive marker, with strong correlation to official cases on a weekly basis (r(2)=0.91, p<0.005). There was also substantial correlation between the NLP and official COVID-19 incidence by state (r(2)=0.92, p<0.005). CONCLUSION: Using big data, we developed a novel machine-learning based NLP algorithm that can track imaging findings of respiratory illness detected on chest CT imaging reports with strong correlation with the progression of the COVID-19 pandemic in the US. |
format | Online Article Text |
id | pubmed-7977750 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Radiological Society of North America |
record_format | MEDLINE/PubMed |
spelling | pubmed-79777502021-03-26 Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US Cury, Ricardo C. Megyeri, Istvan Lindsey, Tony Macedo, Robson Batlle, Juan Kim, Shwan Baker, Brian Harris, Robert Clark, Reese H. Radiol Cardiothorac Imaging Original Research BACKGROUND: Coronavirus disease 2019 (COVID-19) has spread quickly throughout the United States (US) causing significant disruption in healthcare and society. Tools to identify hot spots are important for public health planning. The goal of our study was to determine if natural language processing (NLP) algorithm assessment of thoracic computed tomography (CT) imaging reports correlated with the incidence of official COVID-19 cases in the US. METHODS: Using de-identified HIPAA compliant patient data from our common imaging platform interconnected with over 2,100 facilities covering all 50 states, we developed three NLP algorithms to track positive CT imaging features of respiratory illness typical in SARS-CoV-2 viral infection. We compared our findings against the number of official COVID-19 daily, weekly and state-wide. RESULTS: The NLP algorithms were applied to 450,114 patient chest CT comprehensive reports gathered from January 1(st) to October 3(rd), 2020. The best performing NLP model exhibited strong correlation with daily official COVID-19 cases (r(2)=0.82, p<0.005). The NLP models demonstrated an early rise in cases followed by the increase of official cases, suggesting the possibility of an early predictive marker, with strong correlation to official cases on a weekly basis (r(2)=0.91, p<0.005). There was also substantial correlation between the NLP and official COVID-19 incidence by state (r(2)=0.92, p<0.005). CONCLUSION: Using big data, we developed a novel machine-learning based NLP algorithm that can track imaging findings of respiratory illness detected on chest CT imaging reports with strong correlation with the progression of the COVID-19 pandemic in the US. Radiological Society of North America 2021-02-25 /pmc/articles/PMC7977750/ /pubmed/33778666 http://dx.doi.org/10.1148/ryct.2021200596 Text en 2021 by the Radiological Society of North America, Inc. This article is made available via the PMC Open Access Subset for unrestricted re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the COVID-19 pandemic or until permissions are revoked in writing. Upon expiration of these permissions, PMC is granted a perpetual license to make this article available via PMC and Europe PMC, consistent with existing copyright protections. |
spellingShingle | Original Research Cury, Ricardo C. Megyeri, Istvan Lindsey, Tony Macedo, Robson Batlle, Juan Kim, Shwan Baker, Brian Harris, Robert Clark, Reese H. Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US |
title | Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US |
title_full | Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US |
title_fullStr | Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US |
title_full_unstemmed | Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US |
title_short | Natural Language Processing and Machine Learning for Detection of Respiratory Illness by Chest CT Imaging and Tracking of COVID-19 Pandemic in the US |
title_sort | natural language processing and machine learning for detection of respiratory illness by chest ct imaging and tracking of covid-19 pandemic in the us |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7977750/ https://www.ncbi.nlm.nih.gov/pubmed/33778666 http://dx.doi.org/10.1148/ryct.2021200596 |
work_keys_str_mv | AT curyricardoc naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus AT megyeriistvan naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus AT lindseytony naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus AT macedorobson naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus AT batllejuan naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus AT kimshwan naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus AT bakerbrian naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus AT harrisrobert naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus AT clarkreeseh naturallanguageprocessingandmachinelearningfordetectionofrespiratoryillnessbychestctimagingandtrackingofcovid19pandemicintheus |