Cargando…

A Prospective Observational Study to Investigate Performance of a Chest X-ray Artificial Intelligence Diagnostic Support Tool Across 12 U.S. Hospitals

IMPORTANCE: An artificial intelligence (AI)-based model to predict COVID-19 likelihood from chest x-ray (CXR) findings can serve as an important adjunct to accelerate immediate clinical decision making and improve clinical decision making. Despite significant efforts, many limitations and biases exi...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Ju, Peng, Le, Li, Taihui, Adila, Dyah, Zaiman, Zach, Melton, Genevieve B., Ingraham, Nicholas, Murray, Eric, Boley, Daniel, Switzer, Sean, Burns, John L., Huang, Kun, Allen, Tadashi, Steenburg, Scott D., Gichoya, Judy Wawira, Kummerfeld, Erich, Tignanelli, Christopher
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cornell University 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8183017/
https://www.ncbi.nlm.nih.gov/pubmed/34099980
_version_ 1783704303188312064
author Sun, Ju
Peng, Le
Li, Taihui
Adila, Dyah
Zaiman, Zach
Melton, Genevieve B.
Ingraham, Nicholas
Murray, Eric
Boley, Daniel
Switzer, Sean
Burns, John L.
Huang, Kun
Allen, Tadashi
Steenburg, Scott D.
Gichoya, Judy Wawira
Kummerfeld, Erich
Tignanelli, Christopher
author_facet Sun, Ju
Peng, Le
Li, Taihui
Adila, Dyah
Zaiman, Zach
Melton, Genevieve B.
Ingraham, Nicholas
Murray, Eric
Boley, Daniel
Switzer, Sean
Burns, John L.
Huang, Kun
Allen, Tadashi
Steenburg, Scott D.
Gichoya, Judy Wawira
Kummerfeld, Erich
Tignanelli, Christopher
author_sort Sun, Ju
collection PubMed
description IMPORTANCE: An artificial intelligence (AI)-based model to predict COVID-19 likelihood from chest x-ray (CXR) findings can serve as an important adjunct to accelerate immediate clinical decision making and improve clinical decision making. Despite significant efforts, many limitations and biases exist in previously developed AI diagnostic models for COVID-19. Utilizing a large set of local and international CXR images, we developed an AI model with high performance on temporal and external validation. OBJECTIVE: Investigate real-time performance of an AI-enabled COVID-19 diagnostic support system across a 12-hospital system. DESIGN: Prospective observational study. SETTING: Labeled frontal CXR images (samples of COVID-19 and non-COVID-19) from the M Health Fairview (Minnesota, USA), Valencian Region Medical ImageBank (Spain), MIMIC-CXR, Open-I 2013 Chest X-ray Collection, GitHub COVID-19 Image Data Collection (International), Indiana University (Indiana, USA), and Emory University (Georgia, USA) PARTICIPANTS: Internal (training, temporal, and real-time validation): 51,592 CXRs; Public: 27,424 CXRs; External (Indiana University): 10,002 CXRs; External (Emory University): 2002 CXRs MAIN OUTCOME AND MEASURE: Model performance assessed via receiver operating characteristic (ROC), Precision-Recall curves, and F1 score. RESULTS: Patients that were COVID-19 positive had significantly higher COVID-19 Diagnostic Scores (median .1 [IQR: 0.0–0.8] vs median 0.0 [IQR: 0.0–0.1], p < 0.001) than patients that were COVID-19 negative. Pre-implementation the AI-model performed well on temporal validation (AUROC 0.8) and external validation (AUROC 0.76 at Indiana U, AUROC 0.72 at Emory U). The model was noted to have unrealistic performance (AUROC > 0.95) using publicly available databases. Real-time model performance was unchanged over 19 weeks of implementation (AUROC 0.70). On subgroup analysis, the model had improved discrimination for patients with “severe” as compared to “mild or moderate” disease, p < 0.001. Model performance was highest in Asians and lowest in whites and similar between males and females. CONCLUSIONS AND RELEVANCE: AI-based diagnostic tools may serve as an adjunct, but not replacement, for clinical decision support of COVID-19 diagnosis, which largely hinges on exposure history, signs, and symptoms. While AI-based tools have not yet reached full diagnostic potential in COVID-19, they may still offer valuable information to clinicians taken into consideration along with clinical signs and symptoms.
format Online
Article
Text
id pubmed-8183017
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cornell University
record_format MEDLINE/PubMed
spelling pubmed-81830172021-06-08 A Prospective Observational Study to Investigate Performance of a Chest X-ray Artificial Intelligence Diagnostic Support Tool Across 12 U.S. Hospitals Sun, Ju Peng, Le Li, Taihui Adila, Dyah Zaiman, Zach Melton, Genevieve B. Ingraham, Nicholas Murray, Eric Boley, Daniel Switzer, Sean Burns, John L. Huang, Kun Allen, Tadashi Steenburg, Scott D. Gichoya, Judy Wawira Kummerfeld, Erich Tignanelli, Christopher ArXiv Article IMPORTANCE: An artificial intelligence (AI)-based model to predict COVID-19 likelihood from chest x-ray (CXR) findings can serve as an important adjunct to accelerate immediate clinical decision making and improve clinical decision making. Despite significant efforts, many limitations and biases exist in previously developed AI diagnostic models for COVID-19. Utilizing a large set of local and international CXR images, we developed an AI model with high performance on temporal and external validation. OBJECTIVE: Investigate real-time performance of an AI-enabled COVID-19 diagnostic support system across a 12-hospital system. DESIGN: Prospective observational study. SETTING: Labeled frontal CXR images (samples of COVID-19 and non-COVID-19) from the M Health Fairview (Minnesota, USA), Valencian Region Medical ImageBank (Spain), MIMIC-CXR, Open-I 2013 Chest X-ray Collection, GitHub COVID-19 Image Data Collection (International), Indiana University (Indiana, USA), and Emory University (Georgia, USA) PARTICIPANTS: Internal (training, temporal, and real-time validation): 51,592 CXRs; Public: 27,424 CXRs; External (Indiana University): 10,002 CXRs; External (Emory University): 2002 CXRs MAIN OUTCOME AND MEASURE: Model performance assessed via receiver operating characteristic (ROC), Precision-Recall curves, and F1 score. RESULTS: Patients that were COVID-19 positive had significantly higher COVID-19 Diagnostic Scores (median .1 [IQR: 0.0–0.8] vs median 0.0 [IQR: 0.0–0.1], p < 0.001) than patients that were COVID-19 negative. Pre-implementation the AI-model performed well on temporal validation (AUROC 0.8) and external validation (AUROC 0.76 at Indiana U, AUROC 0.72 at Emory U). The model was noted to have unrealistic performance (AUROC > 0.95) using publicly available databases. Real-time model performance was unchanged over 19 weeks of implementation (AUROC 0.70). On subgroup analysis, the model had improved discrimination for patients with “severe” as compared to “mild or moderate” disease, p < 0.001. Model performance was highest in Asians and lowest in whites and similar between males and females. CONCLUSIONS AND RELEVANCE: AI-based diagnostic tools may serve as an adjunct, but not replacement, for clinical decision support of COVID-19 diagnosis, which largely hinges on exposure history, signs, and symptoms. While AI-based tools have not yet reached full diagnostic potential in COVID-19, they may still offer valuable information to clinicians taken into consideration along with clinical signs and symptoms. Cornell University 2021-06-03 /pmc/articles/PMC8183017/ /pubmed/34099980 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Sun, Ju
Peng, Le
Li, Taihui
Adila, Dyah
Zaiman, Zach
Melton, Genevieve B.
Ingraham, Nicholas
Murray, Eric
Boley, Daniel
Switzer, Sean
Burns, John L.
Huang, Kun
Allen, Tadashi
Steenburg, Scott D.
Gichoya, Judy Wawira
Kummerfeld, Erich
Tignanelli, Christopher
A Prospective Observational Study to Investigate Performance of a Chest X-ray Artificial Intelligence Diagnostic Support Tool Across 12 U.S. Hospitals
title A Prospective Observational Study to Investigate Performance of a Chest X-ray Artificial Intelligence Diagnostic Support Tool Across 12 U.S. Hospitals
title_full A Prospective Observational Study to Investigate Performance of a Chest X-ray Artificial Intelligence Diagnostic Support Tool Across 12 U.S. Hospitals
title_fullStr A Prospective Observational Study to Investigate Performance of a Chest X-ray Artificial Intelligence Diagnostic Support Tool Across 12 U.S. Hospitals
title_full_unstemmed A Prospective Observational Study to Investigate Performance of a Chest X-ray Artificial Intelligence Diagnostic Support Tool Across 12 U.S. Hospitals
title_short A Prospective Observational Study to Investigate Performance of a Chest X-ray Artificial Intelligence Diagnostic Support Tool Across 12 U.S. Hospitals
title_sort prospective observational study to investigate performance of a chest x-ray artificial intelligence diagnostic support tool across 12 u.s. hospitals
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8183017/
https://www.ncbi.nlm.nih.gov/pubmed/34099980
work_keys_str_mv AT sunju aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT pengle aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT litaihui aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT adiladyah aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT zaimanzach aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT meltongenevieveb aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT ingrahamnicholas aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT murrayeric aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT boleydaniel aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT switzersean aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT burnsjohnl aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT huangkun aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT allentadashi aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT steenburgscottd aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT gichoyajudywawira aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT kummerfelderich aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT tignanellichristopher aprospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT sunju prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT pengle prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT litaihui prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT adiladyah prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT zaimanzach prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT meltongenevieveb prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT ingrahamnicholas prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT murrayeric prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT boleydaniel prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT switzersean prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT burnsjohnl prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT huangkun prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT allentadashi prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT steenburgscottd prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT gichoyajudywawira prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT kummerfelderich prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals
AT tignanellichristopher prospectiveobservationalstudytoinvestigateperformanceofachestxrayartificialintelligencediagnosticsupporttoolacross12ushospitals