Cargando…

Diagnostic accuracy of three computer-aided detection systems for detecting pulmonary tuberculosis on chest radiography when used for screening: Analysis of an international, multicenter migrants screening study

The aim of this study was to independently evaluate the diagnostic accuracy of three artificial intelligence (AI)-based computer aided detection (CAD) systems for detecting pulmonary tuberculosis (TB) on global migrants screening chest x-ray (CXR) cases when compared against both microbiological and...

Descripción completa

Detalles Bibliográficos
Autores principales: Gelaw, Sifrash Meseret, Kik, Sandra V., Ruhwald, Morten, Ongarello, Stefano, Egzertegegne, Tesfa Semagne, Gorbacheva, Olga, Gilpin, Christopher, Marano, Nina, Lee, Scott, Phares, Christina R., Medina, Victoria, Amatya, Bhaskar, Denkinger, Claudia M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10348531/
https://www.ncbi.nlm.nih.gov/pubmed/37450425
http://dx.doi.org/10.1371/journal.pgph.0000402
_version_ 1785073685468545024
author Gelaw, Sifrash Meseret
Kik, Sandra V.
Ruhwald, Morten
Ongarello, Stefano
Egzertegegne, Tesfa Semagne
Gorbacheva, Olga
Gilpin, Christopher
Marano, Nina
Lee, Scott
Phares, Christina R.
Medina, Victoria
Amatya, Bhaskar
Denkinger, Claudia M.
author_facet Gelaw, Sifrash Meseret
Kik, Sandra V.
Ruhwald, Morten
Ongarello, Stefano
Egzertegegne, Tesfa Semagne
Gorbacheva, Olga
Gilpin, Christopher
Marano, Nina
Lee, Scott
Phares, Christina R.
Medina, Victoria
Amatya, Bhaskar
Denkinger, Claudia M.
author_sort Gelaw, Sifrash Meseret
collection PubMed
description The aim of this study was to independently evaluate the diagnostic accuracy of three artificial intelligence (AI)-based computer aided detection (CAD) systems for detecting pulmonary tuberculosis (TB) on global migrants screening chest x-ray (CXR) cases when compared against both microbiological and radiological reference standards (MRS and RadRS, respectively). Retrospective clinical data and CXR images were collected from the International Organization for Migration (IOM) pre-migration health assessment TB screening global database for US-bound migrants. A total of 2,812 participants were included in the dataset used for analysis against RadRS, of which 1,769 (62.9%) had accompanying microbiological test results and were included against MRS. All CXRs were interpreted by three CAD systems (CAD4TB v6, Lunit INSIGHT v4.9.0, and qXR v2) in offline setting, and re-interpreted by two expert radiologists in a blinded fashion. The performance was evaluated using receiver operating characteristics curve (ROC), estimates of sensitivity and specificity at different CAD thresholds against both microbiological and radiological reference standards (MRS and RadRS, respectively), and was compared with that of the expert radiologists. The area under the curve against MRS was highest for Lunit (0.85; 95% CI 0.83−0.87), followed by qXR (0.75; 95% CI 0.72−0.77) and then CAD4TB (0.71; 95% CI 0.68−0.73). At a set specificity of 70%, Lunit had the highest sensitivity (81.4%; 95% CI 77.9–84.6); at a set sensitivity of 90%, specificity was also highest for Lunit (54.5%; 95% CI 51.7–57.3). The CAD systems performed comparable to the sensitivity (98.3%), and except CAD4TB, to specificity (13.7%) of the expert radiologists. Similar trends were observed when using RadRS. Area under the curve against RadRS was highest for CAD4TB (0.87; 95% CI 0.86–0.89) and Lunit (0.87; 95% CI 0.85–0.88) followed by qXR (0.81; 95% CI 0.80–0.83). At a set specificity of 70%, CAD4TB had highest sensitivity (84.1%; 95% CI 82.3−85.8) followed by Lunit (80.9%; 95% CI 78.9−82.7); and at a set sensitivity of 90%, specificity was also highest for CAD4TB (54.6%; 95% CI 51.3−57.8). In conclusion, the study demonstrated that the three CAD systems had broadly similar diagnostic accuracy with regard to TB screening and comparable accuracy to an expert radiologist against MRS. Compared with different reference standards, Lunit performed better than both qXR and CAD4TB against MRS, and CAD4TB and Lunit better than qXR against RadRS. Moreover, the performance of the CADs can be impacted by characteristics of subgroup of population. The main limitation was that our study relied on retrospective data and MRS was not routinely done in individuals with a low suspicion of TB and a normal CXR. Our findings suggest that CAD systems could be a useful tool for TB screening programs in remote, high TB prevalent places where access to expert radiologists may be limited. However, further large-scale prospective studies are needed to address outstanding questions around the operational performance and technical requirements of the CAD systems.
format Online
Article
Text
id pubmed-10348531
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-103485312023-07-15 Diagnostic accuracy of three computer-aided detection systems for detecting pulmonary tuberculosis on chest radiography when used for screening: Analysis of an international, multicenter migrants screening study Gelaw, Sifrash Meseret Kik, Sandra V. Ruhwald, Morten Ongarello, Stefano Egzertegegne, Tesfa Semagne Gorbacheva, Olga Gilpin, Christopher Marano, Nina Lee, Scott Phares, Christina R. Medina, Victoria Amatya, Bhaskar Denkinger, Claudia M. PLOS Glob Public Health Research Article The aim of this study was to independently evaluate the diagnostic accuracy of three artificial intelligence (AI)-based computer aided detection (CAD) systems for detecting pulmonary tuberculosis (TB) on global migrants screening chest x-ray (CXR) cases when compared against both microbiological and radiological reference standards (MRS and RadRS, respectively). Retrospective clinical data and CXR images were collected from the International Organization for Migration (IOM) pre-migration health assessment TB screening global database for US-bound migrants. A total of 2,812 participants were included in the dataset used for analysis against RadRS, of which 1,769 (62.9%) had accompanying microbiological test results and were included against MRS. All CXRs were interpreted by three CAD systems (CAD4TB v6, Lunit INSIGHT v4.9.0, and qXR v2) in offline setting, and re-interpreted by two expert radiologists in a blinded fashion. The performance was evaluated using receiver operating characteristics curve (ROC), estimates of sensitivity and specificity at different CAD thresholds against both microbiological and radiological reference standards (MRS and RadRS, respectively), and was compared with that of the expert radiologists. The area under the curve against MRS was highest for Lunit (0.85; 95% CI 0.83−0.87), followed by qXR (0.75; 95% CI 0.72−0.77) and then CAD4TB (0.71; 95% CI 0.68−0.73). At a set specificity of 70%, Lunit had the highest sensitivity (81.4%; 95% CI 77.9–84.6); at a set sensitivity of 90%, specificity was also highest for Lunit (54.5%; 95% CI 51.7–57.3). The CAD systems performed comparable to the sensitivity (98.3%), and except CAD4TB, to specificity (13.7%) of the expert radiologists. Similar trends were observed when using RadRS. Area under the curve against RadRS was highest for CAD4TB (0.87; 95% CI 0.86–0.89) and Lunit (0.87; 95% CI 0.85–0.88) followed by qXR (0.81; 95% CI 0.80–0.83). At a set specificity of 70%, CAD4TB had highest sensitivity (84.1%; 95% CI 82.3−85.8) followed by Lunit (80.9%; 95% CI 78.9−82.7); and at a set sensitivity of 90%, specificity was also highest for CAD4TB (54.6%; 95% CI 51.3−57.8). In conclusion, the study demonstrated that the three CAD systems had broadly similar diagnostic accuracy with regard to TB screening and comparable accuracy to an expert radiologist against MRS. Compared with different reference standards, Lunit performed better than both qXR and CAD4TB against MRS, and CAD4TB and Lunit better than qXR against RadRS. Moreover, the performance of the CADs can be impacted by characteristics of subgroup of population. The main limitation was that our study relied on retrospective data and MRS was not routinely done in individuals with a low suspicion of TB and a normal CXR. Our findings suggest that CAD systems could be a useful tool for TB screening programs in remote, high TB prevalent places where access to expert radiologists may be limited. However, further large-scale prospective studies are needed to address outstanding questions around the operational performance and technical requirements of the CAD systems. Public Library of Science 2023-07-14 /pmc/articles/PMC10348531/ /pubmed/37450425 http://dx.doi.org/10.1371/journal.pgph.0000402 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Gelaw, Sifrash Meseret
Kik, Sandra V.
Ruhwald, Morten
Ongarello, Stefano
Egzertegegne, Tesfa Semagne
Gorbacheva, Olga
Gilpin, Christopher
Marano, Nina
Lee, Scott
Phares, Christina R.
Medina, Victoria
Amatya, Bhaskar
Denkinger, Claudia M.
Diagnostic accuracy of three computer-aided detection systems for detecting pulmonary tuberculosis on chest radiography when used for screening: Analysis of an international, multicenter migrants screening study
title Diagnostic accuracy of three computer-aided detection systems for detecting pulmonary tuberculosis on chest radiography when used for screening: Analysis of an international, multicenter migrants screening study
title_full Diagnostic accuracy of three computer-aided detection systems for detecting pulmonary tuberculosis on chest radiography when used for screening: Analysis of an international, multicenter migrants screening study
title_fullStr Diagnostic accuracy of three computer-aided detection systems for detecting pulmonary tuberculosis on chest radiography when used for screening: Analysis of an international, multicenter migrants screening study
title_full_unstemmed Diagnostic accuracy of three computer-aided detection systems for detecting pulmonary tuberculosis on chest radiography when used for screening: Analysis of an international, multicenter migrants screening study
title_short Diagnostic accuracy of three computer-aided detection systems for detecting pulmonary tuberculosis on chest radiography when used for screening: Analysis of an international, multicenter migrants screening study
title_sort diagnostic accuracy of three computer-aided detection systems for detecting pulmonary tuberculosis on chest radiography when used for screening: analysis of an international, multicenter migrants screening study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10348531/
https://www.ncbi.nlm.nih.gov/pubmed/37450425
http://dx.doi.org/10.1371/journal.pgph.0000402
work_keys_str_mv AT gelawsifrashmeseret diagnosticaccuracyofthreecomputeraideddetectionsystemsfordetectingpulmonarytuberculosisonchestradiographywhenusedforscreeninganalysisofaninternationalmulticentermigrantsscreeningstudy
AT kiksandrav diagnosticaccuracyofthreecomputeraideddetectionsystemsfordetectingpulmonarytuberculosisonchestradiographywhenusedforscreeninganalysisofaninternationalmulticentermigrantsscreeningstudy
AT ruhwaldmorten diagnosticaccuracyofthreecomputeraideddetectionsystemsfordetectingpulmonarytuberculosisonchestradiographywhenusedforscreeninganalysisofaninternationalmulticentermigrantsscreeningstudy
AT ongarellostefano diagnosticaccuracyofthreecomputeraideddetectionsystemsfordetectingpulmonarytuberculosisonchestradiographywhenusedforscreeninganalysisofaninternationalmulticentermigrantsscreeningstudy
AT egzertegegnetesfasemagne diagnosticaccuracyofthreecomputeraideddetectionsystemsfordetectingpulmonarytuberculosisonchestradiographywhenusedforscreeninganalysisofaninternationalmulticentermigrantsscreeningstudy
AT gorbachevaolga diagnosticaccuracyofthreecomputeraideddetectionsystemsfordetectingpulmonarytuberculosisonchestradiographywhenusedforscreeninganalysisofaninternationalmulticentermigrantsscreeningstudy
AT gilpinchristopher diagnosticaccuracyofthreecomputeraideddetectionsystemsfordetectingpulmonarytuberculosisonchestradiographywhenusedforscreeninganalysisofaninternationalmulticentermigrantsscreeningstudy
AT maranonina diagnosticaccuracyofthreecomputeraideddetectionsystemsfordetectingpulmonarytuberculosisonchestradiographywhenusedforscreeninganalysisofaninternationalmulticentermigrantsscreeningstudy
AT leescott diagnosticaccuracyofthreecomputeraideddetectionsystemsfordetectingpulmonarytuberculosisonchestradiographywhenusedforscreeninganalysisofaninternationalmulticentermigrantsscreeningstudy
AT phareschristinar diagnosticaccuracyofthreecomputeraideddetectionsystemsfordetectingpulmonarytuberculosisonchestradiographywhenusedforscreeninganalysisofaninternationalmulticentermigrantsscreeningstudy
AT medinavictoria diagnosticaccuracyofthreecomputeraideddetectionsystemsfordetectingpulmonarytuberculosisonchestradiographywhenusedforscreeninganalysisofaninternationalmulticentermigrantsscreeningstudy
AT amatyabhaskar diagnosticaccuracyofthreecomputeraideddetectionsystemsfordetectingpulmonarytuberculosisonchestradiographywhenusedforscreeninganalysisofaninternationalmulticentermigrantsscreeningstudy
AT denkingerclaudiam diagnosticaccuracyofthreecomputeraideddetectionsystemsfordetectingpulmonarytuberculosisonchestradiographywhenusedforscreeninganalysisofaninternationalmulticentermigrantsscreeningstudy