Cargando…

Using artificial intelligence to read chest radiographs for tuberculosis detection: A multi-site evaluation of the diagnostic accuracy of three deep learning systems

Deep learning (DL) neural networks have only recently been employed to interpret chest radiography (CXR) to screen and triage people for pulmonary tuberculosis (TB). No published studies have compared multiple DL systems and populations. We conducted a retrospective evaluation of three DL systems (C...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Zhi Zhen, Sander, Melissa S., Rai, Bishwa, Titahong, Collins N., Sudrungrot, Santat, Laah, Sylvain N., Adhikari, Lal Mani, Carter, E. Jane, Puri, Lekha, Codlin, Andrew J., Creswell, Jacob
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6802077/
https://www.ncbi.nlm.nih.gov/pubmed/31628424
http://dx.doi.org/10.1038/s41598-019-51503-3
_version_ 1783460727411965952
author Qin, Zhi Zhen
Sander, Melissa S.
Rai, Bishwa
Titahong, Collins N.
Sudrungrot, Santat
Laah, Sylvain N.
Adhikari, Lal Mani
Carter, E. Jane
Puri, Lekha
Codlin, Andrew J.
Creswell, Jacob
author_facet Qin, Zhi Zhen
Sander, Melissa S.
Rai, Bishwa
Titahong, Collins N.
Sudrungrot, Santat
Laah, Sylvain N.
Adhikari, Lal Mani
Carter, E. Jane
Puri, Lekha
Codlin, Andrew J.
Creswell, Jacob
author_sort Qin, Zhi Zhen
collection PubMed
description Deep learning (DL) neural networks have only recently been employed to interpret chest radiography (CXR) to screen and triage people for pulmonary tuberculosis (TB). No published studies have compared multiple DL systems and populations. We conducted a retrospective evaluation of three DL systems (CAD4TB, Lunit INSIGHT, and qXR) for detecting TB-associated abnormalities in chest radiographs from outpatients in Nepal and Cameroon. All 1196 individuals received a Xpert MTB/RIF assay and a CXR read by two groups of radiologists and the DL systems. Xpert was used as the reference standard. The area under the curve of the three systems was similar: Lunit (0.94, 95% CI: 0.93–0.96), qXR (0.94, 95% CI: 0.92–0.97) and CAD4TB (0.92, 95% CI: 0.90–0.95). When matching the sensitivity of the radiologists, the specificities of the DL systems were significantly higher except for one. Using DL systems to read CXRs could reduce the number of Xpert MTB/RIF tests needed by 66% while maintaining sensitivity at 95% or better. Using a universal cutoff score resulted different performance in each site, highlighting the need to select scores based on the population screened. These DL systems should be considered by TB programs where human resources are constrained, and automated technology is available.
format Online
Article
Text
id pubmed-6802077
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-68020772019-10-24 Using artificial intelligence to read chest radiographs for tuberculosis detection: A multi-site evaluation of the diagnostic accuracy of three deep learning systems Qin, Zhi Zhen Sander, Melissa S. Rai, Bishwa Titahong, Collins N. Sudrungrot, Santat Laah, Sylvain N. Adhikari, Lal Mani Carter, E. Jane Puri, Lekha Codlin, Andrew J. Creswell, Jacob Sci Rep Article Deep learning (DL) neural networks have only recently been employed to interpret chest radiography (CXR) to screen and triage people for pulmonary tuberculosis (TB). No published studies have compared multiple DL systems and populations. We conducted a retrospective evaluation of three DL systems (CAD4TB, Lunit INSIGHT, and qXR) for detecting TB-associated abnormalities in chest radiographs from outpatients in Nepal and Cameroon. All 1196 individuals received a Xpert MTB/RIF assay and a CXR read by two groups of radiologists and the DL systems. Xpert was used as the reference standard. The area under the curve of the three systems was similar: Lunit (0.94, 95% CI: 0.93–0.96), qXR (0.94, 95% CI: 0.92–0.97) and CAD4TB (0.92, 95% CI: 0.90–0.95). When matching the sensitivity of the radiologists, the specificities of the DL systems were significantly higher except for one. Using DL systems to read CXRs could reduce the number of Xpert MTB/RIF tests needed by 66% while maintaining sensitivity at 95% or better. Using a universal cutoff score resulted different performance in each site, highlighting the need to select scores based on the population screened. These DL systems should be considered by TB programs where human resources are constrained, and automated technology is available. Nature Publishing Group UK 2019-10-18 /pmc/articles/PMC6802077/ /pubmed/31628424 http://dx.doi.org/10.1038/s41598-019-51503-3 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Qin, Zhi Zhen
Sander, Melissa S.
Rai, Bishwa
Titahong, Collins N.
Sudrungrot, Santat
Laah, Sylvain N.
Adhikari, Lal Mani
Carter, E. Jane
Puri, Lekha
Codlin, Andrew J.
Creswell, Jacob
Using artificial intelligence to read chest radiographs for tuberculosis detection: A multi-site evaluation of the diagnostic accuracy of three deep learning systems
title Using artificial intelligence to read chest radiographs for tuberculosis detection: A multi-site evaluation of the diagnostic accuracy of three deep learning systems
title_full Using artificial intelligence to read chest radiographs for tuberculosis detection: A multi-site evaluation of the diagnostic accuracy of three deep learning systems
title_fullStr Using artificial intelligence to read chest radiographs for tuberculosis detection: A multi-site evaluation of the diagnostic accuracy of three deep learning systems
title_full_unstemmed Using artificial intelligence to read chest radiographs for tuberculosis detection: A multi-site evaluation of the diagnostic accuracy of three deep learning systems
title_short Using artificial intelligence to read chest radiographs for tuberculosis detection: A multi-site evaluation of the diagnostic accuracy of three deep learning systems
title_sort using artificial intelligence to read chest radiographs for tuberculosis detection: a multi-site evaluation of the diagnostic accuracy of three deep learning systems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6802077/
https://www.ncbi.nlm.nih.gov/pubmed/31628424
http://dx.doi.org/10.1038/s41598-019-51503-3
work_keys_str_mv AT qinzhizhen usingartificialintelligencetoreadchestradiographsfortuberculosisdetectionamultisiteevaluationofthediagnosticaccuracyofthreedeeplearningsystems
AT sandermelissas usingartificialintelligencetoreadchestradiographsfortuberculosisdetectionamultisiteevaluationofthediagnosticaccuracyofthreedeeplearningsystems
AT raibishwa usingartificialintelligencetoreadchestradiographsfortuberculosisdetectionamultisiteevaluationofthediagnosticaccuracyofthreedeeplearningsystems
AT titahongcollinsn usingartificialintelligencetoreadchestradiographsfortuberculosisdetectionamultisiteevaluationofthediagnosticaccuracyofthreedeeplearningsystems
AT sudrungrotsantat usingartificialintelligencetoreadchestradiographsfortuberculosisdetectionamultisiteevaluationofthediagnosticaccuracyofthreedeeplearningsystems
AT laahsylvainn usingartificialintelligencetoreadchestradiographsfortuberculosisdetectionamultisiteevaluationofthediagnosticaccuracyofthreedeeplearningsystems
AT adhikarilalmani usingartificialintelligencetoreadchestradiographsfortuberculosisdetectionamultisiteevaluationofthediagnosticaccuracyofthreedeeplearningsystems
AT carterejane usingartificialintelligencetoreadchestradiographsfortuberculosisdetectionamultisiteevaluationofthediagnosticaccuracyofthreedeeplearningsystems
AT purilekha usingartificialintelligencetoreadchestradiographsfortuberculosisdetectionamultisiteevaluationofthediagnosticaccuracyofthreedeeplearningsystems
AT codlinandrewj usingartificialintelligencetoreadchestradiographsfortuberculosisdetectionamultisiteevaluationofthediagnosticaccuracyofthreedeeplearningsystems
AT creswelljacob usingartificialintelligencetoreadchestradiographsfortuberculosisdetectionamultisiteevaluationofthediagnosticaccuracyofthreedeeplearningsystems