Cargando…
Pneumothorax detection in chest radiographs: optimizing artificial intelligence system for accuracy and confounding bias reduction using in-image annotations in algorithm training
OBJECTIVES: Diagnostic accuracy of artificial intelligence (AI) pneumothorax (PTX) detection in chest radiographs (CXR) is limited by the noisy annotation quality of public training data and confounding thoracic tubes (TT). We hypothesize that in-image annotations of the dehiscent visceral pleura fo...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8452588/ https://www.ncbi.nlm.nih.gov/pubmed/33774722 http://dx.doi.org/10.1007/s00330-021-07833-w |
_version_ | 1784570102737272832 |
---|---|
author | Rueckel, Johannes Huemmer, Christian Fieselmann, Andreas Ghesu, Florin-Cristian Mansoor, Awais Schachtner, Balthasar Wesp, Philipp Trappmann, Lena Munawwar, Basel Ricke, Jens Ingrisch, Michael Sabel, Bastian O. |
author_facet | Rueckel, Johannes Huemmer, Christian Fieselmann, Andreas Ghesu, Florin-Cristian Mansoor, Awais Schachtner, Balthasar Wesp, Philipp Trappmann, Lena Munawwar, Basel Ricke, Jens Ingrisch, Michael Sabel, Bastian O. |
author_sort | Rueckel, Johannes |
collection | PubMed |
description | OBJECTIVES: Diagnostic accuracy of artificial intelligence (AI) pneumothorax (PTX) detection in chest radiographs (CXR) is limited by the noisy annotation quality of public training data and confounding thoracic tubes (TT). We hypothesize that in-image annotations of the dehiscent visceral pleura for algorithm training boosts algorithm’s performance and suppresses confounders. METHODS: Our single-center evaluation cohort of 3062 supine CXRs includes 760 PTX-positive cases with radiological annotations of PTX size and inserted TTs. Three step-by-step improved algorithms (differing in algorithm architecture, training data from public datasets/clinical sites, and in-image annotations included in algorithm training) were characterized by area under the receiver operating characteristics (AUROC) in detailed subgroup analyses and referenced to the well-established “CheXNet” algorithm. RESULTS: Performances of established algorithms exclusively trained on publicly available data without in-image annotations are limited to AUROCs of 0.778 and strongly biased towards TTs that can completely eliminate algorithm’s discriminative power in individual subgroups. Contrarily, our final “algorithm 2” which was trained on a lower number of images but additionally with in-image annotations of the dehiscent pleura achieved an overall AUROC of 0.877 for unilateral PTX detection with a significantly reduced TT-related confounding bias. CONCLUSIONS: We demonstrated strong limitations of an established PTX-detecting AI algorithm that can be significantly reduced by designing an AI system capable of learning to both classify and localize PTX. Our results are aimed at drawing attention to the necessity of high-quality in-image localization in training data to reduce the risks of unintentionally biasing the training process of pathology-detecting AI algorithms. KEY POINTS: • Established pneumothorax-detecting artificial intelligence algorithms trained on public training data are strongly limited and biased by confounding thoracic tubes. • We used high-quality in-image annotated training data to effectively boost algorithm performance and suppress the impact of confounding thoracic tubes. • Based on our results, we hypothesize that even hidden confounders might be effectively addressed by in-image annotations of pathology-related image features. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00330-021-07833-w. |
format | Online Article Text |
id | pubmed-8452588 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-84525882021-10-05 Pneumothorax detection in chest radiographs: optimizing artificial intelligence system for accuracy and confounding bias reduction using in-image annotations in algorithm training Rueckel, Johannes Huemmer, Christian Fieselmann, Andreas Ghesu, Florin-Cristian Mansoor, Awais Schachtner, Balthasar Wesp, Philipp Trappmann, Lena Munawwar, Basel Ricke, Jens Ingrisch, Michael Sabel, Bastian O. Eur Radiol Imaging Informatics and Artificial Intelligence OBJECTIVES: Diagnostic accuracy of artificial intelligence (AI) pneumothorax (PTX) detection in chest radiographs (CXR) is limited by the noisy annotation quality of public training data and confounding thoracic tubes (TT). We hypothesize that in-image annotations of the dehiscent visceral pleura for algorithm training boosts algorithm’s performance and suppresses confounders. METHODS: Our single-center evaluation cohort of 3062 supine CXRs includes 760 PTX-positive cases with radiological annotations of PTX size and inserted TTs. Three step-by-step improved algorithms (differing in algorithm architecture, training data from public datasets/clinical sites, and in-image annotations included in algorithm training) were characterized by area under the receiver operating characteristics (AUROC) in detailed subgroup analyses and referenced to the well-established “CheXNet” algorithm. RESULTS: Performances of established algorithms exclusively trained on publicly available data without in-image annotations are limited to AUROCs of 0.778 and strongly biased towards TTs that can completely eliminate algorithm’s discriminative power in individual subgroups. Contrarily, our final “algorithm 2” which was trained on a lower number of images but additionally with in-image annotations of the dehiscent pleura achieved an overall AUROC of 0.877 for unilateral PTX detection with a significantly reduced TT-related confounding bias. CONCLUSIONS: We demonstrated strong limitations of an established PTX-detecting AI algorithm that can be significantly reduced by designing an AI system capable of learning to both classify and localize PTX. Our results are aimed at drawing attention to the necessity of high-quality in-image localization in training data to reduce the risks of unintentionally biasing the training process of pathology-detecting AI algorithms. KEY POINTS: • Established pneumothorax-detecting artificial intelligence algorithms trained on public training data are strongly limited and biased by confounding thoracic tubes. • We used high-quality in-image annotated training data to effectively boost algorithm performance and suppress the impact of confounding thoracic tubes. • Based on our results, we hypothesize that even hidden confounders might be effectively addressed by in-image annotations of pathology-related image features. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00330-021-07833-w. Springer Berlin Heidelberg 2021-03-27 2021 /pmc/articles/PMC8452588/ /pubmed/33774722 http://dx.doi.org/10.1007/s00330-021-07833-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Imaging Informatics and Artificial Intelligence Rueckel, Johannes Huemmer, Christian Fieselmann, Andreas Ghesu, Florin-Cristian Mansoor, Awais Schachtner, Balthasar Wesp, Philipp Trappmann, Lena Munawwar, Basel Ricke, Jens Ingrisch, Michael Sabel, Bastian O. Pneumothorax detection in chest radiographs: optimizing artificial intelligence system for accuracy and confounding bias reduction using in-image annotations in algorithm training |
title | Pneumothorax detection in chest radiographs: optimizing artificial intelligence system for accuracy and confounding bias reduction using in-image annotations in algorithm training |
title_full | Pneumothorax detection in chest radiographs: optimizing artificial intelligence system for accuracy and confounding bias reduction using in-image annotations in algorithm training |
title_fullStr | Pneumothorax detection in chest radiographs: optimizing artificial intelligence system for accuracy and confounding bias reduction using in-image annotations in algorithm training |
title_full_unstemmed | Pneumothorax detection in chest radiographs: optimizing artificial intelligence system for accuracy and confounding bias reduction using in-image annotations in algorithm training |
title_short | Pneumothorax detection in chest radiographs: optimizing artificial intelligence system for accuracy and confounding bias reduction using in-image annotations in algorithm training |
title_sort | pneumothorax detection in chest radiographs: optimizing artificial intelligence system for accuracy and confounding bias reduction using in-image annotations in algorithm training |
topic | Imaging Informatics and Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8452588/ https://www.ncbi.nlm.nih.gov/pubmed/33774722 http://dx.doi.org/10.1007/s00330-021-07833-w |
work_keys_str_mv | AT rueckeljohannes pneumothoraxdetectioninchestradiographsoptimizingartificialintelligencesystemforaccuracyandconfoundingbiasreductionusinginimageannotationsinalgorithmtraining AT huemmerchristian pneumothoraxdetectioninchestradiographsoptimizingartificialintelligencesystemforaccuracyandconfoundingbiasreductionusinginimageannotationsinalgorithmtraining AT fieselmannandreas pneumothoraxdetectioninchestradiographsoptimizingartificialintelligencesystemforaccuracyandconfoundingbiasreductionusinginimageannotationsinalgorithmtraining AT ghesuflorincristian pneumothoraxdetectioninchestradiographsoptimizingartificialintelligencesystemforaccuracyandconfoundingbiasreductionusinginimageannotationsinalgorithmtraining AT mansoorawais pneumothoraxdetectioninchestradiographsoptimizingartificialintelligencesystemforaccuracyandconfoundingbiasreductionusinginimageannotationsinalgorithmtraining AT schachtnerbalthasar pneumothoraxdetectioninchestradiographsoptimizingartificialintelligencesystemforaccuracyandconfoundingbiasreductionusinginimageannotationsinalgorithmtraining AT wespphilipp pneumothoraxdetectioninchestradiographsoptimizingartificialintelligencesystemforaccuracyandconfoundingbiasreductionusinginimageannotationsinalgorithmtraining AT trappmannlena pneumothoraxdetectioninchestradiographsoptimizingartificialintelligencesystemforaccuracyandconfoundingbiasreductionusinginimageannotationsinalgorithmtraining AT munawwarbasel pneumothoraxdetectioninchestradiographsoptimizingartificialintelligencesystemforaccuracyandconfoundingbiasreductionusinginimageannotationsinalgorithmtraining AT rickejens pneumothoraxdetectioninchestradiographsoptimizingartificialintelligencesystemforaccuracyandconfoundingbiasreductionusinginimageannotationsinalgorithmtraining AT ingrischmichael pneumothoraxdetectioninchestradiographsoptimizingartificialintelligencesystemforaccuracyandconfoundingbiasreductionusinginimageannotationsinalgorithmtraining AT sabelbastiano pneumothoraxdetectioninchestradiographsoptimizingartificialintelligencesystemforaccuracyandconfoundingbiasreductionusinginimageannotationsinalgorithmtraining |