Cargando…

Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography

OBJECTIVES: To evaluate the ability of a commercially available comprehensive chest radiography deep convolutional neural network (DCNN) to detect simple and tension pneumothorax, as stratified by the following subgroups: the presence of an intercostal drain; rib, clavicular, scapular or humeral fra...

Descripción completa

Detalles Bibliográficos
Autores principales:	Seah, Jarrel, Tang, Cyril, Buchlak, Quinlan D, Milne, Michael Robert, Holt, Xavier, Ahmad, Hassan, Lambert, John, Esmaili, Nazanin, Oakden-Rayner, Luke, Brotchie, Peter, Jones, Catherine M
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BMJ Publishing Group 2021
Materias:	Diagnostics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8655590/ https://www.ncbi.nlm.nih.gov/pubmed/34876430 http://dx.doi.org/10.1136/bmjopen-2021-053024

_version_	1784612105409789952
author	Seah, Jarrel Tang, Cyril Buchlak, Quinlan D Milne, Michael Robert Holt, Xavier Ahmad, Hassan Lambert, John Esmaili, Nazanin Oakden-Rayner, Luke Brotchie, Peter Jones, Catherine M
author_facet	Seah, Jarrel Tang, Cyril Buchlak, Quinlan D Milne, Michael Robert Holt, Xavier Ahmad, Hassan Lambert, John Esmaili, Nazanin Oakden-Rayner, Luke Brotchie, Peter Jones, Catherine M
author_sort	Seah, Jarrel
collection	PubMed
description	OBJECTIVES: To evaluate the ability of a commercially available comprehensive chest radiography deep convolutional neural network (DCNN) to detect simple and tension pneumothorax, as stratified by the following subgroups: the presence of an intercostal drain; rib, clavicular, scapular or humeral fractures or rib resections; subcutaneous emphysema and erect versus non-erect positioning. The hypothesis was that performance would not differ significantly in each of these subgroups when compared with the overall test dataset. DESIGN: A retrospective case–control study was undertaken. SETTING: Community radiology clinics and hospitals in Australia and the USA. PARTICIPANTS: A test dataset of 2557 chest radiography studies was ground-truthed by three subspecialty thoracic radiologists for the presence of simple or tension pneumothorax as well as each subgroup other than positioning. Radiograph positioning was derived from radiographer annotations on the images. OUTCOME MEASURES: DCNN performance for detecting simple and tension pneumothorax was evaluated over the entire test set, as well as within each subgroup, using the area under the receiver operating characteristic curve (AUC). A difference in AUC of more than 0.05 was considered clinically significant. RESULTS: When compared with the overall test set, performance of the DCNN for detecting simple and tension pneumothorax was statistically non-inferior in all subgroups. The DCNN had an AUC of 0.981 (0.976–0.986) for detecting simple pneumothorax and 0.997 (0.995–0.999) for detecting tension pneumothorax. CONCLUSIONS: Hidden stratification has significant implications for potential failures of deep learning when applied in clinical practice. This study demonstrated that a comprehensively trained DCNN can be resilient to hidden stratification in several clinically meaningful subgroups in detecting pneumothorax.
format	Online Article Text
id	pubmed-8655590
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	BMJ Publishing Group
record_format	MEDLINE/PubMed
spelling	pubmed-86555902021-12-27 Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography Seah, Jarrel Tang, Cyril Buchlak, Quinlan D Milne, Michael Robert Holt, Xavier Ahmad, Hassan Lambert, John Esmaili, Nazanin Oakden-Rayner, Luke Brotchie, Peter Jones, Catherine M BMJ Open Diagnostics OBJECTIVES: To evaluate the ability of a commercially available comprehensive chest radiography deep convolutional neural network (DCNN) to detect simple and tension pneumothorax, as stratified by the following subgroups: the presence of an intercostal drain; rib, clavicular, scapular or humeral fractures or rib resections; subcutaneous emphysema and erect versus non-erect positioning. The hypothesis was that performance would not differ significantly in each of these subgroups when compared with the overall test dataset. DESIGN: A retrospective case–control study was undertaken. SETTING: Community radiology clinics and hospitals in Australia and the USA. PARTICIPANTS: A test dataset of 2557 chest radiography studies was ground-truthed by three subspecialty thoracic radiologists for the presence of simple or tension pneumothorax as well as each subgroup other than positioning. Radiograph positioning was derived from radiographer annotations on the images. OUTCOME MEASURES: DCNN performance for detecting simple and tension pneumothorax was evaluated over the entire test set, as well as within each subgroup, using the area under the receiver operating characteristic curve (AUC). A difference in AUC of more than 0.05 was considered clinically significant. RESULTS: When compared with the overall test set, performance of the DCNN for detecting simple and tension pneumothorax was statistically non-inferior in all subgroups. The DCNN had an AUC of 0.981 (0.976–0.986) for detecting simple pneumothorax and 0.997 (0.995–0.999) for detecting tension pneumothorax. CONCLUSIONS: Hidden stratification has significant implications for potential failures of deep learning when applied in clinical practice. This study demonstrated that a comprehensively trained DCNN can be resilient to hidden stratification in several clinically meaningful subgroups in detecting pneumothorax. BMJ Publishing Group 2021-12-06 /pmc/articles/PMC8655590/ /pubmed/34876430 http://dx.doi.org/10.1136/bmjopen-2021-053024 Text en © Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle	Diagnostics Seah, Jarrel Tang, Cyril Buchlak, Quinlan D Milne, Michael Robert Holt, Xavier Ahmad, Hassan Lambert, John Esmaili, Nazanin Oakden-Rayner, Luke Brotchie, Peter Jones, Catherine M Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography
title	Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography
title_full	Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography
title_fullStr	Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography
title_full_unstemmed	Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography
title_short	Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography
title_sort	do comprehensive deep learning algorithms suffer from hidden stratification? a retrospective study on pneumothorax detection in chest radiography
topic	Diagnostics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8655590/ https://www.ncbi.nlm.nih.gov/pubmed/34876430 http://dx.doi.org/10.1136/bmjopen-2021-053024
work_keys_str_mv	AT seahjarrel docomprehensivedeeplearningalgorithmssufferfromhiddenstratificationaretrospectivestudyonpneumothoraxdetectioninchestradiography AT tangcyril docomprehensivedeeplearningalgorithmssufferfromhiddenstratificationaretrospectivestudyonpneumothoraxdetectioninchestradiography AT buchlakquinland docomprehensivedeeplearningalgorithmssufferfromhiddenstratificationaretrospectivestudyonpneumothoraxdetectioninchestradiography AT milnemichaelrobert docomprehensivedeeplearningalgorithmssufferfromhiddenstratificationaretrospectivestudyonpneumothoraxdetectioninchestradiography AT holtxavier docomprehensivedeeplearningalgorithmssufferfromhiddenstratificationaretrospectivestudyonpneumothoraxdetectioninchestradiography AT ahmadhassan docomprehensivedeeplearningalgorithmssufferfromhiddenstratificationaretrospectivestudyonpneumothoraxdetectioninchestradiography AT lambertjohn docomprehensivedeeplearningalgorithmssufferfromhiddenstratificationaretrospectivestudyonpneumothoraxdetectioninchestradiography AT esmailinazanin docomprehensivedeeplearningalgorithmssufferfromhiddenstratificationaretrospectivestudyonpneumothoraxdetectioninchestradiography AT oakdenraynerluke docomprehensivedeeplearningalgorithmssufferfromhiddenstratificationaretrospectivestudyonpneumothoraxdetectioninchestradiography AT brotchiepeter docomprehensivedeeplearningalgorithmssufferfromhiddenstratificationaretrospectivestudyonpneumothoraxdetectioninchestradiography AT jonescatherinem docomprehensivedeeplearningalgorithmssufferfromhiddenstratificationaretrospectivestudyonpneumothoraxdetectioninchestradiography

Do comprehensive deep learning algorithms suffer from hidden stratification? A retrospective study on pneumothorax detection in chest radiography

Ejemplares similares