Cargando…
The effect of confounding data features on a deep learning algorithm to predict complete coronary occlusion in a retrospective observational setting
AIMS: Deep learning (DL) has emerged in recent years as an effective technique in automated ECG analysis. METHODS AND RESULTS: A retrospective, observational study was designed to assess the feasibility of detecting induced coronary artery occlusion in human subjects earlier than experienced cardiol...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9707936/ https://www.ncbi.nlm.nih.gov/pubmed/36711180 http://dx.doi.org/10.1093/ehjdh/ztab002 |
_version_ | 1784840811392794624 |
---|---|
author | Brisk, Rob Bond, Raymond Finlay, Dewar McLaughlin, James Piadlo, Alicja Leslie, Stephen J Gossman, David E Menown, Ian B McEneaney, D J Warren, S |
author_facet | Brisk, Rob Bond, Raymond Finlay, Dewar McLaughlin, James Piadlo, Alicja Leslie, Stephen J Gossman, David E Menown, Ian B McEneaney, D J Warren, S |
author_sort | Brisk, Rob |
collection | PubMed |
description | AIMS: Deep learning (DL) has emerged in recent years as an effective technique in automated ECG analysis. METHODS AND RESULTS: A retrospective, observational study was designed to assess the feasibility of detecting induced coronary artery occlusion in human subjects earlier than experienced cardiologists using a DL algorithm. A deep convolutional neural network was trained using data from the STAFF III database. The task was to classify ECG samples as showing acute coronary artery occlusion, or no occlusion. Occluded samples were recorded after 60 s of balloon occlusion of a single coronary artery. For the first iteration of the experiment, non-occluded samples were taken from ECGs recorded in a restroom prior to entering theatres. For the second iteration of the experiment, non-occluded samples were taken in the theatre prior to balloon inflation. Results were obtained using a cross-validation approach. In the first iteration of the experiment, the DL model achieved an F1 score of 0.814, which was higher than any of three reviewing cardiologists or STEMI criteria. In the second iteration of the experiment, the DL model achieved an F1 score of 0.533, which is akin to the performance of a random chance classifier. CONCLUSION: The dataset was too small for the second model to achieve meaningful performance, despite the use of transfer learning. However, ‘data leakage’ during the first iteration of the experiment led to falsely high results. This study highlights the risk of DL models leveraging data leaks to produce spurious results. |
format | Online Article Text |
id | pubmed-9707936 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-97079362023-01-27 The effect of confounding data features on a deep learning algorithm to predict complete coronary occlusion in a retrospective observational setting Brisk, Rob Bond, Raymond Finlay, Dewar McLaughlin, James Piadlo, Alicja Leslie, Stephen J Gossman, David E Menown, Ian B McEneaney, D J Warren, S Eur Heart J Digit Health Original Article AIMS: Deep learning (DL) has emerged in recent years as an effective technique in automated ECG analysis. METHODS AND RESULTS: A retrospective, observational study was designed to assess the feasibility of detecting induced coronary artery occlusion in human subjects earlier than experienced cardiologists using a DL algorithm. A deep convolutional neural network was trained using data from the STAFF III database. The task was to classify ECG samples as showing acute coronary artery occlusion, or no occlusion. Occluded samples were recorded after 60 s of balloon occlusion of a single coronary artery. For the first iteration of the experiment, non-occluded samples were taken from ECGs recorded in a restroom prior to entering theatres. For the second iteration of the experiment, non-occluded samples were taken in the theatre prior to balloon inflation. Results were obtained using a cross-validation approach. In the first iteration of the experiment, the DL model achieved an F1 score of 0.814, which was higher than any of three reviewing cardiologists or STEMI criteria. In the second iteration of the experiment, the DL model achieved an F1 score of 0.533, which is akin to the performance of a random chance classifier. CONCLUSION: The dataset was too small for the second model to achieve meaningful performance, despite the use of transfer learning. However, ‘data leakage’ during the first iteration of the experiment led to falsely high results. This study highlights the risk of DL models leveraging data leaks to produce spurious results. Oxford University Press 2021-02-20 /pmc/articles/PMC9707936/ /pubmed/36711180 http://dx.doi.org/10.1093/ehjdh/ztab002 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the European Society of Cardiology. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Article Brisk, Rob Bond, Raymond Finlay, Dewar McLaughlin, James Piadlo, Alicja Leslie, Stephen J Gossman, David E Menown, Ian B McEneaney, D J Warren, S The effect of confounding data features on a deep learning algorithm to predict complete coronary occlusion in a retrospective observational setting |
title | The effect of confounding data features on a deep learning algorithm to predict complete coronary occlusion in a retrospective observational setting |
title_full | The effect of confounding data features on a deep learning algorithm to predict complete coronary occlusion in a retrospective observational setting |
title_fullStr | The effect of confounding data features on a deep learning algorithm to predict complete coronary occlusion in a retrospective observational setting |
title_full_unstemmed | The effect of confounding data features on a deep learning algorithm to predict complete coronary occlusion in a retrospective observational setting |
title_short | The effect of confounding data features on a deep learning algorithm to predict complete coronary occlusion in a retrospective observational setting |
title_sort | effect of confounding data features on a deep learning algorithm to predict complete coronary occlusion in a retrospective observational setting |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9707936/ https://www.ncbi.nlm.nih.gov/pubmed/36711180 http://dx.doi.org/10.1093/ehjdh/ztab002 |
work_keys_str_mv | AT briskrob theeffectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT bondraymond theeffectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT finlaydewar theeffectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT mclaughlinjames theeffectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT piadloalicja theeffectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT lesliestephenj theeffectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT gossmandavide theeffectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT menownianb theeffectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT mceneaneydj theeffectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT warrens theeffectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT briskrob effectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT bondraymond effectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT finlaydewar effectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT mclaughlinjames effectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT piadloalicja effectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT lesliestephenj effectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT gossmandavide effectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT menownianb effectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT mceneaneydj effectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting AT warrens effectofconfoundingdatafeaturesonadeeplearningalgorithmtopredictcompletecoronaryocclusioninaretrospectiveobservationalsetting |