Cargando…

Machine learning analysis of confounding variables of a convolutional neural network specific for abdominal aortic aneurysms

OBJECTIVE: To identify confounding variables influencing the accuracy of a convolutional neural network (CNN) specific for infrarenal abdominal aortic aneurysms (AAAs) on computed tomography angiograms (CTAs). METHODS: A Health Insurance Portability and Accountability Act-compliant, institutional re...

Descripción completa

Detalles Bibliográficos
Autores principales: Tomihama, Roger T., Camara, Justin R., Kiang, Sharon C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10245322/
https://www.ncbi.nlm.nih.gov/pubmed/37292186
http://dx.doi.org/10.1016/j.jvssci.2022.11.004
_version_ 1785054840530927616
author Tomihama, Roger T.
Camara, Justin R.
Kiang, Sharon C.
author_facet Tomihama, Roger T.
Camara, Justin R.
Kiang, Sharon C.
author_sort Tomihama, Roger T.
collection PubMed
description OBJECTIVE: To identify confounding variables influencing the accuracy of a convolutional neural network (CNN) specific for infrarenal abdominal aortic aneurysms (AAAs) on computed tomography angiograms (CTAs). METHODS: A Health Insurance Portability and Accountability Act-compliant, institutional review board-approved, retrospective study analyzed abdominopelvic CTA scans from 200 patients with infrarenal AAAs and 200 propensity-matched control patients. An AAA-specific trained CNN was developed by the application of transfer learning to the VGG-16 base model using model training, validation, and testing techniques. Model accuracy and area under the curve were analyzed based on data sets (selected, balanced, or unbalanced), aneurysm size, extra-abdominal extension, dissections, and mural thrombus. Misjudgments were analyzed by review of heatmaps, via gradient weighted class activation, overlaid on CTA images. RESULTS: The trained custom CNN model reported high test group accuracies of 94.1%, 99.1%, and 99.6% and area under the curve of 0.9900, 0.9998, and 0.9993 in selected (n = 120), balanced (n = 3704), and unbalanced image sets (n = 31,899), respectively. Despite an eightfold difference between balanced and unbalanced image sets, the CNN model demonstrated high test group sensitivities (98.7% vs 98.9%) and specificities (99.7% vs 99.3%) in unbalanced and balanced image sets, respectively. For aneurysm size, the CNN model demonstrates decreasing misjudgments as aneurysm size increases: 47% (16/34) for aneurysms <3.3 cm, 32% (11/34) for aneurysms 3.3 to 5 cm, and 20% (7/34) for aneurysms >5 cm. Aneurysms containing measurable mural thrombus were over-represented within type II (false-negative) misjudgments compared with type I (false-positive) misjudgments (71% vs 15%, P < .05). Inclusion of extra-abdominal aneurysm extension (thoracic or iliac artery) or dissection flaps in these imaging sets did not decrease the model's overall accuracy, indicating that the model performance was excellent without the need to clean the data set of confounding or comorbid diagnoses. CONCLUSIONS: Analysis of an AAA-specific CNN model can accurately screen and identify infrarenal AAAs on CTA despite varying pathology and quantitative data sets. The highest anatomic misjudgments were with small aneurysms (<3.3 cm) or the presence of mural thrombus. Accuracy of the CNN model is maintained despite the inclusion of extra-abdominal pathology and imbalanced data sets.
format Online
Article
Text
id pubmed-10245322
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-102453222023-06-08 Machine learning analysis of confounding variables of a convolutional neural network specific for abdominal aortic aneurysms Tomihama, Roger T. Camara, Justin R. Kiang, Sharon C. JVS Vasc Sci Article OBJECTIVE: To identify confounding variables influencing the accuracy of a convolutional neural network (CNN) specific for infrarenal abdominal aortic aneurysms (AAAs) on computed tomography angiograms (CTAs). METHODS: A Health Insurance Portability and Accountability Act-compliant, institutional review board-approved, retrospective study analyzed abdominopelvic CTA scans from 200 patients with infrarenal AAAs and 200 propensity-matched control patients. An AAA-specific trained CNN was developed by the application of transfer learning to the VGG-16 base model using model training, validation, and testing techniques. Model accuracy and area under the curve were analyzed based on data sets (selected, balanced, or unbalanced), aneurysm size, extra-abdominal extension, dissections, and mural thrombus. Misjudgments were analyzed by review of heatmaps, via gradient weighted class activation, overlaid on CTA images. RESULTS: The trained custom CNN model reported high test group accuracies of 94.1%, 99.1%, and 99.6% and area under the curve of 0.9900, 0.9998, and 0.9993 in selected (n = 120), balanced (n = 3704), and unbalanced image sets (n = 31,899), respectively. Despite an eightfold difference between balanced and unbalanced image sets, the CNN model demonstrated high test group sensitivities (98.7% vs 98.9%) and specificities (99.7% vs 99.3%) in unbalanced and balanced image sets, respectively. For aneurysm size, the CNN model demonstrates decreasing misjudgments as aneurysm size increases: 47% (16/34) for aneurysms <3.3 cm, 32% (11/34) for aneurysms 3.3 to 5 cm, and 20% (7/34) for aneurysms >5 cm. Aneurysms containing measurable mural thrombus were over-represented within type II (false-negative) misjudgments compared with type I (false-positive) misjudgments (71% vs 15%, P < .05). Inclusion of extra-abdominal aneurysm extension (thoracic or iliac artery) or dissection flaps in these imaging sets did not decrease the model's overall accuracy, indicating that the model performance was excellent without the need to clean the data set of confounding or comorbid diagnoses. CONCLUSIONS: Analysis of an AAA-specific CNN model can accurately screen and identify infrarenal AAAs on CTA despite varying pathology and quantitative data sets. The highest anatomic misjudgments were with small aneurysms (<3.3 cm) or the presence of mural thrombus. Accuracy of the CNN model is maintained despite the inclusion of extra-abdominal pathology and imbalanced data sets. Elsevier 2023-01-13 /pmc/articles/PMC10245322/ /pubmed/37292186 http://dx.doi.org/10.1016/j.jvssci.2022.11.004 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Tomihama, Roger T.
Camara, Justin R.
Kiang, Sharon C.
Machine learning analysis of confounding variables of a convolutional neural network specific for abdominal aortic aneurysms
title Machine learning analysis of confounding variables of a convolutional neural network specific for abdominal aortic aneurysms
title_full Machine learning analysis of confounding variables of a convolutional neural network specific for abdominal aortic aneurysms
title_fullStr Machine learning analysis of confounding variables of a convolutional neural network specific for abdominal aortic aneurysms
title_full_unstemmed Machine learning analysis of confounding variables of a convolutional neural network specific for abdominal aortic aneurysms
title_short Machine learning analysis of confounding variables of a convolutional neural network specific for abdominal aortic aneurysms
title_sort machine learning analysis of confounding variables of a convolutional neural network specific for abdominal aortic aneurysms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10245322/
https://www.ncbi.nlm.nih.gov/pubmed/37292186
http://dx.doi.org/10.1016/j.jvssci.2022.11.004
work_keys_str_mv AT tomihamarogert machinelearninganalysisofconfoundingvariablesofaconvolutionalneuralnetworkspecificforabdominalaorticaneurysms
AT camarajustinr machinelearninganalysisofconfoundingvariablesofaconvolutionalneuralnetworkspecificforabdominalaorticaneurysms
AT kiangsharonc machinelearninganalysisofconfoundingvariablesofaconvolutionalneuralnetworkspecificforabdominalaorticaneurysms