Cargando…

RF11 | PSAT234 Deep learning analysis of thyroid nodule ultrasound images has high sensitivity and negative predictive value to rule-out thyroid cancer

PURPOSE: To evaluate deep learning analysis of thyroid nodule ultrasound images as a rule-out test for thyroid malignancy. METHODS: Supervised deep learning (DL) classifier of thyroid nodules was trained on 32,545 thyroid US images from 621 nodules representing all major benign and malignant types o...

Descripción completa

Detalles Bibliográficos
Autores principales: Clark, Toshimasa, Cohen, Trevor, Haugen, Bryan R, Subramanian, Devika, Pozdeyev, Nikita, Dighe, Manjiri, Barrio, Martin, Leu, Michael G
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9628735/
http://dx.doi.org/10.1210/jendso/bvac150.1762
_version_ 1784823251661225984
author Clark, Toshimasa
Cohen, Trevor
Haugen, Bryan R
Subramanian, Devika
Pozdeyev, Nikita
Dighe, Manjiri
Barrio, Martin
Leu, Michael G
author_facet Clark, Toshimasa
Cohen, Trevor
Haugen, Bryan R
Subramanian, Devika
Pozdeyev, Nikita
Dighe, Manjiri
Barrio, Martin
Leu, Michael G
author_sort Clark, Toshimasa
collection PubMed
description PURPOSE: To evaluate deep learning analysis of thyroid nodule ultrasound images as a rule-out test for thyroid malignancy. METHODS: Supervised deep learning (DL) classifier of thyroid nodules was trained on 32,545 thyroid US images from 621 nodules representing all major benign and malignant types of thyroid lesions and tested on an independent set of 145 nodules collected at a different healthcare system in the United States. The Big Transfer BiT-M ResNet-50×1 convolutional neural net architecture was modified to contain 3, 4, 6 and 3 PreActBottleneck units per block 1 through 4. Weights pretrained on the ImageNet-21k dataset were loaded and weights for blocks 3 and 4 were fine-tuned for the binary classification task of distinguishing benign and malignant thyroid nodules. RESULTS: The deep learning thyroid nodule classifier achieved an area under receiver operating characteristic curve (AUROC) of 0.889 on five-fold cross-validation. The AUROC improved when images were scaled by nodule size and six randomly selected cine clip frames were added to the training set per epoch. GradCAM class activation heatmaps revealed that microcalcifications and spongiform appearance were reliably recognized by the classifier as malignant and benign features, respectively. Spongiform nodules were found to be benign even when microcystic spaces constituted less than 50% of nodule volume. To investigate the clinical relevance of the benign vs. malignant classifier, the binary classification threshold for the probability of malignancy generated by model was set at 7% to achieve sensitivity and negative predictive value (NPV) comparable to that of the fine needle aspiration biopsy (FNA). At this threshold, cross-validated deep-learning model achieved a sensitivity of 90%, specificity of 63%, positive predictive value (PPV) of 46% and negative predictive value of 94%. When tested on an independent image set that includes 18 classic papillary thyroid cancers (PTC), 5 follicular variant PTC, 4 medullary thyroid cancers, 3 follicular thyroid cancers (FTC), and 1 Hurthle cell thyroid cancer, the DL classifier achieved AUROC of 0.88, sensitivity of 97%, specificity of 61%, PPV of 40% and NPV of 99%. A single minimally-invasive FTC that had no suspicious features on thyroid ultrasound was incorrectly classified as benign. CONCLUSIONS: This study demonstrates that the ultrasound-based deep-learning classifier of thyroid nodules achieves sensitivity and negative predictive value comparable to that of thyroid fine needle aspiration (FNA). Clinicians may use this tool to augment clinical judgment when determining whether to perform FNA procedures. Presentation: Saturday, June 11, 2022 1:00 p.m. - 3:00 p.m., Saturday, June 11, 2022 1:06 p.m. - 1:11 p.m.
format Online
Article
Text
id pubmed-9628735
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-96287352022-11-04 RF11 | PSAT234 Deep learning analysis of thyroid nodule ultrasound images has high sensitivity and negative predictive value to rule-out thyroid cancer Clark, Toshimasa Cohen, Trevor Haugen, Bryan R Subramanian, Devika Pozdeyev, Nikita Dighe, Manjiri Barrio, Martin Leu, Michael G J Endocr Soc Thyroid PURPOSE: To evaluate deep learning analysis of thyroid nodule ultrasound images as a rule-out test for thyroid malignancy. METHODS: Supervised deep learning (DL) classifier of thyroid nodules was trained on 32,545 thyroid US images from 621 nodules representing all major benign and malignant types of thyroid lesions and tested on an independent set of 145 nodules collected at a different healthcare system in the United States. The Big Transfer BiT-M ResNet-50×1 convolutional neural net architecture was modified to contain 3, 4, 6 and 3 PreActBottleneck units per block 1 through 4. Weights pretrained on the ImageNet-21k dataset were loaded and weights for blocks 3 and 4 were fine-tuned for the binary classification task of distinguishing benign and malignant thyroid nodules. RESULTS: The deep learning thyroid nodule classifier achieved an area under receiver operating characteristic curve (AUROC) of 0.889 on five-fold cross-validation. The AUROC improved when images were scaled by nodule size and six randomly selected cine clip frames were added to the training set per epoch. GradCAM class activation heatmaps revealed that microcalcifications and spongiform appearance were reliably recognized by the classifier as malignant and benign features, respectively. Spongiform nodules were found to be benign even when microcystic spaces constituted less than 50% of nodule volume. To investigate the clinical relevance of the benign vs. malignant classifier, the binary classification threshold for the probability of malignancy generated by model was set at 7% to achieve sensitivity and negative predictive value (NPV) comparable to that of the fine needle aspiration biopsy (FNA). At this threshold, cross-validated deep-learning model achieved a sensitivity of 90%, specificity of 63%, positive predictive value (PPV) of 46% and negative predictive value of 94%. When tested on an independent image set that includes 18 classic papillary thyroid cancers (PTC), 5 follicular variant PTC, 4 medullary thyroid cancers, 3 follicular thyroid cancers (FTC), and 1 Hurthle cell thyroid cancer, the DL classifier achieved AUROC of 0.88, sensitivity of 97%, specificity of 61%, PPV of 40% and NPV of 99%. A single minimally-invasive FTC that had no suspicious features on thyroid ultrasound was incorrectly classified as benign. CONCLUSIONS: This study demonstrates that the ultrasound-based deep-learning classifier of thyroid nodules achieves sensitivity and negative predictive value comparable to that of thyroid fine needle aspiration (FNA). Clinicians may use this tool to augment clinical judgment when determining whether to perform FNA procedures. Presentation: Saturday, June 11, 2022 1:00 p.m. - 3:00 p.m., Saturday, June 11, 2022 1:06 p.m. - 1:11 p.m. Oxford University Press 2022-11-01 /pmc/articles/PMC9628735/ http://dx.doi.org/10.1210/jendso/bvac150.1762 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the Endocrine Society. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Thyroid
Clark, Toshimasa
Cohen, Trevor
Haugen, Bryan R
Subramanian, Devika
Pozdeyev, Nikita
Dighe, Manjiri
Barrio, Martin
Leu, Michael G
RF11 | PSAT234 Deep learning analysis of thyroid nodule ultrasound images has high sensitivity and negative predictive value to rule-out thyroid cancer
title RF11 | PSAT234 Deep learning analysis of thyroid nodule ultrasound images has high sensitivity and negative predictive value to rule-out thyroid cancer
title_full RF11 | PSAT234 Deep learning analysis of thyroid nodule ultrasound images has high sensitivity and negative predictive value to rule-out thyroid cancer
title_fullStr RF11 | PSAT234 Deep learning analysis of thyroid nodule ultrasound images has high sensitivity and negative predictive value to rule-out thyroid cancer
title_full_unstemmed RF11 | PSAT234 Deep learning analysis of thyroid nodule ultrasound images has high sensitivity and negative predictive value to rule-out thyroid cancer
title_short RF11 | PSAT234 Deep learning analysis of thyroid nodule ultrasound images has high sensitivity and negative predictive value to rule-out thyroid cancer
title_sort rf11 | psat234 deep learning analysis of thyroid nodule ultrasound images has high sensitivity and negative predictive value to rule-out thyroid cancer
topic Thyroid
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9628735/
http://dx.doi.org/10.1210/jendso/bvac150.1762
work_keys_str_mv AT clarktoshimasa rf11psat234deeplearninganalysisofthyroidnoduleultrasoundimageshashighsensitivityandnegativepredictivevaluetoruleoutthyroidcancer
AT cohentrevor rf11psat234deeplearninganalysisofthyroidnoduleultrasoundimageshashighsensitivityandnegativepredictivevaluetoruleoutthyroidcancer
AT haugenbryanr rf11psat234deeplearninganalysisofthyroidnoduleultrasoundimageshashighsensitivityandnegativepredictivevaluetoruleoutthyroidcancer
AT subramaniandevika rf11psat234deeplearninganalysisofthyroidnoduleultrasoundimageshashighsensitivityandnegativepredictivevaluetoruleoutthyroidcancer
AT pozdeyevnikita rf11psat234deeplearninganalysisofthyroidnoduleultrasoundimageshashighsensitivityandnegativepredictivevaluetoruleoutthyroidcancer
AT dighemanjiri rf11psat234deeplearninganalysisofthyroidnoduleultrasoundimageshashighsensitivityandnegativepredictivevaluetoruleoutthyroidcancer
AT barriomartin rf11psat234deeplearninganalysisofthyroidnoduleultrasoundimageshashighsensitivityandnegativepredictivevaluetoruleoutthyroidcancer
AT leumichaelg rf11psat234deeplearninganalysisofthyroidnoduleultrasoundimageshashighsensitivityandnegativepredictivevaluetoruleoutthyroidcancer