Cargando…

Thyroid ultrasound image classification using a convolutional neural network

BACKGROUND: Ultrasound (US) is widely used in the clinical diagnosis of thyroid nodules. Artificial intelligence-powered US is becoming an important issue in the research community. This study aimed to develop an improved deep learning model-based algorithm to classify benign and malignant thyroid n...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Yi-Cheng, Jin, Peng-Fei, Bao, Jie, Jiang, Quan, Wang, Ximing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AME Publishing Company 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8576712/
https://www.ncbi.nlm.nih.gov/pubmed/34790732
http://dx.doi.org/10.21037/atm-21-4328
Descripción
Sumario:BACKGROUND: Ultrasound (US) is widely used in the clinical diagnosis of thyroid nodules. Artificial intelligence-powered US is becoming an important issue in the research community. This study aimed to develop an improved deep learning model-based algorithm to classify benign and malignant thyroid nodules (TNs) using thyroid US images. METHODS: In total, 592 patients with 600 TNs were included in the internal training, validation, and testing data set; 187 patients with 200 TNs were recruited for the external test data set. We developed a Visual Geometry Group (VGG)-16T model, based on the VGG-16 architecture, but with additional batch normalization (BN) and dropout layers in addition to the fully connected layers. We conducted a 10-fold cross-validation to analyze the performance of the VGG-16T model using a data set of gray-scale US images from 5 different brands of US machines. RESULTS: For the internal data set, the VGG-16T model had 87.43% sensitivity, 85.43% specificity, and 86.43% accuracy. For the external data set, the VGG-16T model achieved an area under the curve (AUC) of 0.829 [95% confidence interval (CI): 0.770–0.879], a radiologist with 15 years’ working experience achieved an AUC of 0.705 (95% CI: 0.659–0.801), a radiologist with 10 years’ experience achieved an AUC of 0.725 (95% CI: 0.653–0.797), and a radiologist with 5 years’ experience achieved an AUC of 0.660 (95% CI: 0.584–0.736). CONCLUSIONS: The VGG-16T model had high specificity, sensitivity, and accuracy in differentiating between malignant and benign TNs. Its diagnostic performance was superior to that of experienced radiologists. Thus, the proposed improved deep-learning model can assist radiologists to diagnose thyroid cancer.