
Diagnostic accuracy of content‐based dermatoscopic image retrieval with deep classification features

BACKGROUND: Automated classification of medical images through neural networks can reach high accuracy rates but lacks interpretability. OBJECTIVES: To compare the diagnostic accuracy obtained by using content‐based image retrieval (CBIR) to retrieve visually similar dermatoscopic images with corres...


Bibliographic Details
Main authors: Tschandl, P., Argenziano, G., Razmara, M., Yap, J.
Format: Online Article Text
Language: English
Published: John Wiley and Sons Inc. 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7379719/
https://www.ncbi.nlm.nih.gov/pubmed/30207594
http://dx.doi.org/10.1111/bjd.17189
_version_ 1783562704954327040
author Tschandl, P.
Argenziano, G.
Razmara, M.
Yap, J.
collection PubMed
description BACKGROUND: Automated classification of medical images through neural networks can reach high accuracy rates but lacks interpretability. OBJECTIVES: To compare the diagnostic accuracy obtained by using content‐based image retrieval (CBIR) to retrieve visually similar dermatoscopic images with corresponding disease labels against predictions made by a neural network. METHODS: A neural network was trained to predict disease classes on dermatoscopic images from three retrospectively collected image datasets containing 888, 2750 and 16 691 images, respectively. Diagnosis predictions were made based on the most commonly occurring diagnosis in visually similar images, or based on the top‐1 class prediction of the softmax output from the network. Outcome measures were area under the receiver operating characteristic curve (AUC) for predicting a malignant lesion, multiclass‐accuracy and mean average precision (mAP), measured on unseen test images of the corresponding dataset. RESULTS: In all three datasets the skin cancer predictions from CBIR (evaluating the 16 most similar images) showed AUC values similar to softmax predictions (0·842, 0·806 and 0·852 vs. 0·830, 0·810 and 0·847, respectively; P > 0·99 for all). Similarly, the multiclass‐accuracy of CBIR was comparable with softmax predictions. Compared with softmax predictions, networks trained for detecting only three classes performed better on a dataset with eight classes when using CBIR (mAP 0·184 vs. 0·368 and 0·198 vs. 0·403, respectively). CONCLUSIONS: Presenting visually similar images based on features from a neural network shows comparable accuracy with the softmax probability‐based diagnoses of convolutional neural networks. CBIR may be more helpful than a softmax classifier in improving diagnostic accuracy of clinicians in a routine clinical setting.
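The two prediction routes compared in the abstract can be illustrated with a minimal sketch. It assumes each image is already represented by a feature vector taken from the trained network, and it uses cosine similarity as the similarity measure; the abstract does not state which measure the authors used, so that choice, along with all function names, the malignancy score (fraction of malignant labels among the retrieved images) and the toy data in the usage note, is hypothetical.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity between two feature vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def retrieve_top_k(query, gallery, k=16):
    """Indices of the k gallery feature vectors most similar to the query."""
    ranked = sorted(
        range(len(gallery)),
        key=lambda i: cosine_similarity(query, gallery[i]),
        reverse=True,
    )
    return ranked[:k]


def cbir_diagnosis(query, gallery, labels, k=16):
    """Most commonly occurring diagnosis among the k retrieved images."""
    votes = Counter(labels[i] for i in retrieve_top_k(query, gallery, k))
    return votes.most_common(1)[0][0]


def cbir_malignancy_score(query, gallery, labels, malignant, k=16):
    """Fraction of retrieved images with a malignant label (hypothetical score)."""
    retrieved = retrieve_top_k(query, gallery, k)
    return sum(labels[i] in malignant for i in retrieved) / len(retrieved)


def softmax_diagnosis(logits, class_names):
    """Top-1 class; the argmax of the logits equals the argmax of the softmax."""
    best = max(range(len(logits)), key=lambda i: logits[i])
    return class_names[best]
```

For example, with a toy gallery of five two-dimensional feature vectors, `cbir_diagnosis([1.0, 0.05], gallery, labels, k=3)` returns the label shared by most of the three nearest gallery entries. A retrieval-based score of this kind can be ranked across test images to compute an AUC in the same way as a softmax probability.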
format Online
Article
Text
id pubmed-7379719
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-7379719 2020-07-27 Diagnostic accuracy of content‐based dermatoscopic image retrieval with deep classification features Tschandl, P. Argenziano, G. Razmara, M. Yap, J. Br J Dermatol Original Articles BACKGROUND: Automated classification of medical images through neural networks can reach high accuracy rates but lacks interpretability. OBJECTIVES: To compare the diagnostic accuracy obtained by using content‐based image retrieval (CBIR) to retrieve visually similar dermatoscopic images with corresponding disease labels against predictions made by a neural network. METHODS: A neural network was trained to predict disease classes on dermatoscopic images from three retrospectively collected image datasets containing 888, 2750 and 16 691 images, respectively. Diagnosis predictions were made based on the most commonly occurring diagnosis in visually similar images, or based on the top‐1 class prediction of the softmax output from the network. Outcome measures were area under the receiver operating characteristic curve (AUC) for predicting a malignant lesion, multiclass‐accuracy and mean average precision (mAP), measured on unseen test images of the corresponding dataset. RESULTS: In all three datasets the skin cancer predictions from CBIR (evaluating the 16 most similar images) showed AUC values similar to softmax predictions (0·842, 0·806 and 0·852 vs. 0·830, 0·810 and 0·847, respectively; P > 0·99 for all). Similarly, the multiclass‐accuracy of CBIR was comparable with softmax predictions. Compared with softmax predictions, networks trained for detecting only three classes performed better on a dataset with eight classes when using CBIR (mAP 0·184 vs. 0·368 and 0·198 vs. 0·403, respectively). CONCLUSIONS: Presenting visually similar images based on features from a neural network shows comparable accuracy with the softmax probability‐based diagnoses of convolutional neural networks. CBIR may be more helpful than a softmax classifier in improving diagnostic accuracy of clinicians in a routine clinical setting. John Wiley and Sons Inc. 2018-10-17 2019-07 /pmc/articles/PMC7379719/ /pubmed/30207594 http://dx.doi.org/10.1111/bjd.17189 Text en © 2018 The Authors. British Journal of Dermatology published by John Wiley & Sons Ltd on behalf of British Association of Dermatologists. This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
title Diagnostic accuracy of content‐based dermatoscopic image retrieval with deep classification features
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7379719/
https://www.ncbi.nlm.nih.gov/pubmed/30207594
http://dx.doi.org/10.1111/bjd.17189