Cargando…

Automatic image annotation method based on a convolutional neural network with threshold optimization

In this study, a convolutional neural network with threshold optimization (CNN-THOP) is proposed to solve the issue of overlabeling or downlabeling arising during the multilabel image annotation process in the use of a ranking function for label annotation along with prediction probability. This mod...

Descripción completa

Detalles Bibliográficos
Autores principales: Cao, Jianfang, Zhao, Aidi, Zhang, Zibang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7511011/
https://www.ncbi.nlm.nih.gov/pubmed/32966319
http://dx.doi.org/10.1371/journal.pone.0238956
_version_ 1783585894367756288
author Cao, Jianfang
Zhao, Aidi
Zhang, Zibang
author_facet Cao, Jianfang
Zhao, Aidi
Zhang, Zibang
author_sort Cao, Jianfang
collection PubMed
description In this study, a convolutional neural network with threshold optimization (CNN-THOP) is proposed to solve the issue of overlabeling or downlabeling arising during the multilabel image annotation process in the use of a ranking function for label annotation along with prediction probability. This model fuses the threshold optimization algorithm to the CNN structure. First, an optimal model trained by the CNN is used to predict the test set images, and batch normalization (BN) is added to the CNN structure to effectively accelerate the convergence speed and obtain a group of prediction probabilities. Second, threshold optimization is performed on the obtained prediction probability to derive an optimal threshold for each class of labels to form a group of optimal thresholds. When the prediction probability for this class of labels is greater than or equal to the corresponding optimal threshold, this class of labels is used as the annotation result for the image. During the annotation process, the multilabel annotation for the image to be annotated is realized by loading the optimal model and the optimal threshold. Verification experiments are performed on the MIML, COREL5K, and MSRC datasets. Compared with the MBRM, the CNN-THOP increases the average precision on MIML, COREL5K, and MSRC by 27%, 28% and 33%, respectively. Compared with the E2E-DCNN, the CNN-THOP increases the average recall rate by 3% on both COREL5K and MSRC. The most precise annotation effect for CNN-THOP is observed on the MIML dataset, with a complete matching degree reaching 64.8%.
format Online
Article
Text
id pubmed-7511011
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-75110112020-10-01 Automatic image annotation method based on a convolutional neural network with threshold optimization Cao, Jianfang Zhao, Aidi Zhang, Zibang PLoS One Research Article In this study, a convolutional neural network with threshold optimization (CNN-THOP) is proposed to solve the issue of overlabeling or downlabeling arising during the multilabel image annotation process in the use of a ranking function for label annotation along with prediction probability. This model fuses the threshold optimization algorithm to the CNN structure. First, an optimal model trained by the CNN is used to predict the test set images, and batch normalization (BN) is added to the CNN structure to effectively accelerate the convergence speed and obtain a group of prediction probabilities. Second, threshold optimization is performed on the obtained prediction probability to derive an optimal threshold for each class of labels to form a group of optimal thresholds. When the prediction probability for this class of labels is greater than or equal to the corresponding optimal threshold, this class of labels is used as the annotation result for the image. During the annotation process, the multilabel annotation for the image to be annotated is realized by loading the optimal model and the optimal threshold. Verification experiments are performed on the MIML, COREL5K, and MSRC datasets. Compared with the MBRM, the CNN-THOP increases the average precision on MIML, COREL5K, and MSRC by 27%, 28% and 33%, respectively. Compared with the E2E-DCNN, the CNN-THOP increases the average recall rate by 3% on both COREL5K and MSRC. The most precise annotation effect for CNN-THOP is observed on the MIML dataset, with a complete matching degree reaching 64.8%. Public Library of Science 2020-09-23 /pmc/articles/PMC7511011/ /pubmed/32966319 http://dx.doi.org/10.1371/journal.pone.0238956 Text en © 2020 Cao et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Cao, Jianfang
Zhao, Aidi
Zhang, Zibang
Automatic image annotation method based on a convolutional neural network with threshold optimization
title Automatic image annotation method based on a convolutional neural network with threshold optimization
title_full Automatic image annotation method based on a convolutional neural network with threshold optimization
title_fullStr Automatic image annotation method based on a convolutional neural network with threshold optimization
title_full_unstemmed Automatic image annotation method based on a convolutional neural network with threshold optimization
title_short Automatic image annotation method based on a convolutional neural network with threshold optimization
title_sort automatic image annotation method based on a convolutional neural network with threshold optimization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7511011/
https://www.ncbi.nlm.nih.gov/pubmed/32966319
http://dx.doi.org/10.1371/journal.pone.0238956
work_keys_str_mv AT caojianfang automaticimageannotationmethodbasedonaconvolutionalneuralnetworkwiththresholdoptimization
AT zhaoaidi automaticimageannotationmethodbasedonaconvolutionalneuralnetworkwiththresholdoptimization
AT zhangzibang automaticimageannotationmethodbasedonaconvolutionalneuralnetworkwiththresholdoptimization