Cargando…

Proposals Generation for Weakly Supervised Object Detection in Artwork Images

Object Detection requires many precise annotations, which are available for natural images but not for many non-natural data sets such as artworks data sets. A solution is using Weakly Supervised Object Detection (WSOD) techniques that learn accurate object localization from image-level labels. Stud...

Descripción completa

Detalles Bibliográficos
Autores principales:	Milani, Federico, Pinciroli Vago, Nicolò Oreste, Fraternali, Piero
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9410216/ https://www.ncbi.nlm.nih.gov/pubmed/36005458 http://dx.doi.org/10.3390/jimaging8080215

_version_	1784775040112263168
author	Milani, Federico Pinciroli Vago, Nicolò Oreste Fraternali, Piero
author_facet	Milani, Federico Pinciroli Vago, Nicolò Oreste Fraternali, Piero
author_sort	Milani, Federico
collection	PubMed
description	Object Detection requires many precise annotations, which are available for natural images but not for many non-natural data sets such as artworks data sets. A solution is using Weakly Supervised Object Detection (WSOD) techniques that learn accurate object localization from image-level labels. Studies have demonstrated that state-of-the-art end-to-end architectures may not be suitable for domains in which images or classes sensibly differ from those used to pre-train networks. This paper presents a novel two-stage Weakly Supervised Object Detection approach for obtaining accurate bounding boxes on non-natural data sets. The proposed method exploits existing classification knowledge to generate pseudo-ground truth bounding boxes from Class Activation Maps (CAMs). The automatically generated annotations are used to train a robust Faster R-CNN object detector. Quantitative and qualitative analysis shows that bounding boxes generated from CAMs can compensate for the lack of manually annotated ground truth (GT) and that an object detector, trained with such pseudo-GT, surpasses end-to-end WSOD state-of-the-art methods on ArtDL 2.0 (≈41.5% mAP) and IconArt (≈17% mAP), two artworks data sets. The proposed solution is a step towards the computer-aided study of non-natural images and opens the way to more advanced tasks, e.g., automatic artwork image captioning for digital archive applications.
format	Online Article Text
id	pubmed-9410216
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-94102162022-08-26 Proposals Generation for Weakly Supervised Object Detection in Artwork Images Milani, Federico Pinciroli Vago, Nicolò Oreste Fraternali, Piero J Imaging Article Object Detection requires many precise annotations, which are available for natural images but not for many non-natural data sets such as artworks data sets. A solution is using Weakly Supervised Object Detection (WSOD) techniques that learn accurate object localization from image-level labels. Studies have demonstrated that state-of-the-art end-to-end architectures may not be suitable for domains in which images or classes sensibly differ from those used to pre-train networks. This paper presents a novel two-stage Weakly Supervised Object Detection approach for obtaining accurate bounding boxes on non-natural data sets. The proposed method exploits existing classification knowledge to generate pseudo-ground truth bounding boxes from Class Activation Maps (CAMs). The automatically generated annotations are used to train a robust Faster R-CNN object detector. Quantitative and qualitative analysis shows that bounding boxes generated from CAMs can compensate for the lack of manually annotated ground truth (GT) and that an object detector, trained with such pseudo-GT, surpasses end-to-end WSOD state-of-the-art methods on ArtDL 2.0 (≈41.5% mAP) and IconArt (≈17% mAP), two artworks data sets. The proposed solution is a step towards the computer-aided study of non-natural images and opens the way to more advanced tasks, e.g., automatic artwork image captioning for digital archive applications. MDPI 2022-08-06 /pmc/articles/PMC9410216/ /pubmed/36005458 http://dx.doi.org/10.3390/jimaging8080215 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Milani, Federico Pinciroli Vago, Nicolò Oreste Fraternali, Piero Proposals Generation for Weakly Supervised Object Detection in Artwork Images
title	Proposals Generation for Weakly Supervised Object Detection in Artwork Images
title_full	Proposals Generation for Weakly Supervised Object Detection in Artwork Images
title_fullStr	Proposals Generation for Weakly Supervised Object Detection in Artwork Images
title_full_unstemmed	Proposals Generation for Weakly Supervised Object Detection in Artwork Images
title_short	Proposals Generation for Weakly Supervised Object Detection in Artwork Images
title_sort	proposals generation for weakly supervised object detection in artwork images
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9410216/ https://www.ncbi.nlm.nih.gov/pubmed/36005458 http://dx.doi.org/10.3390/jimaging8080215
work_keys_str_mv	AT milanifederico proposalsgenerationforweaklysupervisedobjectdetectioninartworkimages AT pincirolivagonicolooreste proposalsgenerationforweaklysupervisedobjectdetectioninartworkimages AT fraternalipiero proposalsgenerationforweaklysupervisedobjectdetectioninartworkimages

Proposals Generation for Weakly Supervised Object Detection in Artwork Images

Ejemplares similares