Cargando…

Instance-Level Contrastive Learning for Weakly Supervised Object Detection

Weakly supervised object detection (WSOD) has received increasing attention in object detection field, because it only requires image-level annotations to indicate the presence or absence of target objects, which greatly reduces the labeling costs. Existing methods usually focus on the current indiv...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Ming, Zeng, Bing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570746/
https://www.ncbi.nlm.nih.gov/pubmed/36236624
http://dx.doi.org/10.3390/s22197525
_version_ 1784810187393073152
author Zhang, Ming
Zeng, Bing
author_facet Zhang, Ming
Zeng, Bing
author_sort Zhang, Ming
collection PubMed
description Weakly supervised object detection (WSOD) has received increasing attention in object detection field, because it only requires image-level annotations to indicate the presence or absence of target objects, which greatly reduces the labeling costs. Existing methods usually focus on the current individual image to learn object instance representations, while ignoring instance correlations between different images. To address this problem, we propose an instance-level contrastive learning (ICL) framework to mine reliable instance representations from all learned images, and use the contrastive loss to guide instance representation learning for the current image. Due to the diversity of instances, with different appearances, sizes or shapes, we propose an instance-diverse memory updating (IMU) algorithm to mine different instance representations and store them in a memory bank with multiple representation vectors per class, which also considers background information to enhance foreground representations. With the help of memory bank, we further propose a memory-aware instance mining (MIM) algorithm that combines proposal confidence and instance similarity across images to mine more reliable object instances. In addition, we also propose a memory-aware proposal sampling (MPS) algorithm to sample more positive proposals and remove some negative proposals to balance the learning of positive-negative samples. We conduct extensive experiments on the PASCAL VOC2007 and VOC2012 datasets, which are widely used in WSOD, to demonstrate the effectiveness of our method. Compared to our baseline, our method brings [Formula: see text] mAP and [Formula: see text] CorLoc gains on PASCAL VOC2007 dataset, and [Formula: see text] mAP and [Formula: see text] CorLoc gains on PASCAL VOC2012 dataset.
format Online
Article
Text
id pubmed-9570746
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95707462022-10-17 Instance-Level Contrastive Learning for Weakly Supervised Object Detection Zhang, Ming Zeng, Bing Sensors (Basel) Article Weakly supervised object detection (WSOD) has received increasing attention in object detection field, because it only requires image-level annotations to indicate the presence or absence of target objects, which greatly reduces the labeling costs. Existing methods usually focus on the current individual image to learn object instance representations, while ignoring instance correlations between different images. To address this problem, we propose an instance-level contrastive learning (ICL) framework to mine reliable instance representations from all learned images, and use the contrastive loss to guide instance representation learning for the current image. Due to the diversity of instances, with different appearances, sizes or shapes, we propose an instance-diverse memory updating (IMU) algorithm to mine different instance representations and store them in a memory bank with multiple representation vectors per class, which also considers background information to enhance foreground representations. With the help of memory bank, we further propose a memory-aware instance mining (MIM) algorithm that combines proposal confidence and instance similarity across images to mine more reliable object instances. In addition, we also propose a memory-aware proposal sampling (MPS) algorithm to sample more positive proposals and remove some negative proposals to balance the learning of positive-negative samples. We conduct extensive experiments on the PASCAL VOC2007 and VOC2012 datasets, which are widely used in WSOD, to demonstrate the effectiveness of our method. Compared to our baseline, our method brings [Formula: see text] mAP and [Formula: see text] CorLoc gains on PASCAL VOC2007 dataset, and [Formula: see text] mAP and [Formula: see text] CorLoc gains on PASCAL VOC2012 dataset. MDPI 2022-10-04 /pmc/articles/PMC9570746/ /pubmed/36236624 http://dx.doi.org/10.3390/s22197525 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhang, Ming
Zeng, Bing
Instance-Level Contrastive Learning for Weakly Supervised Object Detection
title Instance-Level Contrastive Learning for Weakly Supervised Object Detection
title_full Instance-Level Contrastive Learning for Weakly Supervised Object Detection
title_fullStr Instance-Level Contrastive Learning for Weakly Supervised Object Detection
title_full_unstemmed Instance-Level Contrastive Learning for Weakly Supervised Object Detection
title_short Instance-Level Contrastive Learning for Weakly Supervised Object Detection
title_sort instance-level contrastive learning for weakly supervised object detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570746/
https://www.ncbi.nlm.nih.gov/pubmed/36236624
http://dx.doi.org/10.3390/s22197525
work_keys_str_mv AT zhangming instancelevelcontrastivelearningforweaklysupervisedobjectdetection
AT zengbing instancelevelcontrastivelearningforweaklysupervisedobjectdetection