SCD: A Stacked Carton Dataset for Detection and Segmentation

Bibliographic Details
Main authors: Yang, Jinrong; Wu, Shengkai; Gou, Lijun; Yu, Hangcheng; Lin, Chenxi; Wang, Jiazhuo; Wang, Pan; Li, Minxuan; Li, Xiaoping
Format: Online Article Text
Language: English
Published: MDPI 2022
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9142905/
https://www.ncbi.nlm.nih.gov/pubmed/35632027
http://dx.doi.org/10.3390/s22103617
Description
Summary: Carton detection is an important technique in automated logistics systems and can be applied to tasks such as stacking and unstacking cartons and unloading cartons from containers. However, no public large-scale carton dataset has so far been available for the research community to train and evaluate carton detection models, which hinders the development of carton detection. In this article, we present a large-scale carton dataset named Stacked Carton Dataset (SCD) with the goal of advancing the state of the art in carton detection. Images were collected from the Internet and several warehouses, and objects were labeled for precise localization using instance mask annotation. The dataset contains a total of 250,000 instance masks from 16,136 images. A suite of benchmarks was established with several popular detectors and instance segmentation models. In addition, we designed a carton detector based on RetinaNet by embedding our proposed Offset Prediction between the Classification and Localization module (OPCL) and the Boundary Guided Supervision module (BGS). OPCL alleviates the imbalance between classification and localization quality, which boosts AP by [Formula: see text] ∼ [Formula: see text] on SCD at the model level, while BGS guides the detector to pay more attention to the boundary information of cartons and to decouple repeated carton textures at the task level. To demonstrate that OPCL generalizes to other datasets, we conducted extensive experiments on MS COCO and PASCAL VOC. The improvements in AP on MS COCO and PASCAL VOC were [Formula: see text] ∼ [Formula: see text] and [Formula: see text] ∼ [Formula: see text], respectively.