Cargando…

DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection

Object detection plays a vital role in autonomous driving systems, and the accurate detection of surrounding objects can ensure the safe driving of vehicles. This paper proposes a category-assisted transformer object detector called DetectFormer for autonomous driving. The proposed object detector c...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Tianjiao, Bao, Hong, Pan, Weiguo, Fan, Xinyue, Li, Han
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9268794/
https://www.ncbi.nlm.nih.gov/pubmed/35808332
http://dx.doi.org/10.3390/s22134833
_version_ 1784744073238675456
author Liang, Tianjiao
Bao, Hong
Pan, Weiguo
Fan, Xinyue
Li, Han
author_facet Liang, Tianjiao
Bao, Hong
Pan, Weiguo
Fan, Xinyue
Li, Han
author_sort Liang, Tianjiao
collection PubMed
description Object detection plays a vital role in autonomous driving systems, and the accurate detection of surrounding objects can ensure the safe driving of vehicles. This paper proposes a category-assisted transformer object detector called DetectFormer for autonomous driving. The proposed object detector can achieve better accuracy compared with the baseline. Specifically, ClassDecoder is assisted by proposal categories and global information from the Global Extract Encoder (GEE) to improve the category sensitivity and detection performance. This fits the distribution of object categories in specific scene backgrounds and the connection between objects and the image context. Data augmentation is used to improve robustness and attention mechanism added in backbone network to extract channel-wise spatial features and direction information. The results obtained by benchmark experiment reveal that the proposed method can achieve higher real-time detection performance in traffic scenes compared with RetinaNet and FCOS. The proposed method achieved a detection performance of 97.6% and 91.4% in AP50 and AP75 on the BCTSDB dataset, respectively.
format Online
Article
Text
id pubmed-9268794
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-92687942022-07-09 DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection Liang, Tianjiao Bao, Hong Pan, Weiguo Fan, Xinyue Li, Han Sensors (Basel) Article Object detection plays a vital role in autonomous driving systems, and the accurate detection of surrounding objects can ensure the safe driving of vehicles. This paper proposes a category-assisted transformer object detector called DetectFormer for autonomous driving. The proposed object detector can achieve better accuracy compared with the baseline. Specifically, ClassDecoder is assisted by proposal categories and global information from the Global Extract Encoder (GEE) to improve the category sensitivity and detection performance. This fits the distribution of object categories in specific scene backgrounds and the connection between objects and the image context. Data augmentation is used to improve robustness and attention mechanism added in backbone network to extract channel-wise spatial features and direction information. The results obtained by benchmark experiment reveal that the proposed method can achieve higher real-time detection performance in traffic scenes compared with RetinaNet and FCOS. The proposed method achieved a detection performance of 97.6% and 91.4% in AP50 and AP75 on the BCTSDB dataset, respectively. MDPI 2022-06-26 /pmc/articles/PMC9268794/ /pubmed/35808332 http://dx.doi.org/10.3390/s22134833 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liang, Tianjiao
Bao, Hong
Pan, Weiguo
Fan, Xinyue
Li, Han
DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection
title DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection
title_full DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection
title_fullStr DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection
title_full_unstemmed DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection
title_short DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection
title_sort detectformer: category-assisted transformer for traffic scene object detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9268794/
https://www.ncbi.nlm.nih.gov/pubmed/35808332
http://dx.doi.org/10.3390/s22134833
work_keys_str_mv AT liangtianjiao detectformercategoryassistedtransformerfortrafficsceneobjectdetection
AT baohong detectformercategoryassistedtransformerfortrafficsceneobjectdetection
AT panweiguo detectformercategoryassistedtransformerfortrafficsceneobjectdetection
AT fanxinyue detectformercategoryassistedtransformerfortrafficsceneobjectdetection
AT lihan detectformercategoryassistedtransformerfortrafficsceneobjectdetection