Cargando…
DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection
Object detection plays a vital role in autonomous driving systems, and the accurate detection of surrounding objects can ensure the safe driving of vehicles. This paper proposes a category-assisted transformer object detector called DetectFormer for autonomous driving. The proposed object detector c...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9268794/ https://www.ncbi.nlm.nih.gov/pubmed/35808332 http://dx.doi.org/10.3390/s22134833 |
_version_ | 1784744073238675456 |
---|---|
author | Liang, Tianjiao Bao, Hong Pan, Weiguo Fan, Xinyue Li, Han |
author_facet | Liang, Tianjiao Bao, Hong Pan, Weiguo Fan, Xinyue Li, Han |
author_sort | Liang, Tianjiao |
collection | PubMed |
description | Object detection plays a vital role in autonomous driving systems, and the accurate detection of surrounding objects can ensure the safe driving of vehicles. This paper proposes a category-assisted transformer object detector called DetectFormer for autonomous driving. The proposed object detector can achieve better accuracy compared with the baseline. Specifically, ClassDecoder is assisted by proposal categories and global information from the Global Extract Encoder (GEE) to improve the category sensitivity and detection performance. This fits the distribution of object categories in specific scene backgrounds and the connection between objects and the image context. Data augmentation is used to improve robustness and attention mechanism added in backbone network to extract channel-wise spatial features and direction information. The results obtained by benchmark experiment reveal that the proposed method can achieve higher real-time detection performance in traffic scenes compared with RetinaNet and FCOS. The proposed method achieved a detection performance of 97.6% and 91.4% in AP50 and AP75 on the BCTSDB dataset, respectively. |
format | Online Article Text |
id | pubmed-9268794 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-92687942022-07-09 DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection Liang, Tianjiao Bao, Hong Pan, Weiguo Fan, Xinyue Li, Han Sensors (Basel) Article Object detection plays a vital role in autonomous driving systems, and the accurate detection of surrounding objects can ensure the safe driving of vehicles. This paper proposes a category-assisted transformer object detector called DetectFormer for autonomous driving. The proposed object detector can achieve better accuracy compared with the baseline. Specifically, ClassDecoder is assisted by proposal categories and global information from the Global Extract Encoder (GEE) to improve the category sensitivity and detection performance. This fits the distribution of object categories in specific scene backgrounds and the connection between objects and the image context. Data augmentation is used to improve robustness and attention mechanism added in backbone network to extract channel-wise spatial features and direction information. The results obtained by benchmark experiment reveal that the proposed method can achieve higher real-time detection performance in traffic scenes compared with RetinaNet and FCOS. The proposed method achieved a detection performance of 97.6% and 91.4% in AP50 and AP75 on the BCTSDB dataset, respectively. MDPI 2022-06-26 /pmc/articles/PMC9268794/ /pubmed/35808332 http://dx.doi.org/10.3390/s22134833 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Liang, Tianjiao Bao, Hong Pan, Weiguo Fan, Xinyue Li, Han DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection |
title | DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection |
title_full | DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection |
title_fullStr | DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection |
title_full_unstemmed | DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection |
title_short | DetectFormer: Category-Assisted Transformer for Traffic Scene Object Detection |
title_sort | detectformer: category-assisted transformer for traffic scene object detection |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9268794/ https://www.ncbi.nlm.nih.gov/pubmed/35808332 http://dx.doi.org/10.3390/s22134833 |
work_keys_str_mv | AT liangtianjiao detectformercategoryassistedtransformerfortrafficsceneobjectdetection AT baohong detectformercategoryassistedtransformerfortrafficsceneobjectdetection AT panweiguo detectformercategoryassistedtransformerfortrafficsceneobjectdetection AT fanxinyue detectformercategoryassistedtransformerfortrafficsceneobjectdetection AT lihan detectformercategoryassistedtransformerfortrafficsceneobjectdetection |