Cargando…
Template-Aware Transformer for Person Reidentification
Person reidentification (ReID) is a challenging computer vision task for identifying or verifying one or more persons when the faces are not available. In ReID, the indistinguishable background usually affects the model's perception of the foreground, which reduces the performance of ReID. Gene...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8993552/ https://www.ncbi.nlm.nih.gov/pubmed/35401719 http://dx.doi.org/10.1155/2022/8917964 |
_version_ | 1784683920773611520 |
---|---|
author | Zheng, Yanwei Zhao, Zengrui Yu, Xiaowei Yu, Dongxiao |
author_facet | Zheng, Yanwei Zhao, Zengrui Yu, Xiaowei Yu, Dongxiao |
author_sort | Zheng, Yanwei |
collection | PubMed |
description | Person reidentification (ReID) is a challenging computer vision task for identifying or verifying one or more persons when the faces are not available. In ReID, the indistinguishable background usually affects the model's perception of the foreground, which reduces the performance of ReID. Generally, the background of the same camera is similar, whereas that of different cameras is quite different. Based on this finding, we propose a template-aware transformer (TAT) method which can learn intersample indistinguishable features by introducing a learnable template for the transformer structure to cut down the model's attention to regions of the image with low discrimination, including backgrounds and occlusions. In the multiheaded attention module of the encoder, this template directs template-aware attention to indistinguishable features of the image and gradually increases the attention to distinguishable features as the encoder block deepens. We also increase the number of templates using side information considering the characteristics of ReID tasks to adapt the model to backgrounds that vary significantly with different camera IDs. Finally, we demonstrate the validity of our theories using various public data sets and achieve competitive results via a quantitative evaluation. |
format | Online Article Text |
id | pubmed-8993552 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-89935522022-04-09 Template-Aware Transformer for Person Reidentification Zheng, Yanwei Zhao, Zengrui Yu, Xiaowei Yu, Dongxiao Comput Intell Neurosci Research Article Person reidentification (ReID) is a challenging computer vision task for identifying or verifying one or more persons when the faces are not available. In ReID, the indistinguishable background usually affects the model's perception of the foreground, which reduces the performance of ReID. Generally, the background of the same camera is similar, whereas that of different cameras is quite different. Based on this finding, we propose a template-aware transformer (TAT) method which can learn intersample indistinguishable features by introducing a learnable template for the transformer structure to cut down the model's attention to regions of the image with low discrimination, including backgrounds and occlusions. In the multiheaded attention module of the encoder, this template directs template-aware attention to indistinguishable features of the image and gradually increases the attention to distinguishable features as the encoder block deepens. We also increase the number of templates using side information considering the characteristics of ReID tasks to adapt the model to backgrounds that vary significantly with different camera IDs. Finally, we demonstrate the validity of our theories using various public data sets and achieve competitive results via a quantitative evaluation. Hindawi 2022-04-01 /pmc/articles/PMC8993552/ /pubmed/35401719 http://dx.doi.org/10.1155/2022/8917964 Text en Copyright © 2022 Yanwei Zheng et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zheng, Yanwei Zhao, Zengrui Yu, Xiaowei Yu, Dongxiao Template-Aware Transformer for Person Reidentification |
title | Template-Aware Transformer for Person Reidentification |
title_full | Template-Aware Transformer for Person Reidentification |
title_fullStr | Template-Aware Transformer for Person Reidentification |
title_full_unstemmed | Template-Aware Transformer for Person Reidentification |
title_short | Template-Aware Transformer for Person Reidentification |
title_sort | template-aware transformer for person reidentification |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8993552/ https://www.ncbi.nlm.nih.gov/pubmed/35401719 http://dx.doi.org/10.1155/2022/8917964 |
work_keys_str_mv | AT zhengyanwei templateawaretransformerforpersonreidentification AT zhaozengrui templateawaretransformerforpersonreidentification AT yuxiaowei templateawaretransformerforpersonreidentification AT yudongxiao templateawaretransformerforpersonreidentification |