Cargando…

Vision Transformer with hierarchical structure and windows shifting for person re-identification

Extracting rich feature representations is a key challenge in person re-identification (Re-ID) tasks. However, traditional Convolutional Neural Networks (CNN) based methods could ignore a part of information when processing local regions of person images, which leads to incomplete feature extraction...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yinghua, Hou, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10313053/
https://www.ncbi.nlm.nih.gov/pubmed/37390091
http://dx.doi.org/10.1371/journal.pone.0287979
_version_ 1785067041152040960
author Zhang, Yinghua
Hou, Wei
author_facet Zhang, Yinghua
Hou, Wei
author_sort Zhang, Yinghua
collection PubMed
description Extracting rich feature representations is a key challenge in person re-identification (Re-ID) tasks. However, traditional Convolutional Neural Networks (CNN) based methods could ignore a part of information when processing local regions of person images, which leads to incomplete feature extraction. To this end, this paper proposes a person Re-ID method based on vision Transformer with hierarchical structure and window shifting. When extracting person image features, the hierarchical Transformer model is constructed by introducing the hierarchical construction method commonly used in CNN. Then, considering the importance of local information of person images for complete feature extraction, the self-attention calculation is performed by shifting within the window region. Finally, experiments on three standard datasets demonstrate the effectiveness and superiority of the proposed method.
format Online
Article
Text
id pubmed-10313053
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-103130532023-07-01 Vision Transformer with hierarchical structure and windows shifting for person re-identification Zhang, Yinghua Hou, Wei PLoS One Research Article Extracting rich feature representations is a key challenge in person re-identification (Re-ID) tasks. However, traditional Convolutional Neural Networks (CNN) based methods could ignore a part of information when processing local regions of person images, which leads to incomplete feature extraction. To this end, this paper proposes a person Re-ID method based on vision Transformer with hierarchical structure and window shifting. When extracting person image features, the hierarchical Transformer model is constructed by introducing the hierarchical construction method commonly used in CNN. Then, considering the importance of local information of person images for complete feature extraction, the self-attention calculation is performed by shifting within the window region. Finally, experiments on three standard datasets demonstrate the effectiveness and superiority of the proposed method. Public Library of Science 2023-06-30 /pmc/articles/PMC10313053/ /pubmed/37390091 http://dx.doi.org/10.1371/journal.pone.0287979 Text en © 2023 Zhang, Hou https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhang, Yinghua
Hou, Wei
Vision Transformer with hierarchical structure and windows shifting for person re-identification
title Vision Transformer with hierarchical structure and windows shifting for person re-identification
title_full Vision Transformer with hierarchical structure and windows shifting for person re-identification
title_fullStr Vision Transformer with hierarchical structure and windows shifting for person re-identification
title_full_unstemmed Vision Transformer with hierarchical structure and windows shifting for person re-identification
title_short Vision Transformer with hierarchical structure and windows shifting for person re-identification
title_sort vision transformer with hierarchical structure and windows shifting for person re-identification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10313053/
https://www.ncbi.nlm.nih.gov/pubmed/37390091
http://dx.doi.org/10.1371/journal.pone.0287979
work_keys_str_mv AT zhangyinghua visiontransformerwithhierarchicalstructureandwindowsshiftingforpersonreidentification
AT houwei visiontransformerwithhierarchicalstructureandwindowsshiftingforpersonreidentification