Cargando…
Vision Transformer with hierarchical structure and windows shifting for person re-identification
Extracting rich feature representations is a key challenge in person re-identification (Re-ID) tasks. However, traditional Convolutional Neural Networks (CNN) based methods could ignore a part of information when processing local regions of person images, which leads to incomplete feature extraction...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10313053/ https://www.ncbi.nlm.nih.gov/pubmed/37390091 http://dx.doi.org/10.1371/journal.pone.0287979 |
_version_ | 1785067041152040960 |
---|---|
author | Zhang, Yinghua Hou, Wei |
author_facet | Zhang, Yinghua Hou, Wei |
author_sort | Zhang, Yinghua |
collection | PubMed |
description | Extracting rich feature representations is a key challenge in person re-identification (Re-ID) tasks. However, traditional Convolutional Neural Networks (CNN) based methods could ignore a part of information when processing local regions of person images, which leads to incomplete feature extraction. To this end, this paper proposes a person Re-ID method based on vision Transformer with hierarchical structure and window shifting. When extracting person image features, the hierarchical Transformer model is constructed by introducing the hierarchical construction method commonly used in CNN. Then, considering the importance of local information of person images for complete feature extraction, the self-attention calculation is performed by shifting within the window region. Finally, experiments on three standard datasets demonstrate the effectiveness and superiority of the proposed method. |
format | Online Article Text |
id | pubmed-10313053 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-103130532023-07-01 Vision Transformer with hierarchical structure and windows shifting for person re-identification Zhang, Yinghua Hou, Wei PLoS One Research Article Extracting rich feature representations is a key challenge in person re-identification (Re-ID) tasks. However, traditional Convolutional Neural Networks (CNN) based methods could ignore a part of information when processing local regions of person images, which leads to incomplete feature extraction. To this end, this paper proposes a person Re-ID method based on vision Transformer with hierarchical structure and window shifting. When extracting person image features, the hierarchical Transformer model is constructed by introducing the hierarchical construction method commonly used in CNN. Then, considering the importance of local information of person images for complete feature extraction, the self-attention calculation is performed by shifting within the window region. Finally, experiments on three standard datasets demonstrate the effectiveness and superiority of the proposed method. Public Library of Science 2023-06-30 /pmc/articles/PMC10313053/ /pubmed/37390091 http://dx.doi.org/10.1371/journal.pone.0287979 Text en © 2023 Zhang, Hou https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Zhang, Yinghua Hou, Wei Vision Transformer with hierarchical structure and windows shifting for person re-identification |
title | Vision Transformer with hierarchical structure and windows shifting for person re-identification |
title_full | Vision Transformer with hierarchical structure and windows shifting for person re-identification |
title_fullStr | Vision Transformer with hierarchical structure and windows shifting for person re-identification |
title_full_unstemmed | Vision Transformer with hierarchical structure and windows shifting for person re-identification |
title_short | Vision Transformer with hierarchical structure and windows shifting for person re-identification |
title_sort | vision transformer with hierarchical structure and windows shifting for person re-identification |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10313053/ https://www.ncbi.nlm.nih.gov/pubmed/37390091 http://dx.doi.org/10.1371/journal.pone.0287979 |
work_keys_str_mv | AT zhangyinghua visiontransformerwithhierarchicalstructureandwindowsshiftingforpersonreidentification AT houwei visiontransformerwithhierarchicalstructureandwindowsshiftingforpersonreidentification |