Cargando…

Offset-decoupled deformable convolution for efficient crowd counting

Crowd counting is considered a challenging issue in computer vision. One of the most critical challenges in crowd counting is considering the impact of scale variations. Compared with other methods, better performance is achieved with CNN-based methods. However, given the limit of fixed geometric st...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhong, Xin, Qin, Jing, Guo, Mingyue, Zuo, Wangmeng, Lu, Weigang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9293988/
https://www.ncbi.nlm.nih.gov/pubmed/35851829
http://dx.doi.org/10.1038/s41598-022-16415-9
_version_ 1784749759050809344
author Zhong, Xin
Qin, Jing
Guo, Mingyue
Zuo, Wangmeng
Lu, Weigang
author_facet Zhong, Xin
Qin, Jing
Guo, Mingyue
Zuo, Wangmeng
Lu, Weigang
author_sort Zhong, Xin
collection PubMed
description Crowd counting is considered a challenging issue in computer vision. One of the most critical challenges in crowd counting is considering the impact of scale variations. Compared with other methods, better performance is achieved with CNN-based methods. However, given the limit of fixed geometric structures, the head-scale features are not completely obtained. Deformable convolution with additional offsets is widely used in the fields of image classification and pattern recognition, as it can successfully exploit the potential of spatial information. However, owing to the randomly generated parameters of offsets in network initialization, the sampling points of the deformable convolution are disorderly stacked, weakening the effectiveness of feature extraction. To handle the invalid learning of offsets and the inefficient utilization of deformable convolution, an offset-decoupled deformable convolution (ODConv) is proposed in this paper. It can completely obtain information within the effective region of sampling points, leading to better performance. In extensive experiments, average MAE of 62.3, 8.3, 91.9, and 159.3 are achieved using our method on the ShanghaiTech A, ShanghaiTech B, UCF-QNRF, and UCF_CC_50 datasets, respectively, outperforming the state-of-the-art methods and validating the effectiveness of the proposed ODConv.
format Online
Article
Text
id pubmed-9293988
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-92939882022-07-20 Offset-decoupled deformable convolution for efficient crowd counting Zhong, Xin Qin, Jing Guo, Mingyue Zuo, Wangmeng Lu, Weigang Sci Rep Article Crowd counting is considered a challenging issue in computer vision. One of the most critical challenges in crowd counting is considering the impact of scale variations. Compared with other methods, better performance is achieved with CNN-based methods. However, given the limit of fixed geometric structures, the head-scale features are not completely obtained. Deformable convolution with additional offsets is widely used in the fields of image classification and pattern recognition, as it can successfully exploit the potential of spatial information. However, owing to the randomly generated parameters of offsets in network initialization, the sampling points of the deformable convolution are disorderly stacked, weakening the effectiveness of feature extraction. To handle the invalid learning of offsets and the inefficient utilization of deformable convolution, an offset-decoupled deformable convolution (ODConv) is proposed in this paper. It can completely obtain information within the effective region of sampling points, leading to better performance. In extensive experiments, average MAE of 62.3, 8.3, 91.9, and 159.3 are achieved using our method on the ShanghaiTech A, ShanghaiTech B, UCF-QNRF, and UCF_CC_50 datasets, respectively, outperforming the state-of-the-art methods and validating the effectiveness of the proposed ODConv. Nature Publishing Group UK 2022-07-18 /pmc/articles/PMC9293988/ /pubmed/35851829 http://dx.doi.org/10.1038/s41598-022-16415-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Zhong, Xin
Qin, Jing
Guo, Mingyue
Zuo, Wangmeng
Lu, Weigang
Offset-decoupled deformable convolution for efficient crowd counting
title Offset-decoupled deformable convolution for efficient crowd counting
title_full Offset-decoupled deformable convolution for efficient crowd counting
title_fullStr Offset-decoupled deformable convolution for efficient crowd counting
title_full_unstemmed Offset-decoupled deformable convolution for efficient crowd counting
title_short Offset-decoupled deformable convolution for efficient crowd counting
title_sort offset-decoupled deformable convolution for efficient crowd counting
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9293988/
https://www.ncbi.nlm.nih.gov/pubmed/35851829
http://dx.doi.org/10.1038/s41598-022-16415-9
work_keys_str_mv AT zhongxin offsetdecoupleddeformableconvolutionforefficientcrowdcounting
AT qinjing offsetdecoupleddeformableconvolutionforefficientcrowdcounting
AT guomingyue offsetdecoupleddeformableconvolutionforefficientcrowdcounting
AT zuowangmeng offsetdecoupleddeformableconvolutionforefficientcrowdcounting
AT luweigang offsetdecoupleddeformableconvolutionforefficientcrowdcounting