Cargando…
Offset-decoupled deformable convolution for efficient crowd counting
Crowd counting is considered a challenging issue in computer vision. One of the most critical challenges in crowd counting is considering the impact of scale variations. Compared with other methods, better performance is achieved with CNN-based methods. However, given the limit of fixed geometric st...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9293988/ https://www.ncbi.nlm.nih.gov/pubmed/35851829 http://dx.doi.org/10.1038/s41598-022-16415-9 |
_version_ | 1784749759050809344 |
---|---|
author | Zhong, Xin Qin, Jing Guo, Mingyue Zuo, Wangmeng Lu, Weigang |
author_facet | Zhong, Xin Qin, Jing Guo, Mingyue Zuo, Wangmeng Lu, Weigang |
author_sort | Zhong, Xin |
collection | PubMed |
description | Crowd counting is considered a challenging issue in computer vision. One of the most critical challenges in crowd counting is considering the impact of scale variations. Compared with other methods, better performance is achieved with CNN-based methods. However, given the limit of fixed geometric structures, the head-scale features are not completely obtained. Deformable convolution with additional offsets is widely used in the fields of image classification and pattern recognition, as it can successfully exploit the potential of spatial information. However, owing to the randomly generated parameters of offsets in network initialization, the sampling points of the deformable convolution are disorderly stacked, weakening the effectiveness of feature extraction. To handle the invalid learning of offsets and the inefficient utilization of deformable convolution, an offset-decoupled deformable convolution (ODConv) is proposed in this paper. It can completely obtain information within the effective region of sampling points, leading to better performance. In extensive experiments, average MAE of 62.3, 8.3, 91.9, and 159.3 are achieved using our method on the ShanghaiTech A, ShanghaiTech B, UCF-QNRF, and UCF_CC_50 datasets, respectively, outperforming the state-of-the-art methods and validating the effectiveness of the proposed ODConv. |
format | Online Article Text |
id | pubmed-9293988 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-92939882022-07-20 Offset-decoupled deformable convolution for efficient crowd counting Zhong, Xin Qin, Jing Guo, Mingyue Zuo, Wangmeng Lu, Weigang Sci Rep Article Crowd counting is considered a challenging issue in computer vision. One of the most critical challenges in crowd counting is considering the impact of scale variations. Compared with other methods, better performance is achieved with CNN-based methods. However, given the limit of fixed geometric structures, the head-scale features are not completely obtained. Deformable convolution with additional offsets is widely used in the fields of image classification and pattern recognition, as it can successfully exploit the potential of spatial information. However, owing to the randomly generated parameters of offsets in network initialization, the sampling points of the deformable convolution are disorderly stacked, weakening the effectiveness of feature extraction. To handle the invalid learning of offsets and the inefficient utilization of deformable convolution, an offset-decoupled deformable convolution (ODConv) is proposed in this paper. It can completely obtain information within the effective region of sampling points, leading to better performance. In extensive experiments, average MAE of 62.3, 8.3, 91.9, and 159.3 are achieved using our method on the ShanghaiTech A, ShanghaiTech B, UCF-QNRF, and UCF_CC_50 datasets, respectively, outperforming the state-of-the-art methods and validating the effectiveness of the proposed ODConv. Nature Publishing Group UK 2022-07-18 /pmc/articles/PMC9293988/ /pubmed/35851829 http://dx.doi.org/10.1038/s41598-022-16415-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Zhong, Xin Qin, Jing Guo, Mingyue Zuo, Wangmeng Lu, Weigang Offset-decoupled deformable convolution for efficient crowd counting |
title | Offset-decoupled deformable convolution for efficient crowd counting |
title_full | Offset-decoupled deformable convolution for efficient crowd counting |
title_fullStr | Offset-decoupled deformable convolution for efficient crowd counting |
title_full_unstemmed | Offset-decoupled deformable convolution for efficient crowd counting |
title_short | Offset-decoupled deformable convolution for efficient crowd counting |
title_sort | offset-decoupled deformable convolution for efficient crowd counting |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9293988/ https://www.ncbi.nlm.nih.gov/pubmed/35851829 http://dx.doi.org/10.1038/s41598-022-16415-9 |
work_keys_str_mv | AT zhongxin offsetdecoupleddeformableconvolutionforefficientcrowdcounting AT qinjing offsetdecoupleddeformableconvolutionforefficientcrowdcounting AT guomingyue offsetdecoupleddeformableconvolutionforefficientcrowdcounting AT zuowangmeng offsetdecoupleddeformableconvolutionforefficientcrowdcounting AT luweigang offsetdecoupleddeformableconvolutionforefficientcrowdcounting |