Cargando…

Enhancement of Local Crowd Location and Count: Multiscale Counting Guided by Head RGB-Mask

BACKGROUND: In crowded crowd images, traditional detection models often have the problems of inaccurate multiscale target count and low recall rate. METHODS: In order to solve the above two problems, this paper proposes an MLP-CNN model, which combined with FPN feature pyramid can fuse the feature m...

Descripción completa

Detalles Bibliográficos
Autores principales: Ren, Guoyin, Lu, Xiaoqi, Wang, Jingyu, Li, Yuhao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9433205/
https://www.ncbi.nlm.nih.gov/pubmed/36059394
http://dx.doi.org/10.1155/2022/5708807
Descripción
Sumario:BACKGROUND: In crowded crowd images, traditional detection models often have the problems of inaccurate multiscale target count and low recall rate. METHODS: In order to solve the above two problems, this paper proposes an MLP-CNN model, which combined with FPN feature pyramid can fuse the feature map of low-resolution and high-resolution semantic information with less computation and can effectively solve the problem of inaccurate head count of multiscale people. MLP-CNN “mid-term” fusion model can effectively fuse the features of RGB head image and RGB-Mask image. With the help of head RGB-Mask annotation and adaptive Gaussian kernel regression, the enhanced density map can be generated, which can effectively solve the problem of low recall of head detection. RESULTS: MLP-CNN model was applied in ShanghaiTech and UCF_ CC_ 50 and UCF-QNRF. The test results show that the error of the method proposed in this paper has been significantly improved, and the recall rate can reach 79.91%. CONCLUSION: MLP-CNN model not only improves the accuracy of population counting in density map regression, but also improves the detection rate of multiscale population head targets.