Cargando…

Learning geometric Jensen-Shannon divergence for tiny object detection in remote sensing images

Tiny objects in remote sensing images only have a few pixels, and the detection difficulty is much higher than that of regular objects. General object detectors lack effective extraction of tiny object features, and are sensitive to the Intersection-over-Union (IoU) calculation and the threshold set...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ni, Shuyan, Lin, Cunbao, Wang, Haining, Li, Yang, Liao, Yurong, Li, Na
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2023
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665846/ https://www.ncbi.nlm.nih.gov/pubmed/38023452 http://dx.doi.org/10.3389/fnbot.2023.1273251

_version_	1785138916448272384
author	Ni, Shuyan Lin, Cunbao Wang, Haining Li, Yang Liao, Yurong Li, Na
author_facet	Ni, Shuyan Lin, Cunbao Wang, Haining Li, Yang Liao, Yurong Li, Na
author_sort	Ni, Shuyan
collection	PubMed
description	Tiny objects in remote sensing images only have a few pixels, and the detection difficulty is much higher than that of regular objects. General object detectors lack effective extraction of tiny object features, and are sensitive to the Intersection-over-Union (IoU) calculation and the threshold setting in the prediction stage. Therefore, it is particularly important to design a tiny-object-specific detector that can avoid the above problems. This article proposes the network JSDNet by learning the geometric Jensen-Shannon (JS) divergence representation between Gaussian distributions. First, the Swin Transformer model is integrated into the feature extraction stage as the backbone to improve the feature extraction capability of JSDNet for tiny objects. Second, the anchor box and ground-truth are modeled as two two-dimensional (2D) Gaussian distributions, so that the tiny object is represented as a statistical distribution model. Then, in view of the sensitivity problem faced by the IoU calculation for tiny objects, the JSDM module is designed as a regression sub-network, and the geometric JS divergence between two Gaussian distributions is derived from the perspective of information geometry to guide the regression prediction of anchor boxes. Experiments on the AI-TOD and DOTA datasets show that JSDNet can achieve superior detection performance for tiny objects compared to state-of-the-art general object detectors.
format	Online Article Text
id	pubmed-10665846
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-106658462023-01-01 Learning geometric Jensen-Shannon divergence for tiny object detection in remote sensing images Ni, Shuyan Lin, Cunbao Wang, Haining Li, Yang Liao, Yurong Li, Na Front Neurorobot Neuroscience Tiny objects in remote sensing images only have a few pixels, and the detection difficulty is much higher than that of regular objects. General object detectors lack effective extraction of tiny object features, and are sensitive to the Intersection-over-Union (IoU) calculation and the threshold setting in the prediction stage. Therefore, it is particularly important to design a tiny-object-specific detector that can avoid the above problems. This article proposes the network JSDNet by learning the geometric Jensen-Shannon (JS) divergence representation between Gaussian distributions. First, the Swin Transformer model is integrated into the feature extraction stage as the backbone to improve the feature extraction capability of JSDNet for tiny objects. Second, the anchor box and ground-truth are modeled as two two-dimensional (2D) Gaussian distributions, so that the tiny object is represented as a statistical distribution model. Then, in view of the sensitivity problem faced by the IoU calculation for tiny objects, the JSDM module is designed as a regression sub-network, and the geometric JS divergence between two Gaussian distributions is derived from the perspective of information geometry to guide the regression prediction of anchor boxes. Experiments on the AI-TOD and DOTA datasets show that JSDNet can achieve superior detection performance for tiny objects compared to state-of-the-art general object detectors. Frontiers Media S.A. 2023-11-09 /pmc/articles/PMC10665846/ /pubmed/38023452 http://dx.doi.org/10.3389/fnbot.2023.1273251 Text en Copyright © 2023 Ni, Lin, Wang, Li, Liao and Li. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Ni, Shuyan Lin, Cunbao Wang, Haining Li, Yang Liao, Yurong Li, Na Learning geometric Jensen-Shannon divergence for tiny object detection in remote sensing images
title	Learning geometric Jensen-Shannon divergence for tiny object detection in remote sensing images
title_full	Learning geometric Jensen-Shannon divergence for tiny object detection in remote sensing images
title_fullStr	Learning geometric Jensen-Shannon divergence for tiny object detection in remote sensing images
title_full_unstemmed	Learning geometric Jensen-Shannon divergence for tiny object detection in remote sensing images
title_short	Learning geometric Jensen-Shannon divergence for tiny object detection in remote sensing images
title_sort	learning geometric jensen-shannon divergence for tiny object detection in remote sensing images
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665846/ https://www.ncbi.nlm.nih.gov/pubmed/38023452 http://dx.doi.org/10.3389/fnbot.2023.1273251
work_keys_str_mv	AT nishuyan learninggeometricjensenshannondivergencefortinyobjectdetectioninremotesensingimages AT lincunbao learninggeometricjensenshannondivergencefortinyobjectdetectioninremotesensingimages AT wanghaining learninggeometricjensenshannondivergencefortinyobjectdetectioninremotesensingimages AT liyang learninggeometricjensenshannondivergencefortinyobjectdetectioninremotesensingimages AT liaoyurong learninggeometricjensenshannondivergencefortinyobjectdetectioninremotesensingimages AT lina learninggeometricjensenshannondivergencefortinyobjectdetectioninremotesensingimages

Learning geometric Jensen-Shannon divergence for tiny object detection in remote sensing images

Ejemplares similares