Cargando…

Spatial-aware topic-driven-based image Chinese caption for disaster news

Automatically generating descriptions for disaster news images could effectively accelerate the spread of disaster message and lighten the burden of news editors from tedious news materials. Image caption algorithms are remarkable for generating captions directly from the content of the image. Howev...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhou, Jinfei, Zhu, Yaping, Zhang, Yana, Yang, Cheng, Pan, Hong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer London 2023
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019430/ https://www.ncbi.nlm.nih.gov/pubmed/37077618 http://dx.doi.org/10.1007/s00521-022-08072-w

_version_	1784908027184283648
author	Zhou, Jinfei Zhu, Yaping Zhang, Yana Yang, Cheng Pan, Hong
author_facet	Zhou, Jinfei Zhu, Yaping Zhang, Yana Yang, Cheng Pan, Hong
author_sort	Zhou, Jinfei
collection	PubMed
description	Automatically generating descriptions for disaster news images could effectively accelerate the spread of disaster message and lighten the burden of news editors from tedious news materials. Image caption algorithms are remarkable for generating captions directly from the content of the image. However, current image caption algorithms trained on existing image caption datasets fail to describe the disaster images with fundamental news elements. In this paper, we developed a large-scale disaster news image Chinese caption dataset (DNICC19k), which collected and annotated enormous news images related to disaster. Furthermore, we proposed a spatial-aware topic driven caption network (STCNet) to encode the interrelationships between these news objects and generate descriptive sentences related to news topics. STCNet firstly constructs a graph representation based on objects feature similarity. The graph reasoning module uses the spatial information to infer the weights of aggregated adjacent nodes according to a learnable Gaussian kernel function. Finally, the generation of news sentences are driven by the spatial-aware graph representations and the news topics distribution. Experimental results demonstrate that STCNet trained on DNICC19k could not only automatically creates descriptive sentences related to news topics for disaster news images, but also outperforms benchmark models such as Bottom-up, NIC, Show attend and AoANet on multiple evaluation metrics, achieving CIDEr/BLEU-4 scores of 60.26 and 17.01, respectively.
format	Online Article Text
id	pubmed-10019430
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Springer London
record_format	MEDLINE/PubMed
spelling	pubmed-100194302023-03-16 Spatial-aware topic-driven-based image Chinese caption for disaster news Zhou, Jinfei Zhu, Yaping Zhang, Yana Yang, Cheng Pan, Hong Neural Comput Appl Original Article Automatically generating descriptions for disaster news images could effectively accelerate the spread of disaster message and lighten the burden of news editors from tedious news materials. Image caption algorithms are remarkable for generating captions directly from the content of the image. However, current image caption algorithms trained on existing image caption datasets fail to describe the disaster images with fundamental news elements. In this paper, we developed a large-scale disaster news image Chinese caption dataset (DNICC19k), which collected and annotated enormous news images related to disaster. Furthermore, we proposed a spatial-aware topic driven caption network (STCNet) to encode the interrelationships between these news objects and generate descriptive sentences related to news topics. STCNet firstly constructs a graph representation based on objects feature similarity. The graph reasoning module uses the spatial information to infer the weights of aggregated adjacent nodes according to a learnable Gaussian kernel function. Finally, the generation of news sentences are driven by the spatial-aware graph representations and the news topics distribution. Experimental results demonstrate that STCNet trained on DNICC19k could not only automatically creates descriptive sentences related to news topics for disaster news images, but also outperforms benchmark models such as Bottom-up, NIC, Show attend and AoANet on multiple evaluation metrics, achieving CIDEr/BLEU-4 scores of 60.26 and 17.01, respectively. Springer London 2023-03-16 2023 /pmc/articles/PMC10019430/ /pubmed/37077618 http://dx.doi.org/10.1007/s00521-022-08072-w Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle	Original Article Zhou, Jinfei Zhu, Yaping Zhang, Yana Yang, Cheng Pan, Hong Spatial-aware topic-driven-based image Chinese caption for disaster news
title	Spatial-aware topic-driven-based image Chinese caption for disaster news
title_full	Spatial-aware topic-driven-based image Chinese caption for disaster news
title_fullStr	Spatial-aware topic-driven-based image Chinese caption for disaster news
title_full_unstemmed	Spatial-aware topic-driven-based image Chinese caption for disaster news
title_short	Spatial-aware topic-driven-based image Chinese caption for disaster news
title_sort	spatial-aware topic-driven-based image chinese caption for disaster news
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019430/ https://www.ncbi.nlm.nih.gov/pubmed/37077618 http://dx.doi.org/10.1007/s00521-022-08072-w
work_keys_str_mv	AT zhoujinfei spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews AT zhuyaping spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews AT zhangyana spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews AT yangcheng spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews AT panhong spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews

Spatial-aware topic-driven-based image Chinese caption for disaster news

Ejemplares similares