Cargando…

Spatial-aware topic-driven-based image Chinese caption for disaster news

Automatically generating descriptions for disaster news images could effectively accelerate the spread of disaster message and lighten the burden of news editors from tedious news materials. Image caption algorithms are remarkable for generating captions directly from the content of the image. Howev...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Jinfei, Zhu, Yaping, Zhang, Yana, Yang, Cheng, Pan, Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer London 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019430/
https://www.ncbi.nlm.nih.gov/pubmed/37077618
http://dx.doi.org/10.1007/s00521-022-08072-w
_version_ 1784908027184283648
author Zhou, Jinfei
Zhu, Yaping
Zhang, Yana
Yang, Cheng
Pan, Hong
author_facet Zhou, Jinfei
Zhu, Yaping
Zhang, Yana
Yang, Cheng
Pan, Hong
author_sort Zhou, Jinfei
collection PubMed
description Automatically generating descriptions for disaster news images could effectively accelerate the spread of disaster message and lighten the burden of news editors from tedious news materials. Image caption algorithms are remarkable for generating captions directly from the content of the image. However, current image caption algorithms trained on existing image caption datasets fail to describe the disaster images with fundamental news elements. In this paper, we developed a large-scale disaster news image Chinese caption dataset (DNICC19k), which collected and annotated enormous news images related to disaster. Furthermore, we proposed a spatial-aware topic driven caption network (STCNet) to encode the interrelationships between these news objects and generate descriptive sentences related to news topics. STCNet firstly constructs a graph representation based on objects feature similarity. The graph reasoning module uses the spatial information to infer the weights of aggregated adjacent nodes according to a learnable Gaussian kernel function. Finally, the generation of news sentences are driven by the spatial-aware graph representations and the news topics distribution. Experimental results demonstrate that STCNet trained on DNICC19k could not only automatically creates descriptive sentences related to news topics for disaster news images, but also outperforms benchmark models such as Bottom-up, NIC, Show attend and AoANet on multiple evaluation metrics, achieving CIDEr/BLEU-4 scores of 60.26 and 17.01, respectively.
format Online
Article
Text
id pubmed-10019430
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer London
record_format MEDLINE/PubMed
spelling pubmed-100194302023-03-16 Spatial-aware topic-driven-based image Chinese caption for disaster news Zhou, Jinfei Zhu, Yaping Zhang, Yana Yang, Cheng Pan, Hong Neural Comput Appl Original Article Automatically generating descriptions for disaster news images could effectively accelerate the spread of disaster message and lighten the burden of news editors from tedious news materials. Image caption algorithms are remarkable for generating captions directly from the content of the image. However, current image caption algorithms trained on existing image caption datasets fail to describe the disaster images with fundamental news elements. In this paper, we developed a large-scale disaster news image Chinese caption dataset (DNICC19k), which collected and annotated enormous news images related to disaster. Furthermore, we proposed a spatial-aware topic driven caption network (STCNet) to encode the interrelationships between these news objects and generate descriptive sentences related to news topics. STCNet firstly constructs a graph representation based on objects feature similarity. The graph reasoning module uses the spatial information to infer the weights of aggregated adjacent nodes according to a learnable Gaussian kernel function. Finally, the generation of news sentences are driven by the spatial-aware graph representations and the news topics distribution. Experimental results demonstrate that STCNet trained on DNICC19k could not only automatically creates descriptive sentences related to news topics for disaster news images, but also outperforms benchmark models such as Bottom-up, NIC, Show attend and AoANet on multiple evaluation metrics, achieving CIDEr/BLEU-4 scores of 60.26 and 17.01, respectively. Springer London 2023-03-16 2023 /pmc/articles/PMC10019430/ /pubmed/37077618 http://dx.doi.org/10.1007/s00521-022-08072-w Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Article
Zhou, Jinfei
Zhu, Yaping
Zhang, Yana
Yang, Cheng
Pan, Hong
Spatial-aware topic-driven-based image Chinese caption for disaster news
title Spatial-aware topic-driven-based image Chinese caption for disaster news
title_full Spatial-aware topic-driven-based image Chinese caption for disaster news
title_fullStr Spatial-aware topic-driven-based image Chinese caption for disaster news
title_full_unstemmed Spatial-aware topic-driven-based image Chinese caption for disaster news
title_short Spatial-aware topic-driven-based image Chinese caption for disaster news
title_sort spatial-aware topic-driven-based image chinese caption for disaster news
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019430/
https://www.ncbi.nlm.nih.gov/pubmed/37077618
http://dx.doi.org/10.1007/s00521-022-08072-w
work_keys_str_mv AT zhoujinfei spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews
AT zhuyaping spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews
AT zhangyana spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews
AT yangcheng spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews
AT panhong spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews