Cargando…
Spatial-aware topic-driven-based image Chinese caption for disaster news
Automatically generating descriptions for disaster news images could effectively accelerate the spread of disaster message and lighten the burden of news editors from tedious news materials. Image caption algorithms are remarkable for generating captions directly from the content of the image. Howev...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer London
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019430/ https://www.ncbi.nlm.nih.gov/pubmed/37077618 http://dx.doi.org/10.1007/s00521-022-08072-w |
_version_ | 1784908027184283648 |
---|---|
author | Zhou, Jinfei Zhu, Yaping Zhang, Yana Yang, Cheng Pan, Hong |
author_facet | Zhou, Jinfei Zhu, Yaping Zhang, Yana Yang, Cheng Pan, Hong |
author_sort | Zhou, Jinfei |
collection | PubMed |
description | Automatically generating descriptions for disaster news images could effectively accelerate the spread of disaster message and lighten the burden of news editors from tedious news materials. Image caption algorithms are remarkable for generating captions directly from the content of the image. However, current image caption algorithms trained on existing image caption datasets fail to describe the disaster images with fundamental news elements. In this paper, we developed a large-scale disaster news image Chinese caption dataset (DNICC19k), which collected and annotated enormous news images related to disaster. Furthermore, we proposed a spatial-aware topic driven caption network (STCNet) to encode the interrelationships between these news objects and generate descriptive sentences related to news topics. STCNet firstly constructs a graph representation based on objects feature similarity. The graph reasoning module uses the spatial information to infer the weights of aggregated adjacent nodes according to a learnable Gaussian kernel function. Finally, the generation of news sentences are driven by the spatial-aware graph representations and the news topics distribution. Experimental results demonstrate that STCNet trained on DNICC19k could not only automatically creates descriptive sentences related to news topics for disaster news images, but also outperforms benchmark models such as Bottom-up, NIC, Show attend and AoANet on multiple evaluation metrics, achieving CIDEr/BLEU-4 scores of 60.26 and 17.01, respectively. |
format | Online Article Text |
id | pubmed-10019430 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer London |
record_format | MEDLINE/PubMed |
spelling | pubmed-100194302023-03-16 Spatial-aware topic-driven-based image Chinese caption for disaster news Zhou, Jinfei Zhu, Yaping Zhang, Yana Yang, Cheng Pan, Hong Neural Comput Appl Original Article Automatically generating descriptions for disaster news images could effectively accelerate the spread of disaster message and lighten the burden of news editors from tedious news materials. Image caption algorithms are remarkable for generating captions directly from the content of the image. However, current image caption algorithms trained on existing image caption datasets fail to describe the disaster images with fundamental news elements. In this paper, we developed a large-scale disaster news image Chinese caption dataset (DNICC19k), which collected and annotated enormous news images related to disaster. Furthermore, we proposed a spatial-aware topic driven caption network (STCNet) to encode the interrelationships between these news objects and generate descriptive sentences related to news topics. STCNet firstly constructs a graph representation based on objects feature similarity. The graph reasoning module uses the spatial information to infer the weights of aggregated adjacent nodes according to a learnable Gaussian kernel function. Finally, the generation of news sentences are driven by the spatial-aware graph representations and the news topics distribution. Experimental results demonstrate that STCNet trained on DNICC19k could not only automatically creates descriptive sentences related to news topics for disaster news images, but also outperforms benchmark models such as Bottom-up, NIC, Show attend and AoANet on multiple evaluation metrics, achieving CIDEr/BLEU-4 scores of 60.26 and 17.01, respectively. Springer London 2023-03-16 2023 /pmc/articles/PMC10019430/ /pubmed/37077618 http://dx.doi.org/10.1007/s00521-022-08072-w Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Original Article Zhou, Jinfei Zhu, Yaping Zhang, Yana Yang, Cheng Pan, Hong Spatial-aware topic-driven-based image Chinese caption for disaster news |
title | Spatial-aware topic-driven-based image Chinese caption for disaster news |
title_full | Spatial-aware topic-driven-based image Chinese caption for disaster news |
title_fullStr | Spatial-aware topic-driven-based image Chinese caption for disaster news |
title_full_unstemmed | Spatial-aware topic-driven-based image Chinese caption for disaster news |
title_short | Spatial-aware topic-driven-based image Chinese caption for disaster news |
title_sort | spatial-aware topic-driven-based image chinese caption for disaster news |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019430/ https://www.ncbi.nlm.nih.gov/pubmed/37077618 http://dx.doi.org/10.1007/s00521-022-08072-w |
work_keys_str_mv | AT zhoujinfei spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews AT zhuyaping spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews AT zhangyana spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews AT yangcheng spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews AT panhong spatialawaretopicdrivenbasedimagechinesecaptionfordisasternews |