Cargando…

Attention-Guided Image Captioning through Word Information

Image captioning generates written descriptions of an image. In recent image captioning research, attention regions seldom cover all objects, and generated captions may lack the details of objects and may remain far from reality. In this paper, we propose a word guided attention (WGA) method for ima...

Descripción completa

Detalles Bibliográficos
Autores principales:	Tang, Ziwei, Yi, Yaohua, Sheng, Hao
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659425/ https://www.ncbi.nlm.nih.gov/pubmed/34883986 http://dx.doi.org/10.3390/s21237982

_version_	1784612959157223424
author	Tang, Ziwei Yi, Yaohua Sheng, Hao
author_facet	Tang, Ziwei Yi, Yaohua Sheng, Hao
author_sort	Tang, Ziwei
collection	PubMed
description	Image captioning generates written descriptions of an image. In recent image captioning research, attention regions seldom cover all objects, and generated captions may lack the details of objects and may remain far from reality. In this paper, we propose a word guided attention (WGA) method for image captioning. First, WGA extracts word information using the embedded word and memory cell by applying transformation and multiplication. Then, WGA applies word information to the attention results and obtains the attended feature vectors via elementwise multiplication. Finally, we apply WGA with the words from different time steps to obtain previous word guided attention (PW) and current word attention (CW) in the decoder. Experiments on the MSCOCO dataset show that our proposed WGA can achieve competitive performance against state-of-the-art methods, with PW results of a 39.1 Bilingual Evaluation Understudy score (BLEU-4) and a 127.6 Consensus-Based Image Description Evaluation score (CIDEr-D); and CW results of a 39.1 BLEU-4 score and a 127.2 CIDER-D score on a Karpathy test split.
format	Online Article Text
id	pubmed-8659425
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-86594252021-12-10 Attention-Guided Image Captioning through Word Information Tang, Ziwei Yi, Yaohua Sheng, Hao Sensors (Basel) Article Image captioning generates written descriptions of an image. In recent image captioning research, attention regions seldom cover all objects, and generated captions may lack the details of objects and may remain far from reality. In this paper, we propose a word guided attention (WGA) method for image captioning. First, WGA extracts word information using the embedded word and memory cell by applying transformation and multiplication. Then, WGA applies word information to the attention results and obtains the attended feature vectors via elementwise multiplication. Finally, we apply WGA with the words from different time steps to obtain previous word guided attention (PW) and current word attention (CW) in the decoder. Experiments on the MSCOCO dataset show that our proposed WGA can achieve competitive performance against state-of-the-art methods, with PW results of a 39.1 Bilingual Evaluation Understudy score (BLEU-4) and a 127.6 Consensus-Based Image Description Evaluation score (CIDEr-D); and CW results of a 39.1 BLEU-4 score and a 127.2 CIDER-D score on a Karpathy test split. MDPI 2021-11-30 /pmc/articles/PMC8659425/ /pubmed/34883986 http://dx.doi.org/10.3390/s21237982 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Tang, Ziwei Yi, Yaohua Sheng, Hao Attention-Guided Image Captioning through Word Information
title	Attention-Guided Image Captioning through Word Information
title_full	Attention-Guided Image Captioning through Word Information
title_fullStr	Attention-Guided Image Captioning through Word Information
title_full_unstemmed	Attention-Guided Image Captioning through Word Information
title_short	Attention-Guided Image Captioning through Word Information
title_sort	attention-guided image captioning through word information
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659425/ https://www.ncbi.nlm.nih.gov/pubmed/34883986 http://dx.doi.org/10.3390/s21237982
work_keys_str_mv	AT tangziwei attentionguidedimagecaptioningthroughwordinformation AT yiyaohua attentionguidedimagecaptioningthroughwordinformation AT shenghao attentionguidedimagecaptioningthroughwordinformation

Attention-Guided Image Captioning through Word Information

Ejemplares similares