Cargando…

Supervised Deep Learning Techniques for Image Description: A Systematic Review

Automatic image description, also known as image captioning, aims to describe the elements included in an image and their relationships. This task involves two research fields: computer vision and natural language processing; thus, it has received much attention in computer science. In this review p...

Descripción completa

Detalles Bibliográficos
Autores principales: López-Sánchez, Marco, Hernández-Ocaña, Betania, Chávez-Bosquez, Oscar, Hernández-Torruco, José
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10138089/
https://www.ncbi.nlm.nih.gov/pubmed/37190341
http://dx.doi.org/10.3390/e25040553
_version_ 1785032624063905792
author López-Sánchez, Marco
Hernández-Ocaña, Betania
Chávez-Bosquez, Oscar
Hernández-Torruco, José
author_facet López-Sánchez, Marco
Hernández-Ocaña, Betania
Chávez-Bosquez, Oscar
Hernández-Torruco, José
author_sort López-Sánchez, Marco
collection PubMed
description Automatic image description, also known as image captioning, aims to describe the elements included in an image and their relationships. This task involves two research fields: computer vision and natural language processing; thus, it has received much attention in computer science. In this review paper, we follow the Kitchenham review methodology to present the most relevant approaches to image description methodologies based on deep learning. We focused on works using convolutional neural networks (CNN) to extract the characteristics of images and recurrent neural networks (RNN) for automatic sentence generation. As a result, 53 research articles using the encoder-decoder approach were selected, focusing only on supervised learning. The main contributions of this systematic review are: (i) to describe the most relevant image description papers implementing an encoder-decoder approach from 2014 to 2022 and (ii) to determine the main architectures, datasets, and metrics that have been applied to image description.
format Online
Article
Text
id pubmed-10138089
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-101380892023-04-28 Supervised Deep Learning Techniques for Image Description: A Systematic Review López-Sánchez, Marco Hernández-Ocaña, Betania Chávez-Bosquez, Oscar Hernández-Torruco, José Entropy (Basel) Review Automatic image description, also known as image captioning, aims to describe the elements included in an image and their relationships. This task involves two research fields: computer vision and natural language processing; thus, it has received much attention in computer science. In this review paper, we follow the Kitchenham review methodology to present the most relevant approaches to image description methodologies based on deep learning. We focused on works using convolutional neural networks (CNN) to extract the characteristics of images and recurrent neural networks (RNN) for automatic sentence generation. As a result, 53 research articles using the encoder-decoder approach were selected, focusing only on supervised learning. The main contributions of this systematic review are: (i) to describe the most relevant image description papers implementing an encoder-decoder approach from 2014 to 2022 and (ii) to determine the main architectures, datasets, and metrics that have been applied to image description. MDPI 2023-03-23 /pmc/articles/PMC10138089/ /pubmed/37190341 http://dx.doi.org/10.3390/e25040553 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
López-Sánchez, Marco
Hernández-Ocaña, Betania
Chávez-Bosquez, Oscar
Hernández-Torruco, José
Supervised Deep Learning Techniques for Image Description: A Systematic Review
title Supervised Deep Learning Techniques for Image Description: A Systematic Review
title_full Supervised Deep Learning Techniques for Image Description: A Systematic Review
title_fullStr Supervised Deep Learning Techniques for Image Description: A Systematic Review
title_full_unstemmed Supervised Deep Learning Techniques for Image Description: A Systematic Review
title_short Supervised Deep Learning Techniques for Image Description: A Systematic Review
title_sort supervised deep learning techniques for image description: a systematic review
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10138089/
https://www.ncbi.nlm.nih.gov/pubmed/37190341
http://dx.doi.org/10.3390/e25040553
work_keys_str_mv AT lopezsanchezmarco superviseddeeplearningtechniquesforimagedescriptionasystematicreview
AT hernandezocanabetania superviseddeeplearningtechniquesforimagedescriptionasystematicreview
AT chavezbosquezoscar superviseddeeplearningtechniquesforimagedescriptionasystematicreview
AT hernandeztorrucojose superviseddeeplearningtechniquesforimagedescriptionasystematicreview