Cargando…
Supervised Deep Learning Techniques for Image Description: A Systematic Review
Automatic image description, also known as image captioning, aims to describe the elements included in an image and their relationships. This task involves two research fields: computer vision and natural language processing; thus, it has received much attention in computer science. In this review p...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10138089/ https://www.ncbi.nlm.nih.gov/pubmed/37190341 http://dx.doi.org/10.3390/e25040553 |
_version_ | 1785032624063905792 |
---|---|
author | López-Sánchez, Marco Hernández-Ocaña, Betania Chávez-Bosquez, Oscar Hernández-Torruco, José |
author_facet | López-Sánchez, Marco Hernández-Ocaña, Betania Chávez-Bosquez, Oscar Hernández-Torruco, José |
author_sort | López-Sánchez, Marco |
collection | PubMed |
description | Automatic image description, also known as image captioning, aims to describe the elements included in an image and their relationships. This task involves two research fields: computer vision and natural language processing; thus, it has received much attention in computer science. In this review paper, we follow the Kitchenham review methodology to present the most relevant approaches to image description methodologies based on deep learning. We focused on works using convolutional neural networks (CNN) to extract the characteristics of images and recurrent neural networks (RNN) for automatic sentence generation. As a result, 53 research articles using the encoder-decoder approach were selected, focusing only on supervised learning. The main contributions of this systematic review are: (i) to describe the most relevant image description papers implementing an encoder-decoder approach from 2014 to 2022 and (ii) to determine the main architectures, datasets, and metrics that have been applied to image description. |
format | Online Article Text |
id | pubmed-10138089 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-101380892023-04-28 Supervised Deep Learning Techniques for Image Description: A Systematic Review López-Sánchez, Marco Hernández-Ocaña, Betania Chávez-Bosquez, Oscar Hernández-Torruco, José Entropy (Basel) Review Automatic image description, also known as image captioning, aims to describe the elements included in an image and their relationships. This task involves two research fields: computer vision and natural language processing; thus, it has received much attention in computer science. In this review paper, we follow the Kitchenham review methodology to present the most relevant approaches to image description methodologies based on deep learning. We focused on works using convolutional neural networks (CNN) to extract the characteristics of images and recurrent neural networks (RNN) for automatic sentence generation. As a result, 53 research articles using the encoder-decoder approach were selected, focusing only on supervised learning. The main contributions of this systematic review are: (i) to describe the most relevant image description papers implementing an encoder-decoder approach from 2014 to 2022 and (ii) to determine the main architectures, datasets, and metrics that have been applied to image description. MDPI 2023-03-23 /pmc/articles/PMC10138089/ /pubmed/37190341 http://dx.doi.org/10.3390/e25040553 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Review López-Sánchez, Marco Hernández-Ocaña, Betania Chávez-Bosquez, Oscar Hernández-Torruco, José Supervised Deep Learning Techniques for Image Description: A Systematic Review |
title | Supervised Deep Learning Techniques for Image Description: A Systematic Review |
title_full | Supervised Deep Learning Techniques for Image Description: A Systematic Review |
title_fullStr | Supervised Deep Learning Techniques for Image Description: A Systematic Review |
title_full_unstemmed | Supervised Deep Learning Techniques for Image Description: A Systematic Review |
title_short | Supervised Deep Learning Techniques for Image Description: A Systematic Review |
title_sort | supervised deep learning techniques for image description: a systematic review |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10138089/ https://www.ncbi.nlm.nih.gov/pubmed/37190341 http://dx.doi.org/10.3390/e25040553 |
work_keys_str_mv | AT lopezsanchezmarco superviseddeeplearningtechniquesforimagedescriptionasystematicreview AT hernandezocanabetania superviseddeeplearningtechniquesforimagedescriptionasystematicreview AT chavezbosquezoscar superviseddeeplearningtechniquesforimagedescriptionasystematicreview AT hernandeztorrucojose superviseddeeplearningtechniquesforimagedescriptionasystematicreview |