Cargando…

Arabic Captioning for Images of Clothing Using Deep Learning

Fashion is one of the many fields of application that image captioning is being used in. For e-commerce websites holding tens of thousands of images of clothing, automated item descriptions are quite desirable. This paper addresses captioning images of clothing in the Arabic language using deep lear...

Descripción completa

Detalles Bibliográficos
Autores principales:	Al-Malki, Rasha Saleh, Al-Aama, Arwa Yousuf
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10144643/ https://www.ncbi.nlm.nih.gov/pubmed/37112124 http://dx.doi.org/10.3390/s23083783

_version_	1785034147290415104
author	Al-Malki, Rasha Saleh Al-Aama, Arwa Yousuf
author_facet	Al-Malki, Rasha Saleh Al-Aama, Arwa Yousuf
author_sort	Al-Malki, Rasha Saleh
collection	PubMed
description	Fashion is one of the many fields of application that image captioning is being used in. For e-commerce websites holding tens of thousands of images of clothing, automated item descriptions are quite desirable. This paper addresses captioning images of clothing in the Arabic language using deep learning. Image captioning systems are based on Computer Vision and Natural Language Processing techniques because visual and textual understanding is needed for these systems. Many approaches have been proposed to build such systems. The most widely used methods are deep learning methods which use the image model to analyze the visual content of the image, and the language model to generate the caption. Generating the caption in the English language using deep learning algorithms received great attention from many researchers in their research, but there is still a gap in generating the caption in the Arabic language because public datasets are often not available in the Arabic language. In this work, we created an Arabic dataset for captioning images of clothing which we named “ArabicFashionData” because this model is the first model for captioning images of clothing in the Arabic language. Moreover, we classified the attributes of the images of clothing and used them as inputs to the decoder of our image captioning model to enhance Arabic caption quality. In addition, we used the attention mechanism. Our approach achieved a BLEU-1 score of 88.52. The experiment findings are encouraging and suggest that, with a bigger dataset, the attributes-based image captioning model can achieve excellent results for Arabic image captioning.
format	Online Article Text
id	pubmed-10144643
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-101446432023-04-29 Arabic Captioning for Images of Clothing Using Deep Learning Al-Malki, Rasha Saleh Al-Aama, Arwa Yousuf Sensors (Basel) Article Fashion is one of the many fields of application that image captioning is being used in. For e-commerce websites holding tens of thousands of images of clothing, automated item descriptions are quite desirable. This paper addresses captioning images of clothing in the Arabic language using deep learning. Image captioning systems are based on Computer Vision and Natural Language Processing techniques because visual and textual understanding is needed for these systems. Many approaches have been proposed to build such systems. The most widely used methods are deep learning methods which use the image model to analyze the visual content of the image, and the language model to generate the caption. Generating the caption in the English language using deep learning algorithms received great attention from many researchers in their research, but there is still a gap in generating the caption in the Arabic language because public datasets are often not available in the Arabic language. In this work, we created an Arabic dataset for captioning images of clothing which we named “ArabicFashionData” because this model is the first model for captioning images of clothing in the Arabic language. Moreover, we classified the attributes of the images of clothing and used them as inputs to the decoder of our image captioning model to enhance Arabic caption quality. In addition, we used the attention mechanism. Our approach achieved a BLEU-1 score of 88.52. The experiment findings are encouraging and suggest that, with a bigger dataset, the attributes-based image captioning model can achieve excellent results for Arabic image captioning. MDPI 2023-04-07 /pmc/articles/PMC10144643/ /pubmed/37112124 http://dx.doi.org/10.3390/s23083783 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Al-Malki, Rasha Saleh Al-Aama, Arwa Yousuf Arabic Captioning for Images of Clothing Using Deep Learning
title	Arabic Captioning for Images of Clothing Using Deep Learning
title_full	Arabic Captioning for Images of Clothing Using Deep Learning
title_fullStr	Arabic Captioning for Images of Clothing Using Deep Learning
title_full_unstemmed	Arabic Captioning for Images of Clothing Using Deep Learning
title_short	Arabic Captioning for Images of Clothing Using Deep Learning
title_sort	arabic captioning for images of clothing using deep learning
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10144643/ https://www.ncbi.nlm.nih.gov/pubmed/37112124 http://dx.doi.org/10.3390/s23083783
work_keys_str_mv	AT almalkirashasaleh arabiccaptioningforimagesofclothingusingdeeplearning AT alaamaarwayousuf arabiccaptioningforimagesofclothingusingdeeplearning

Arabic Captioning for Images of Clothing Using Deep Learning

Ejemplares similares