Cargando…

Evaluating the effectiveness of automatic image captioning for web accessibility

The web has become a fundamental tool for carrying out many activities spanning from education to work and private life. For this reason, it must be accessible to every user regardless of any form of impairment or disability. Images on the web are a primary means for communicating information, and s...

Descripción completa

Detalles Bibliográficos
Autores principales:	Leotta, Maurizio, Mori, Fabrizio, Ribaudo, Marina
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer Berlin Heidelberg 2022
Materias:	Long Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395872/ https://www.ncbi.nlm.nih.gov/pubmed/36032410 http://dx.doi.org/10.1007/s10209-022-00906-7

_version_	1784771797701361664
author	Leotta, Maurizio Mori, Fabrizio Ribaudo, Marina
author_facet	Leotta, Maurizio Mori, Fabrizio Ribaudo, Marina
author_sort	Leotta, Maurizio
collection	PubMed
description	The web has become a fundamental tool for carrying out many activities spanning from education to work and private life. For this reason, it must be accessible to every user regardless of any form of impairment or disability. Images on the web are a primary means for communicating information, and specific HTML elements were defined to enrich images with textual descriptions, which can be read aloud by screen readers or rendered by braille displays. A relevant problem is that adding a text describing each image published on a website is a demanding task requiring a non-negligible effort for web developers. Several tools based on machine learning have emerged, which can automatically return descriptions for the images. In this work, we evaluate the correctness of their outputs by comparing the generated descriptions with human-defined references. More specifically, we selected 60 images from Wikipedia and their corresponding descriptions as defined by Wikipedia contributors. We then generated the corresponding descriptions employing four state of the art tools (Azure Computer Vision Engine, Amazon Rekognition, Cloudsight, and Auto Alt-Text for Google Chrome) and asked 76 computer science students to blindly evaluate the perceived correctness of the descriptions without being aware of their source. The results show that the descriptions available in Wikipedia are still perceived as the best ones. However, some tools generate good results for specific categories of images, and they can represent proper candidates for the automated and massive addition of image descriptions to websites, helping to increase the accessibility level of the web drastically.
format	Online Article Text
id	pubmed-9395872
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Springer Berlin Heidelberg
record_format	MEDLINE/PubMed
spelling	pubmed-93958722022-08-23 Evaluating the effectiveness of automatic image captioning for web accessibility Leotta, Maurizio Mori, Fabrizio Ribaudo, Marina Univers Access Inf Soc Long Paper The web has become a fundamental tool for carrying out many activities spanning from education to work and private life. For this reason, it must be accessible to every user regardless of any form of impairment or disability. Images on the web are a primary means for communicating information, and specific HTML elements were defined to enrich images with textual descriptions, which can be read aloud by screen readers or rendered by braille displays. A relevant problem is that adding a text describing each image published on a website is a demanding task requiring a non-negligible effort for web developers. Several tools based on machine learning have emerged, which can automatically return descriptions for the images. In this work, we evaluate the correctness of their outputs by comparing the generated descriptions with human-defined references. More specifically, we selected 60 images from Wikipedia and their corresponding descriptions as defined by Wikipedia contributors. We then generated the corresponding descriptions employing four state of the art tools (Azure Computer Vision Engine, Amazon Rekognition, Cloudsight, and Auto Alt-Text for Google Chrome) and asked 76 computer science students to blindly evaluate the perceived correctness of the descriptions without being aware of their source. The results show that the descriptions available in Wikipedia are still perceived as the best ones. However, some tools generate good results for specific categories of images, and they can represent proper candidates for the automated and massive addition of image descriptions to websites, helping to increase the accessibility level of the web drastically. Springer Berlin Heidelberg 2022-08-22 /pmc/articles/PMC9395872/ /pubmed/36032410 http://dx.doi.org/10.1007/s10209-022-00906-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/ Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Long Paper Leotta, Maurizio Mori, Fabrizio Ribaudo, Marina Evaluating the effectiveness of automatic image captioning for web accessibility
title	Evaluating the effectiveness of automatic image captioning for web accessibility
title_full	Evaluating the effectiveness of automatic image captioning for web accessibility
title_fullStr	Evaluating the effectiveness of automatic image captioning for web accessibility
title_full_unstemmed	Evaluating the effectiveness of automatic image captioning for web accessibility
title_short	Evaluating the effectiveness of automatic image captioning for web accessibility
title_sort	evaluating the effectiveness of automatic image captioning for web accessibility
topic	Long Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395872/ https://www.ncbi.nlm.nih.gov/pubmed/36032410 http://dx.doi.org/10.1007/s10209-022-00906-7
work_keys_str_mv	AT leottamaurizio evaluatingtheeffectivenessofautomaticimagecaptioningforwebaccessibility AT morifabrizio evaluatingtheeffectivenessofautomaticimagecaptioningforwebaccessibility AT ribaudomarina evaluatingtheeffectivenessofautomaticimagecaptioningforwebaccessibility

Evaluating the effectiveness of automatic image captioning for web accessibility

Ejemplares similares