Cargando…

Evaluating the effectiveness of automatic image captioning for web accessibility

The web has become a fundamental tool for carrying out many activities spanning from education to work and private life. For this reason, it must be accessible to every user regardless of any form of impairment or disability. Images on the web are a primary means for communicating information, and s...

Descripción completa

Detalles Bibliográficos
Autores principales: Leotta, Maurizio, Mori, Fabrizio, Ribaudo, Marina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395872/
https://www.ncbi.nlm.nih.gov/pubmed/36032410
http://dx.doi.org/10.1007/s10209-022-00906-7
_version_ 1784771797701361664
author Leotta, Maurizio
Mori, Fabrizio
Ribaudo, Marina
author_facet Leotta, Maurizio
Mori, Fabrizio
Ribaudo, Marina
author_sort Leotta, Maurizio
collection PubMed
description The web has become a fundamental tool for carrying out many activities spanning from education to work and private life. For this reason, it must be accessible to every user regardless of any form of impairment or disability. Images on the web are a primary means for communicating information, and specific HTML elements were defined to enrich images with textual descriptions, which can be read aloud by screen readers or rendered by braille displays. A relevant problem is that adding a text describing each image published on a website is a demanding task requiring a non-negligible effort for web developers. Several tools based on machine learning have emerged, which can automatically return descriptions for the images. In this work, we evaluate the correctness of their outputs by comparing the generated descriptions with human-defined references. More specifically, we selected 60 images from Wikipedia and their corresponding descriptions as defined by Wikipedia contributors. We then generated the corresponding descriptions employing four state of the art tools (Azure Computer Vision Engine, Amazon Rekognition, Cloudsight, and Auto Alt-Text for Google Chrome) and asked 76 computer science students to blindly evaluate the perceived correctness of the descriptions without being aware of their source. The results show that the descriptions available in Wikipedia are still perceived as the best ones. However, some tools generate good results for specific categories of images, and they can represent proper candidates for the automated and massive addition of image descriptions to websites, helping to increase the accessibility level of the web drastically.
format Online
Article
Text
id pubmed-9395872
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-93958722022-08-23 Evaluating the effectiveness of automatic image captioning for web accessibility Leotta, Maurizio Mori, Fabrizio Ribaudo, Marina Univers Access Inf Soc Long Paper The web has become a fundamental tool for carrying out many activities spanning from education to work and private life. For this reason, it must be accessible to every user regardless of any form of impairment or disability. Images on the web are a primary means for communicating information, and specific HTML elements were defined to enrich images with textual descriptions, which can be read aloud by screen readers or rendered by braille displays. A relevant problem is that adding a text describing each image published on a website is a demanding task requiring a non-negligible effort for web developers. Several tools based on machine learning have emerged, which can automatically return descriptions for the images. In this work, we evaluate the correctness of their outputs by comparing the generated descriptions with human-defined references. More specifically, we selected 60 images from Wikipedia and their corresponding descriptions as defined by Wikipedia contributors. We then generated the corresponding descriptions employing four state of the art tools (Azure Computer Vision Engine, Amazon Rekognition, Cloudsight, and Auto Alt-Text for Google Chrome) and asked 76 computer science students to blindly evaluate the perceived correctness of the descriptions without being aware of their source. The results show that the descriptions available in Wikipedia are still perceived as the best ones. However, some tools generate good results for specific categories of images, and they can represent proper candidates for the automated and massive addition of image descriptions to websites, helping to increase the accessibility level of the web drastically. Springer Berlin Heidelberg 2022-08-22 /pmc/articles/PMC9395872/ /pubmed/36032410 http://dx.doi.org/10.1007/s10209-022-00906-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/ Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Long Paper
Leotta, Maurizio
Mori, Fabrizio
Ribaudo, Marina
Evaluating the effectiveness of automatic image captioning for web accessibility
title Evaluating the effectiveness of automatic image captioning for web accessibility
title_full Evaluating the effectiveness of automatic image captioning for web accessibility
title_fullStr Evaluating the effectiveness of automatic image captioning for web accessibility
title_full_unstemmed Evaluating the effectiveness of automatic image captioning for web accessibility
title_short Evaluating the effectiveness of automatic image captioning for web accessibility
title_sort evaluating the effectiveness of automatic image captioning for web accessibility
topic Long Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395872/
https://www.ncbi.nlm.nih.gov/pubmed/36032410
http://dx.doi.org/10.1007/s10209-022-00906-7
work_keys_str_mv AT leottamaurizio evaluatingtheeffectivenessofautomaticimagecaptioningforwebaccessibility
AT morifabrizio evaluatingtheeffectivenessofautomaticimagecaptioningforwebaccessibility
AT ribaudomarina evaluatingtheeffectivenessofautomaticimagecaptioningforwebaccessibility