Cargando…
Evaluating the effectiveness of automatic image captioning for web accessibility
The web has become a fundamental tool for carrying out many activities spanning from education to work and private life. For this reason, it must be accessible to every user regardless of any form of impairment or disability. Images on the web are a primary means for communicating information, and s...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395872/ https://www.ncbi.nlm.nih.gov/pubmed/36032410 http://dx.doi.org/10.1007/s10209-022-00906-7 |
_version_ | 1784771797701361664 |
---|---|
author | Leotta, Maurizio Mori, Fabrizio Ribaudo, Marina |
author_facet | Leotta, Maurizio Mori, Fabrizio Ribaudo, Marina |
author_sort | Leotta, Maurizio |
collection | PubMed |
description | The web has become a fundamental tool for carrying out many activities spanning from education to work and private life. For this reason, it must be accessible to every user regardless of any form of impairment or disability. Images on the web are a primary means for communicating information, and specific HTML elements were defined to enrich images with textual descriptions, which can be read aloud by screen readers or rendered by braille displays. A relevant problem is that adding a text describing each image published on a website is a demanding task requiring a non-negligible effort for web developers. Several tools based on machine learning have emerged, which can automatically return descriptions for the images. In this work, we evaluate the correctness of their outputs by comparing the generated descriptions with human-defined references. More specifically, we selected 60 images from Wikipedia and their corresponding descriptions as defined by Wikipedia contributors. We then generated the corresponding descriptions employing four state of the art tools (Azure Computer Vision Engine, Amazon Rekognition, Cloudsight, and Auto Alt-Text for Google Chrome) and asked 76 computer science students to blindly evaluate the perceived correctness of the descriptions without being aware of their source. The results show that the descriptions available in Wikipedia are still perceived as the best ones. However, some tools generate good results for specific categories of images, and they can represent proper candidates for the automated and massive addition of image descriptions to websites, helping to increase the accessibility level of the web drastically. |
format | Online Article Text |
id | pubmed-9395872 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-93958722022-08-23 Evaluating the effectiveness of automatic image captioning for web accessibility Leotta, Maurizio Mori, Fabrizio Ribaudo, Marina Univers Access Inf Soc Long Paper The web has become a fundamental tool for carrying out many activities spanning from education to work and private life. For this reason, it must be accessible to every user regardless of any form of impairment or disability. Images on the web are a primary means for communicating information, and specific HTML elements were defined to enrich images with textual descriptions, which can be read aloud by screen readers or rendered by braille displays. A relevant problem is that adding a text describing each image published on a website is a demanding task requiring a non-negligible effort for web developers. Several tools based on machine learning have emerged, which can automatically return descriptions for the images. In this work, we evaluate the correctness of their outputs by comparing the generated descriptions with human-defined references. More specifically, we selected 60 images from Wikipedia and their corresponding descriptions as defined by Wikipedia contributors. We then generated the corresponding descriptions employing four state of the art tools (Azure Computer Vision Engine, Amazon Rekognition, Cloudsight, and Auto Alt-Text for Google Chrome) and asked 76 computer science students to blindly evaluate the perceived correctness of the descriptions without being aware of their source. The results show that the descriptions available in Wikipedia are still perceived as the best ones. However, some tools generate good results for specific categories of images, and they can represent proper candidates for the automated and massive addition of image descriptions to websites, helping to increase the accessibility level of the web drastically. Springer Berlin Heidelberg 2022-08-22 /pmc/articles/PMC9395872/ /pubmed/36032410 http://dx.doi.org/10.1007/s10209-022-00906-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/ Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Long Paper Leotta, Maurizio Mori, Fabrizio Ribaudo, Marina Evaluating the effectiveness of automatic image captioning for web accessibility |
title | Evaluating the effectiveness of automatic image captioning for web accessibility |
title_full | Evaluating the effectiveness of automatic image captioning for web accessibility |
title_fullStr | Evaluating the effectiveness of automatic image captioning for web accessibility |
title_full_unstemmed | Evaluating the effectiveness of automatic image captioning for web accessibility |
title_short | Evaluating the effectiveness of automatic image captioning for web accessibility |
title_sort | evaluating the effectiveness of automatic image captioning for web accessibility |
topic | Long Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395872/ https://www.ncbi.nlm.nih.gov/pubmed/36032410 http://dx.doi.org/10.1007/s10209-022-00906-7 |
work_keys_str_mv | AT leottamaurizio evaluatingtheeffectivenessofautomaticimagecaptioningforwebaccessibility AT morifabrizio evaluatingtheeffectivenessofautomaticimagecaptioningforwebaccessibility AT ribaudomarina evaluatingtheeffectivenessofautomaticimagecaptioningforwebaccessibility |