Cargando…

Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users

We present a study with seven blind participants using three different mobile OCR apps to find text posted in various indoor environments. The first app considered was Microsoft SeeingAI in its Short Text mode, which reads any text in sight with a minimalistic interface. The second app was Spot+OCR,...

Descripción completa

Detalles Bibliográficos
Autores principales: Neat, Leo, Peng, Ren, Qin, Siyang, Manduchi, Roberto
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6824725/
https://www.ncbi.nlm.nih.gov/pubmed/31681911
http://dx.doi.org/10.1145/3301275.3302271
_version_ 1783464784619896832
author Neat, Leo
Peng, Ren
Qin, Siyang
Manduchi, Roberto
author_facet Neat, Leo
Peng, Ren
Qin, Siyang
Manduchi, Roberto
author_sort Neat, Leo
collection PubMed
description We present a study with seven blind participants using three different mobile OCR apps to find text posted in various indoor environments. The first app considered was Microsoft SeeingAI in its Short Text mode, which reads any text in sight with a minimalistic interface. The second app was Spot+OCR, a custom application that separates the task of text detection from OCR proper. Upon detection of text in the image, Spot+OCR generates a short vibration; as soon as the user stabilizes the phone, a high-resolution snapshot is taken and OCR-processed. The third app, Guided OCR, was designed to guide the user in taking several pictures in a 360° span at the maximum resolution available by the camera, with minimum overlap between pictures. Quantitative results (in terms of true positive ratios and traversal speed) were recorded. Along with the qualitative observation and outcomes from an exit survey, these results allow us to identify and assess the different strategies used by our participants, as well as the challenges of operating these systems without sight.
format Online
Article
Text
id pubmed-6824725
institution National Center for Biotechnology Information
language English
publishDate 2019
record_format MEDLINE/PubMed
spelling pubmed-68247252019-11-01 Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users Neat, Leo Peng, Ren Qin, Siyang Manduchi, Roberto IUI Article We present a study with seven blind participants using three different mobile OCR apps to find text posted in various indoor environments. The first app considered was Microsoft SeeingAI in its Short Text mode, which reads any text in sight with a minimalistic interface. The second app was Spot+OCR, a custom application that separates the task of text detection from OCR proper. Upon detection of text in the image, Spot+OCR generates a short vibration; as soon as the user stabilizes the phone, a high-resolution snapshot is taken and OCR-processed. The third app, Guided OCR, was designed to guide the user in taking several pictures in a 360° span at the maximum resolution available by the camera, with minimum overlap between pictures. Quantitative results (in terms of true positive ratios and traversal speed) were recorded. Along with the qualitative observation and outcomes from an exit survey, these results allow us to identify and assess the different strategies used by our participants, as well as the challenges of operating these systems without sight. 2019-03 /pmc/articles/PMC6824725/ /pubmed/31681911 http://dx.doi.org/10.1145/3301275.3302271 Text en http://creativecommons.org/licenses/by/4.0/ Publication rights licensed to ACM. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org.
spellingShingle Article
Neat, Leo
Peng, Ren
Qin, Siyang
Manduchi, Roberto
Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users
title Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users
title_full Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users
title_fullStr Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users
title_full_unstemmed Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users
title_short Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users
title_sort scene text access: a comparison of mobile ocr modalities for blind users
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6824725/
https://www.ncbi.nlm.nih.gov/pubmed/31681911
http://dx.doi.org/10.1145/3301275.3302271
work_keys_str_mv AT neatleo scenetextaccessacomparisonofmobileocrmodalitiesforblindusers
AT pengren scenetextaccessacomparisonofmobileocrmodalitiesforblindusers
AT qinsiyang scenetextaccessacomparisonofmobileocrmodalitiesforblindusers
AT manduchiroberto scenetextaccessacomparisonofmobileocrmodalitiesforblindusers