Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users

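We present a study with seven blind participants using three different mobile OCR apps to find text posted in various indoor environments. The first app considered was Microsoft SeeingAI in its Short Text mode, which reads any text in sight with a minimalistic interface. The second app was Spot+OCR, a custom application that separates the task of text detection from OCR proper. Upon detection of text in the image, Spot+OCR generates a short vibration; as soon as the user stabilizes the phone, a high-resolution snapshot is taken and OCR-processed. The third app, Guided OCR, was designed to guide the user in taking several pictures in a 360° span at the maximum resolution available by the camera, with minimum overlap between pictures. Quantitative results (in terms of true positive ratios and traversal speed) were recorded. Along with the qualitative observation and outcomes from an exit survey, these results allow us to identify and assess the different strategies used by our participants, as well as the challenges of operating these systems without sight.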

Bibliographic Details
Main Authors:	Neat, Leo; Peng, Ren; Qin, Siyang; Manduchi, Roberto
Format:	Online Article Text
Language:	English
Published:	2019
Subjects:	Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6824725/ https://www.ncbi.nlm.nih.gov/pubmed/31681911 http://dx.doi.org/10.1145/3301275.3302271

_version_	1783464784619896832
author	Neat, Leo Peng, Ren Qin, Siyang Manduchi, Roberto
author_facet	Neat, Leo Peng, Ren Qin, Siyang Manduchi, Roberto
author_sort	Neat, Leo
collection	PubMed
description	We present a study with seven blind participants using three different mobile OCR apps to find text posted in various indoor environments. The first app considered was Microsoft SeeingAI in its Short Text mode, which reads any text in sight with a minimalistic interface. The second app was Spot+OCR, a custom application that separates the task of text detection from OCR proper. Upon detection of text in the image, Spot+OCR generates a short vibration; as soon as the user stabilizes the phone, a high-resolution snapshot is taken and OCR-processed. The third app, Guided OCR, was designed to guide the user in taking several pictures in a 360° span at the maximum resolution available by the camera, with minimum overlap between pictures. Quantitative results (in terms of true positive ratios and traversal speed) were recorded. Along with the qualitative observation and outcomes from an exit survey, these results allow us to identify and assess the different strategies used by our participants, as well as the challenges of operating these systems without sight.
format	Online Article Text
id	pubmed-6824725
institution	National Center for Biotechnology Information
language	English
publishDate	2019
record_format	MEDLINE/PubMed
spelling	pubmed-6824725 2019-11-01 Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users Neat, Leo Peng, Ren Qin, Siyang Manduchi, Roberto IUI Article We present a study with seven blind participants using three different mobile OCR apps to find text posted in various indoor environments. The first app considered was Microsoft SeeingAI in its Short Text mode, which reads any text in sight with a minimalistic interface. The second app was Spot+OCR, a custom application that separates the task of text detection from OCR proper. Upon detection of text in the image, Spot+OCR generates a short vibration; as soon as the user stabilizes the phone, a high-resolution snapshot is taken and OCR-processed. The third app, Guided OCR, was designed to guide the user in taking several pictures in a 360° span at the maximum resolution available by the camera, with minimum overlap between pictures. Quantitative results (in terms of true positive ratios and traversal speed) were recorded. Along with the qualitative observation and outcomes from an exit survey, these results allow us to identify and assess the different strategies used by our participants, as well as the challenges of operating these systems without sight. 2019-03 /pmc/articles/PMC6824725/ /pubmed/31681911 http://dx.doi.org/10.1145/3301275.3302271 Text en http://creativecommons.org/licenses/by/4.0/ Publication rights licensed to ACM. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org.
spellingShingle	Article Neat, Leo Peng, Ren Qin, Siyang Manduchi, Roberto Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users
title	Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users
title_full	Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users
title_fullStr	Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users
title_full_unstemmed	Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users
title_short	Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users
title_sort	scene text access: a comparison of mobile ocr modalities for blind users
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6824725/ https://www.ncbi.nlm.nih.gov/pubmed/31681911 http://dx.doi.org/10.1145/3301275.3302271
work_keys_str_mv	AT neatleo scenetextaccessacomparisonofmobileocrmodalitiesforblindusers AT pengren scenetextaccessacomparisonofmobileocrmodalitiesforblindusers AT qinsiyang scenetextaccessacomparisonofmobileocrmodalitiesforblindusers AT manduchiroberto scenetextaccessacomparisonofmobileocrmodalitiesforblindusers

Similar Items