Cargando…
Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users
We present a study with seven blind participants using three different mobile OCR apps to find text posted in various indoor environments. The first app considered was Microsoft SeeingAI in its Short Text mode, which reads any text in sight with a minimalistic interface. The second app was Spot+OCR,...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6824725/ https://www.ncbi.nlm.nih.gov/pubmed/31681911 http://dx.doi.org/10.1145/3301275.3302271 |
_version_ | 1783464784619896832 |
---|---|
author | Neat, Leo Peng, Ren Qin, Siyang Manduchi, Roberto |
author_facet | Neat, Leo Peng, Ren Qin, Siyang Manduchi, Roberto |
author_sort | Neat, Leo |
collection | PubMed |
description | We present a study with seven blind participants using three different mobile OCR apps to find text posted in various indoor environments. The first app considered was Microsoft SeeingAI in its Short Text mode, which reads any text in sight with a minimalistic interface. The second app was Spot+OCR, a custom application that separates the task of text detection from OCR proper. Upon detection of text in the image, Spot+OCR generates a short vibration; as soon as the user stabilizes the phone, a high-resolution snapshot is taken and OCR-processed. The third app, Guided OCR, was designed to guide the user in taking several pictures in a 360° span at the maximum resolution available by the camera, with minimum overlap between pictures. Quantitative results (in terms of true positive ratios and traversal speed) were recorded. Along with the qualitative observation and outcomes from an exit survey, these results allow us to identify and assess the different strategies used by our participants, as well as the challenges of operating these systems without sight. |
format | Online Article Text |
id | pubmed-6824725 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
record_format | MEDLINE/PubMed |
spelling | pubmed-68247252019-11-01 Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users Neat, Leo Peng, Ren Qin, Siyang Manduchi, Roberto IUI Article We present a study with seven blind participants using three different mobile OCR apps to find text posted in various indoor environments. The first app considered was Microsoft SeeingAI in its Short Text mode, which reads any text in sight with a minimalistic interface. The second app was Spot+OCR, a custom application that separates the task of text detection from OCR proper. Upon detection of text in the image, Spot+OCR generates a short vibration; as soon as the user stabilizes the phone, a high-resolution snapshot is taken and OCR-processed. The third app, Guided OCR, was designed to guide the user in taking several pictures in a 360° span at the maximum resolution available by the camera, with minimum overlap between pictures. Quantitative results (in terms of true positive ratios and traversal speed) were recorded. Along with the qualitative observation and outcomes from an exit survey, these results allow us to identify and assess the different strategies used by our participants, as well as the challenges of operating these systems without sight. 2019-03 /pmc/articles/PMC6824725/ /pubmed/31681911 http://dx.doi.org/10.1145/3301275.3302271 Text en http://creativecommons.org/licenses/by/4.0/ Publication rights licensed to ACM. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org. |
spellingShingle | Article Neat, Leo Peng, Ren Qin, Siyang Manduchi, Roberto Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users |
title | Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users |
title_full | Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users |
title_fullStr | Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users |
title_full_unstemmed | Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users |
title_short | Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users |
title_sort | scene text access: a comparison of mobile ocr modalities for blind users |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6824725/ https://www.ncbi.nlm.nih.gov/pubmed/31681911 http://dx.doi.org/10.1145/3301275.3302271 |
work_keys_str_mv | AT neatleo scenetextaccessacomparisonofmobileocrmodalitiesforblindusers AT pengren scenetextaccessacomparisonofmobileocrmodalitiesforblindusers AT qinsiyang scenetextaccessacomparisonofmobileocrmodalitiesforblindusers AT manduchiroberto scenetextaccessacomparisonofmobileocrmodalitiesforblindusers |