Cargando…

A new method of mark detection for software-based optical mark recognition

Software optical mark recognition (SOMR) is the process whereby information entered on a survey form or questionnaire is converted using specialized software into a machine-readable format. SOMR normally requires input fields to be completely darkened, have no internal labels, or be filled with a so...

Descripción completa

Detalles Bibliográficos
Autores principales: Loke, Seng Cheong, Kasmiran, Khairul A., Haron, Sharifah A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6226159/
https://www.ncbi.nlm.nih.gov/pubmed/30412588
http://dx.doi.org/10.1371/journal.pone.0206420
_version_ 1783369911274307584
author Loke, Seng Cheong
Kasmiran, Khairul A.
Haron, Sharifah A.
author_facet Loke, Seng Cheong
Kasmiran, Khairul A.
Haron, Sharifah A.
author_sort Loke, Seng Cheong
collection PubMed
description Software optical mark recognition (SOMR) is the process whereby information entered on a survey form or questionnaire is converted using specialized software into a machine-readable format. SOMR normally requires input fields to be completely darkened, have no internal labels, or be filled with a soft pencil, otherwise mark detection will be inaccurate. Forms can also have print and scan artefacts that further increase the error rate. This article presents a new method of mark detection that improves over existing techniques based on pixel counting and simple thresholding. Its main advantage is that it can be used under a variety of conditions and yet maintain a high level of accuracy that is sufficient for scientific applications. Field testing shows no software misclassification in 5695 samples filled by trained personnel, and only two misclassifications in 6000 samples filled by untrained respondents. Sensitivity, specificity, and accuracy were 99.73%, 99.98%, and 99.94% respectively, even in the presence of print and scan artefacts, which was superior to other methods tested. A separate direct comparison for mark detection showed a sensitivity, specificity, and accuracy respectively of 99.7%, 100.0%, 100.0% (new method), 96.3%, 96.0%, 96.1% (pixel counting), and 99.9%, 99.8%, 99.8% (simple thresholding) on clean forms, and 100.0%, 99.1%, 99.3% (new method), 98.4%, 95.6%, 96.2% (pixel counting), 100.0%, 38.3%, 51.4% (simple thresholding) on forms with print artefacts. This method is designed for bubble and box fields, while other types such as handwriting fields require separate error control measures.
format Online
Article
Text
id pubmed-6226159
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-62261592018-11-19 A new method of mark detection for software-based optical mark recognition Loke, Seng Cheong Kasmiran, Khairul A. Haron, Sharifah A. PLoS One Research Article Software optical mark recognition (SOMR) is the process whereby information entered on a survey form or questionnaire is converted using specialized software into a machine-readable format. SOMR normally requires input fields to be completely darkened, have no internal labels, or be filled with a soft pencil, otherwise mark detection will be inaccurate. Forms can also have print and scan artefacts that further increase the error rate. This article presents a new method of mark detection that improves over existing techniques based on pixel counting and simple thresholding. Its main advantage is that it can be used under a variety of conditions and yet maintain a high level of accuracy that is sufficient for scientific applications. Field testing shows no software misclassification in 5695 samples filled by trained personnel, and only two misclassifications in 6000 samples filled by untrained respondents. Sensitivity, specificity, and accuracy were 99.73%, 99.98%, and 99.94% respectively, even in the presence of print and scan artefacts, which was superior to other methods tested. A separate direct comparison for mark detection showed a sensitivity, specificity, and accuracy respectively of 99.7%, 100.0%, 100.0% (new method), 96.3%, 96.0%, 96.1% (pixel counting), and 99.9%, 99.8%, 99.8% (simple thresholding) on clean forms, and 100.0%, 99.1%, 99.3% (new method), 98.4%, 95.6%, 96.2% (pixel counting), 100.0%, 38.3%, 51.4% (simple thresholding) on forms with print artefacts. This method is designed for bubble and box fields, while other types such as handwriting fields require separate error control measures. Public Library of Science 2018-11-09 /pmc/articles/PMC6226159/ /pubmed/30412588 http://dx.doi.org/10.1371/journal.pone.0206420 Text en © 2018 Loke et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Loke, Seng Cheong
Kasmiran, Khairul A.
Haron, Sharifah A.
A new method of mark detection for software-based optical mark recognition
title A new method of mark detection for software-based optical mark recognition
title_full A new method of mark detection for software-based optical mark recognition
title_fullStr A new method of mark detection for software-based optical mark recognition
title_full_unstemmed A new method of mark detection for software-based optical mark recognition
title_short A new method of mark detection for software-based optical mark recognition
title_sort new method of mark detection for software-based optical mark recognition
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6226159/
https://www.ncbi.nlm.nih.gov/pubmed/30412588
http://dx.doi.org/10.1371/journal.pone.0206420
work_keys_str_mv AT lokesengcheong anewmethodofmarkdetectionforsoftwarebasedopticalmarkrecognition
AT kasmirankhairula anewmethodofmarkdetectionforsoftwarebasedopticalmarkrecognition
AT haronsharifaha anewmethodofmarkdetectionforsoftwarebasedopticalmarkrecognition
AT lokesengcheong newmethodofmarkdetectionforsoftwarebasedopticalmarkrecognition
AT kasmirankhairula newmethodofmarkdetectionforsoftwarebasedopticalmarkrecognition
AT haronsharifaha newmethodofmarkdetectionforsoftwarebasedopticalmarkrecognition