MiBio: A dataset for OCR post-processing evaluation
We introduce a dataset for evaluating OCR post-processing models. The dataset contains fully aligned OCR texts and the ground-truth recognition texts of an English biodiversity book. To better support benchmark evaluation, we extracted the following information in TSV files: 1) 2907 OCR-generated er...
Main authors: | Mei, Jie; Islam, Aminul; Moh’d, Abidalrahman; Wu, Yajing; Milios, Evangelos E. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2018 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6197712/ https://www.ncbi.nlm.nih.gov/pubmed/30364639 http://dx.doi.org/10.1016/j.dib.2018.08.099 |
Similar items
- OCR ICT for A2
  by: Stuart, Sonia, et al.
  Published: (2013)
- Summer Student Project 2010: The gridsubmit package & using grid for OCRing of documents
  by: Sompolski, J
  Published: (2010)
- Assessing the Impact of OCR Errors in Information Retrieval
  by: Bazzo, Guilherme Torresan, et al.
  Published: (2020)
- Exploring the DNA mimicry of the Ocr protein of phage T7
  by: Roberts, Gareth A., et al.
  Published: (2012)
- The use of Optical Character Recognition (OCR) in the digitisation of herbarium specimen labels
  by: Drinkwater, Robyn E., et al.
  Published: (2014)