Cargando…

Assessing the Impact of OCR Errors in Information Retrieval

A significant amount of the textual content available on the Web is stored in PDF files. These files are typically converted into plain text before they can be processed by information retrieval or text mining systems. Automatic conversion typically introduces various errors, especially if OCR is ne...

Descripción completa

Detalles Bibliográficos
Autores principales: Bazzo, Guilherme Torresan, Lorentz, Gustavo Acauan, Suarez Vargas, Danny, Moreira, Viviane P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148068/
http://dx.doi.org/10.1007/978-3-030-45442-5_13