Layout-aware text extraction from full-text PDF of scientific articles

BACKGROUND: The Portable Document Format (PDF) is the most commonly used file format for online scientific publications. The absence of effective means to extract text from these PDF files in a layout-aware manner presents a significant challenge for developers of biomedical text mining or biocurati...

Bibliographic Details
Main authors: Ramakrishnan, Cartic; Patnia, Abhishek; Hovy, Eduard; Burns, Gully APC
Format: Online Article Text
Language: English
Published: BioMed Central 2012
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3441580/
https://www.ncbi.nlm.nih.gov/pubmed/22640904
http://dx.doi.org/10.1186/1751-0473-7-7