Cargando…
Layout-aware text extraction from full-text PDF of scientific articles
BACKGROUND: The Portable Document Format (PDF) is the most commonly used file format for online scientific publications. The absence of effective means to extract text from these PDF files in a layout-aware manner presents a significant challenge for developers of biomedical text mining or biocurati...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3441580/ https://www.ncbi.nlm.nih.gov/pubmed/22640904 http://dx.doi.org/10.1186/1751-0473-7-7 |