Cargando…

Building a biomedical tokenizer using the token lattice design pattern and the adapted Viterbi algorithm

BACKGROUND: Tokenization is an important component of language processing yet there is no widely accepted tokenization method for English texts, including biomedical texts. Other than rule based techniques, tokenization in the biomedical domain has been regarded as a classification task. Biomedical...

Descripción completa

Detalles Bibliográficos
Autores principales: Barrett, Neil, Weber-Jahnke, Jens
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3111587/
https://www.ncbi.nlm.nih.gov/pubmed/21658288
http://dx.doi.org/10.1186/1471-2105-12-S3-S1