Cargando…

Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters

BACKGROUND: Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general appro...

Descripción completa

Detalles Bibliográficos
Autores principales:	Funk, Christopher, Baumgartner, William, Garcia, Benjamin, Roeder, Christophe, Bada, Michael, Cohen, K Bretonnel, Hunter, Lawrence E, Verspoor, Karin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2014
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015610/ https://www.ncbi.nlm.nih.gov/pubmed/24571547 http://dx.doi.org/10.1186/1471-2105-15-59

Descripción
Sumario:	BACKGROUND: Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general approach for concept recognition is an open problem. RESULTS: Three dictionary-based systems (MetaMap, NCBO Annotator, and ConceptMapper) are evaluated on eight biomedical ontologies in the Colorado Richly Annotated Full-Text (CRAFT) Corpus. Over 1,000 parameter combinations are examined, and best-performing parameters for each system-ontology pair are presented. CONCLUSIONS: Baselines for concept recognition by three systems on eight biomedical ontologies are established (F-measures range from 0.14–0.83). Out of the three systems we tested, ConceptMapper is generally the best-performing system; it produces the highest F-measure of seven out of eight ontologies. Default parameters are not ideal for most systems on most ontologies; by changing parameters F-measure can be increased by up to 0.4. Not only are best performing parameters presented, but suggestions for choosing the best parameters based on ontology characteristics are presented.

Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters

Ejemplares similares