Cargando…

Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters

BACKGROUND: Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general appro...

Descripción completa

Detalles Bibliográficos
Autores principales: Funk, Christopher, Baumgartner, William, Garcia, Benjamin, Roeder, Christophe, Bada, Michael, Cohen, K Bretonnel, Hunter, Lawrence E, Verspoor, Karin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015610/
https://www.ncbi.nlm.nih.gov/pubmed/24571547
http://dx.doi.org/10.1186/1471-2105-15-59
_version_ 1782315363924443136
author Funk, Christopher
Baumgartner, William
Garcia, Benjamin
Roeder, Christophe
Bada, Michael
Cohen, K Bretonnel
Hunter, Lawrence E
Verspoor, Karin
author_facet Funk, Christopher
Baumgartner, William
Garcia, Benjamin
Roeder, Christophe
Bada, Michael
Cohen, K Bretonnel
Hunter, Lawrence E
Verspoor, Karin
author_sort Funk, Christopher
collection PubMed
description BACKGROUND: Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general approach for concept recognition is an open problem. RESULTS: Three dictionary-based systems (MetaMap, NCBO Annotator, and ConceptMapper) are evaluated on eight biomedical ontologies in the Colorado Richly Annotated Full-Text (CRAFT) Corpus. Over 1,000 parameter combinations are examined, and best-performing parameters for each system-ontology pair are presented. CONCLUSIONS: Baselines for concept recognition by three systems on eight biomedical ontologies are established (F-measures range from 0.14–0.83). Out of the three systems we tested, ConceptMapper is generally the best-performing system; it produces the highest F-measure of seven out of eight ontologies. Default parameters are not ideal for most systems on most ontologies; by changing parameters F-measure can be increased by up to 0.4. Not only are best performing parameters presented, but suggestions for choosing the best parameters based on ontology characteristics are presented.
format Online
Article
Text
id pubmed-4015610
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40156102014-05-23 Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters Funk, Christopher Baumgartner, William Garcia, Benjamin Roeder, Christophe Bada, Michael Cohen, K Bretonnel Hunter, Lawrence E Verspoor, Karin BMC Bioinformatics Research Article BACKGROUND: Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general approach for concept recognition is an open problem. RESULTS: Three dictionary-based systems (MetaMap, NCBO Annotator, and ConceptMapper) are evaluated on eight biomedical ontologies in the Colorado Richly Annotated Full-Text (CRAFT) Corpus. Over 1,000 parameter combinations are examined, and best-performing parameters for each system-ontology pair are presented. CONCLUSIONS: Baselines for concept recognition by three systems on eight biomedical ontologies are established (F-measures range from 0.14–0.83). Out of the three systems we tested, ConceptMapper is generally the best-performing system; it produces the highest F-measure of seven out of eight ontologies. Default parameters are not ideal for most systems on most ontologies; by changing parameters F-measure can be increased by up to 0.4. Not only are best performing parameters presented, but suggestions for choosing the best parameters based on ontology characteristics are presented. BioMed Central 2014-02-26 /pmc/articles/PMC4015610/ /pubmed/24571547 http://dx.doi.org/10.1186/1471-2105-15-59 Text en Copyright © 2014 Funk et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Research Article
Funk, Christopher
Baumgartner, William
Garcia, Benjamin
Roeder, Christophe
Bada, Michael
Cohen, K Bretonnel
Hunter, Lawrence E
Verspoor, Karin
Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters
title Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters
title_full Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters
title_fullStr Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters
title_full_unstemmed Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters
title_short Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters
title_sort large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015610/
https://www.ncbi.nlm.nih.gov/pubmed/24571547
http://dx.doi.org/10.1186/1471-2105-15-59
work_keys_str_mv AT funkchristopher largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters
AT baumgartnerwilliam largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters
AT garciabenjamin largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters
AT roederchristophe largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters
AT badamichael largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters
AT cohenkbretonnel largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters
AT hunterlawrencee largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters
AT verspoorkarin largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters