Cargando…
Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters
BACKGROUND: Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general appro...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015610/ https://www.ncbi.nlm.nih.gov/pubmed/24571547 http://dx.doi.org/10.1186/1471-2105-15-59 |
_version_ | 1782315363924443136 |
---|---|
author | Funk, Christopher Baumgartner, William Garcia, Benjamin Roeder, Christophe Bada, Michael Cohen, K Bretonnel Hunter, Lawrence E Verspoor, Karin |
author_facet | Funk, Christopher Baumgartner, William Garcia, Benjamin Roeder, Christophe Bada, Michael Cohen, K Bretonnel Hunter, Lawrence E Verspoor, Karin |
author_sort | Funk, Christopher |
collection | PubMed |
description | BACKGROUND: Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general approach for concept recognition is an open problem. RESULTS: Three dictionary-based systems (MetaMap, NCBO Annotator, and ConceptMapper) are evaluated on eight biomedical ontologies in the Colorado Richly Annotated Full-Text (CRAFT) Corpus. Over 1,000 parameter combinations are examined, and best-performing parameters for each system-ontology pair are presented. CONCLUSIONS: Baselines for concept recognition by three systems on eight biomedical ontologies are established (F-measures range from 0.14–0.83). Out of the three systems we tested, ConceptMapper is generally the best-performing system; it produces the highest F-measure of seven out of eight ontologies. Default parameters are not ideal for most systems on most ontologies; by changing parameters F-measure can be increased by up to 0.4. Not only are best performing parameters presented, but suggestions for choosing the best parameters based on ontology characteristics are presented. |
format | Online Article Text |
id | pubmed-4015610 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40156102014-05-23 Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters Funk, Christopher Baumgartner, William Garcia, Benjamin Roeder, Christophe Bada, Michael Cohen, K Bretonnel Hunter, Lawrence E Verspoor, Karin BMC Bioinformatics Research Article BACKGROUND: Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general approach for concept recognition is an open problem. RESULTS: Three dictionary-based systems (MetaMap, NCBO Annotator, and ConceptMapper) are evaluated on eight biomedical ontologies in the Colorado Richly Annotated Full-Text (CRAFT) Corpus. Over 1,000 parameter combinations are examined, and best-performing parameters for each system-ontology pair are presented. CONCLUSIONS: Baselines for concept recognition by three systems on eight biomedical ontologies are established (F-measures range from 0.14–0.83). Out of the three systems we tested, ConceptMapper is generally the best-performing system; it produces the highest F-measure of seven out of eight ontologies. Default parameters are not ideal for most systems on most ontologies; by changing parameters F-measure can be increased by up to 0.4. Not only are best performing parameters presented, but suggestions for choosing the best parameters based on ontology characteristics are presented. BioMed Central 2014-02-26 /pmc/articles/PMC4015610/ /pubmed/24571547 http://dx.doi.org/10.1186/1471-2105-15-59 Text en Copyright © 2014 Funk et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. |
spellingShingle | Research Article Funk, Christopher Baumgartner, William Garcia, Benjamin Roeder, Christophe Bada, Michael Cohen, K Bretonnel Hunter, Lawrence E Verspoor, Karin Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters |
title | Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters |
title_full | Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters |
title_fullStr | Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters |
title_full_unstemmed | Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters |
title_short | Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters |
title_sort | large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015610/ https://www.ncbi.nlm.nih.gov/pubmed/24571547 http://dx.doi.org/10.1186/1471-2105-15-59 |
work_keys_str_mv | AT funkchristopher largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters AT baumgartnerwilliam largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters AT garciabenjamin largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters AT roederchristophe largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters AT badamichael largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters AT cohenkbretonnel largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters AT hunterlawrencee largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters AT verspoorkarin largescalebiomedicalconceptrecognitionanevaluationofcurrentautomaticannotatorsandtheirparameters |