Cargando…

Comparison of concept recognizers for building the Open Biomedical Annotator

The National Center for Biomedical Ontology (NCBO) is developing a system for automated, ontology-based access to online biomedical resources (Shah NH, et al.: Ontology-driven indexing of public datasets for translational bioinformatics. BMC Bioinformatics 2009, 10(Suppl 2):S1). The system's in...

Descripción completa

Detalles Bibliográficos
Autores principales: Shah, Nigam H, Bhatia, Nipun, Jonquet, Clement, Rubin, Daniel, Chiang, Annie P, Musen, Mark A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2745685/
https://www.ncbi.nlm.nih.gov/pubmed/19761568
http://dx.doi.org/10.1186/1471-2105-10-S9-S14
_version_ 1782171986078728192
author Shah, Nigam H
Bhatia, Nipun
Jonquet, Clement
Rubin, Daniel
Chiang, Annie P
Musen, Mark A
author_facet Shah, Nigam H
Bhatia, Nipun
Jonquet, Clement
Rubin, Daniel
Chiang, Annie P
Musen, Mark A
author_sort Shah, Nigam H
collection PubMed
description The National Center for Biomedical Ontology (NCBO) is developing a system for automated, ontology-based access to online biomedical resources (Shah NH, et al.: Ontology-driven indexing of public datasets for translational bioinformatics. BMC Bioinformatics 2009, 10(Suppl 2):S1). The system's indexing workflow processes the text metadata of diverse resources such as datasets from GEO and ArrayExpress to annotate and index them with concepts from appropriate ontologies. This indexing requires the use of a concept-recognition tool to identify ontology concepts in the resource's textual metadata. In this paper, we present a comparison of two concept recognizers – NLM's MetaMap and the University of Michigan's Mgrep. We utilize a number of data sources and dictionaries to evaluate the concept recognizers in terms of precision, recall, speed of execution, scalability and customizability. Our evaluations demonstrate that Mgrep has a clear edge over MetaMap for large-scale service oriented applications. Based on our analysis we also suggest areas of potential improvements for Mgrep. We have subsequently used Mgrep to build the Open Biomedical Annotator service. The Annotator service has access to a large dictionary of biomedical terms derived from the United Medical Language System (UMLS) and NCBO ontologies. The Annotator also leverages the hierarchical structure of the ontologies and their mappings to expand annotations. The Annotator service is available to the community as a REST Web service for creating ontology-based annotations of their data.
format Text
id pubmed-2745685
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27456852009-09-18 Comparison of concept recognizers for building the Open Biomedical Annotator Shah, Nigam H Bhatia, Nipun Jonquet, Clement Rubin, Daniel Chiang, Annie P Musen, Mark A BMC Bioinformatics Proceedings The National Center for Biomedical Ontology (NCBO) is developing a system for automated, ontology-based access to online biomedical resources (Shah NH, et al.: Ontology-driven indexing of public datasets for translational bioinformatics. BMC Bioinformatics 2009, 10(Suppl 2):S1). The system's indexing workflow processes the text metadata of diverse resources such as datasets from GEO and ArrayExpress to annotate and index them with concepts from appropriate ontologies. This indexing requires the use of a concept-recognition tool to identify ontology concepts in the resource's textual metadata. In this paper, we present a comparison of two concept recognizers – NLM's MetaMap and the University of Michigan's Mgrep. We utilize a number of data sources and dictionaries to evaluate the concept recognizers in terms of precision, recall, speed of execution, scalability and customizability. Our evaluations demonstrate that Mgrep has a clear edge over MetaMap for large-scale service oriented applications. Based on our analysis we also suggest areas of potential improvements for Mgrep. We have subsequently used Mgrep to build the Open Biomedical Annotator service. The Annotator service has access to a large dictionary of biomedical terms derived from the United Medical Language System (UMLS) and NCBO ontologies. The Annotator also leverages the hierarchical structure of the ontologies and their mappings to expand annotations. The Annotator service is available to the community as a REST Web service for creating ontology-based annotations of their data. BioMed Central 2009-09-17 /pmc/articles/PMC2745685/ /pubmed/19761568 http://dx.doi.org/10.1186/1471-2105-10-S9-S14 Text en Copyright © 2009 Shah et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Shah, Nigam H
Bhatia, Nipun
Jonquet, Clement
Rubin, Daniel
Chiang, Annie P
Musen, Mark A
Comparison of concept recognizers for building the Open Biomedical Annotator
title Comparison of concept recognizers for building the Open Biomedical Annotator
title_full Comparison of concept recognizers for building the Open Biomedical Annotator
title_fullStr Comparison of concept recognizers for building the Open Biomedical Annotator
title_full_unstemmed Comparison of concept recognizers for building the Open Biomedical Annotator
title_short Comparison of concept recognizers for building the Open Biomedical Annotator
title_sort comparison of concept recognizers for building the open biomedical annotator
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2745685/
https://www.ncbi.nlm.nih.gov/pubmed/19761568
http://dx.doi.org/10.1186/1471-2105-10-S9-S14
work_keys_str_mv AT shahnigamh comparisonofconceptrecognizersforbuildingtheopenbiomedicalannotator
AT bhatianipun comparisonofconceptrecognizersforbuildingtheopenbiomedicalannotator
AT jonquetclement comparisonofconceptrecognizersforbuildingtheopenbiomedicalannotator
AT rubindaniel comparisonofconceptrecognizersforbuildingtheopenbiomedicalannotator
AT chianganniep comparisonofconceptrecognizersforbuildingtheopenbiomedicalannotator
AT musenmarka comparisonofconceptrecognizersforbuildingtheopenbiomedicalannotator