Cargando…
An effective method of large scale ontology matching
BACKGROUND: We are currently facing a proliferation of heterogeneous biomedical data sources accessible through various knowledge-based applications. These data are annotated by increasingly extensive and widely disseminated knowledge organisation systems ranging from simple terminologies and struct...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236493/ https://www.ncbi.nlm.nih.gov/pubmed/25411633 http://dx.doi.org/10.1186/2041-1480-5-44 |
_version_ | 1782345175017717760 |
---|---|
author | Diallo, Gayo |
author_facet | Diallo, Gayo |
author_sort | Diallo, Gayo |
collection | PubMed |
description | BACKGROUND: We are currently facing a proliferation of heterogeneous biomedical data sources accessible through various knowledge-based applications. These data are annotated by increasingly extensive and widely disseminated knowledge organisation systems ranging from simple terminologies and structured vocabularies to formal ontologies. In order to solve the interoperability issue, which arises due to the heterogeneity of these ontologies, an alignment task is usually performed. However, while significant effort has been made to provide tools that automatically align small ontologies containing hundreds or thousands of entities, little attention has been paid to the matching of large sized ontologies in the life sciences domain. RESULTS: We have designed and implemented ServOMap, an effective method for large scale ontology matching. It is a fast and efficient high precision system able to perform matching of input ontologies containing hundreds of thousands of entities. The system, which was included in the 2012 and 2013 editions of the Ontology Alignment Evaluation Initiative campaign, performed very well. It was ranked among the top systems for the large ontologies matching. CONCLUSIONS: We proposed an approach for large scale ontology matching relying on Information Retrieval (IR) techniques and the combination of lexical and machine learning contextual similarity computing for the generation of candidate mappings. It is particularly adapted to the life sciences domain as many of the ontologies in this domain benefit from synonym terms taken from the Unified Medical Language System and that can be used by our IR strategy. The ServOMap system we implemented is able to deal with hundreds of thousands entities with an efficient computation time. |
format | Online Article Text |
id | pubmed-4236493 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42364932014-11-19 An effective method of large scale ontology matching Diallo, Gayo J Biomed Semantics Research BACKGROUND: We are currently facing a proliferation of heterogeneous biomedical data sources accessible through various knowledge-based applications. These data are annotated by increasingly extensive and widely disseminated knowledge organisation systems ranging from simple terminologies and structured vocabularies to formal ontologies. In order to solve the interoperability issue, which arises due to the heterogeneity of these ontologies, an alignment task is usually performed. However, while significant effort has been made to provide tools that automatically align small ontologies containing hundreds or thousands of entities, little attention has been paid to the matching of large sized ontologies in the life sciences domain. RESULTS: We have designed and implemented ServOMap, an effective method for large scale ontology matching. It is a fast and efficient high precision system able to perform matching of input ontologies containing hundreds of thousands of entities. The system, which was included in the 2012 and 2013 editions of the Ontology Alignment Evaluation Initiative campaign, performed very well. It was ranked among the top systems for the large ontologies matching. CONCLUSIONS: We proposed an approach for large scale ontology matching relying on Information Retrieval (IR) techniques and the combination of lexical and machine learning contextual similarity computing for the generation of candidate mappings. It is particularly adapted to the life sciences domain as many of the ontologies in this domain benefit from synonym terms taken from the Unified Medical Language System and that can be used by our IR strategy. The ServOMap system we implemented is able to deal with hundreds of thousands entities with an efficient computation time. BioMed Central 2014-10-28 /pmc/articles/PMC4236493/ /pubmed/25411633 http://dx.doi.org/10.1186/2041-1480-5-44 Text en © Diallo; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. |
spellingShingle | Research Diallo, Gayo An effective method of large scale ontology matching |
title | An effective method of large scale ontology matching |
title_full | An effective method of large scale ontology matching |
title_fullStr | An effective method of large scale ontology matching |
title_full_unstemmed | An effective method of large scale ontology matching |
title_short | An effective method of large scale ontology matching |
title_sort | effective method of large scale ontology matching |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236493/ https://www.ncbi.nlm.nih.gov/pubmed/25411633 http://dx.doi.org/10.1186/2041-1480-5-44 |
work_keys_str_mv | AT diallogayo aneffectivemethodoflargescaleontologymatching AT diallogayo effectivemethodoflargescaleontologymatching |