
An effective method of large scale ontology matching

BACKGROUND: We are currently facing a proliferation of heterogeneous biomedical data sources accessible through various knowledge-based applications. These data are annotated by increasingly extensive and widely disseminated knowledge organisation systems ranging from simple terminologies and struct...


Bibliographic Details
Main author:	Diallo, Gayo
Format:	Online Article Text
Language:	English
Published:	BioMed Central 2014
Subjects:	Research
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236493/ https://www.ncbi.nlm.nih.gov/pubmed/25411633 http://dx.doi.org/10.1186/2041-1480-5-44

_version_	1782345175017717760
author	Diallo, Gayo
author_facet	Diallo, Gayo
author_sort	Diallo, Gayo
collection	PubMed
description	BACKGROUND: We are currently facing a proliferation of heterogeneous biomedical data sources accessible through various knowledge-based applications. These data are annotated by increasingly extensive and widely disseminated knowledge organisation systems ranging from simple terminologies and structured vocabularies to formal ontologies. In order to solve the interoperability issue, which arises due to the heterogeneity of these ontologies, an alignment task is usually performed. However, while significant effort has been made to provide tools that automatically align small ontologies containing hundreds or thousands of entities, little attention has been paid to the matching of large-sized ontologies in the life sciences domain. RESULTS: We have designed and implemented ServOMap, an effective method for large scale ontology matching. It is a fast and efficient high precision system able to perform matching of input ontologies containing hundreds of thousands of entities. The system, which was included in the 2012 and 2013 editions of the Ontology Alignment Evaluation Initiative campaign, performed very well. It was ranked among the top systems for large ontology matching. CONCLUSIONS: We proposed an approach for large scale ontology matching relying on Information Retrieval (IR) techniques and the combination of lexical and machine learning contextual similarity computing for the generation of candidate mappings. It is particularly adapted to the life sciences domain, as many of the ontologies in this domain benefit from synonym terms taken from the Unified Medical Language System that can be used by our IR strategy. The ServOMap system we implemented is able to deal with hundreds of thousands of entities with an efficient computation time.
format	Online Article Text
id	pubmed-4236493
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-4236493 2014-11-19 An effective method of large scale ontology matching Diallo, Gayo J Biomed Semantics Research
BioMed Central 2014-10-28 /pmc/articles/PMC4236493/ /pubmed/25411633 http://dx.doi.org/10.1186/2041-1480-5-44 Text en © Diallo; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle	Research Diallo, Gayo An effective method of large scale ontology matching
title	An effective method of large scale ontology matching
title_full	An effective method of large scale ontology matching
title_fullStr	An effective method of large scale ontology matching
title_full_unstemmed	An effective method of large scale ontology matching
title_short	An effective method of large scale ontology matching
title_sort	effective method of large scale ontology matching
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236493/ https://www.ncbi.nlm.nih.gov/pubmed/25411633 http://dx.doi.org/10.1186/2041-1480-5-44
work_keys_str_mv	AT diallogayo aneffectivemethodoflargescaleontologymatching AT diallogayo effectivemethodoflargescaleontologymatching
