Cargando…

Multi-domain semantic similarity in biomedical research

BACKGROUND: Given the increasing amount of biomedical resources that are being annotated with concepts from more than one ontology and covering multiple domains of knowledge, it is important to devise mechanisms to compare these resources that take into account the various domains of annotation. For...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ferreira, João D., Couto, Francisco M.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2019
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6538554/ https://www.ncbi.nlm.nih.gov/pubmed/31138117 http://dx.doi.org/10.1186/s12859-019-2810-9

_version_	1783422187223384064
author	Ferreira, João D. Couto, Francisco M.
author_facet	Ferreira, João D. Couto, Francisco M.
author_sort	Ferreira, João D.
collection	PubMed
description	BACKGROUND: Given the increasing amount of biomedical resources that are being annotated with concepts from more than one ontology and covering multiple domains of knowledge, it is important to devise mechanisms to compare these resources that take into account the various domains of annotation. For example, metabolic pathways are annotated with their enzymes and their metabolites, and thus similarity measures should compare them with respect to both of those domains simultaneously. RESULTS: In this paper, we propose two approaches to lift existing single-ontology semantic similarity measures into multi-domain measures. The aggregative approach compares domains independently and averages the various similarity values into a final score. The integrative approach integrates all the relevant ontologies into a single one, calculating similarity in the resulting multi-domain ontology using the single-ontology measure. CONCLUSIONS: We evaluated the two approaches in a multidisciplinary epidemiology dataset by evaluating the capacity of the similarity measures to predict new annotations based on the existing ones. The results show a promising increase in performance of the multi-domain measures over the single-ontology ones in the vast majority of the cases. These results show that multi-domain measures outperform single-domain ones, and should be considered by the community as a starting point to study more efficient multi-domain semantic similarity measures.
format	Online Article Text
id	pubmed-6538554
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-65385542019-06-03 Multi-domain semantic similarity in biomedical research Ferreira, João D. Couto, Francisco M. BMC Bioinformatics Research BACKGROUND: Given the increasing amount of biomedical resources that are being annotated with concepts from more than one ontology and covering multiple domains of knowledge, it is important to devise mechanisms to compare these resources that take into account the various domains of annotation. For example, metabolic pathways are annotated with their enzymes and their metabolites, and thus similarity measures should compare them with respect to both of those domains simultaneously. RESULTS: In this paper, we propose two approaches to lift existing single-ontology semantic similarity measures into multi-domain measures. The aggregative approach compares domains independently and averages the various similarity values into a final score. The integrative approach integrates all the relevant ontologies into a single one, calculating similarity in the resulting multi-domain ontology using the single-ontology measure. CONCLUSIONS: We evaluated the two approaches in a multidisciplinary epidemiology dataset by evaluating the capacity of the similarity measures to predict new annotations based on the existing ones. The results show a promising increase in performance of the multi-domain measures over the single-ontology ones in the vast majority of the cases. These results show that multi-domain measures outperform single-domain ones, and should be considered by the community as a starting point to study more efficient multi-domain semantic similarity measures. BioMed Central 2019-05-29 /pmc/articles/PMC6538554/ /pubmed/31138117 http://dx.doi.org/10.1186/s12859-019-2810-9 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Ferreira, João D. Couto, Francisco M. Multi-domain semantic similarity in biomedical research
title	Multi-domain semantic similarity in biomedical research
title_full	Multi-domain semantic similarity in biomedical research
title_fullStr	Multi-domain semantic similarity in biomedical research
title_full_unstemmed	Multi-domain semantic similarity in biomedical research
title_short	Multi-domain semantic similarity in biomedical research
title_sort	multi-domain semantic similarity in biomedical research
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6538554/ https://www.ncbi.nlm.nih.gov/pubmed/31138117 http://dx.doi.org/10.1186/s12859-019-2810-9
work_keys_str_mv	AT ferreirajoaod multidomainsemanticsimilarityinbiomedicalresearch AT coutofranciscom multidomainsemanticsimilarityinbiomedicalresearch

Multi-domain semantic similarity in biomedical research

Ejemplares similares