Cargando…

Multi-domain semantic similarity in biomedical research

BACKGROUND: Given the increasing amount of biomedical resources that are being annotated with concepts from more than one ontology and covering multiple domains of knowledge, it is important to devise mechanisms to compare these resources that take into account the various domains of annotation. For...

Descripción completa

Detalles Bibliográficos
Autores principales: Ferreira, João D., Couto, Francisco M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6538554/
https://www.ncbi.nlm.nih.gov/pubmed/31138117
http://dx.doi.org/10.1186/s12859-019-2810-9
_version_ 1783422187223384064
author Ferreira, João D.
Couto, Francisco M.
author_facet Ferreira, João D.
Couto, Francisco M.
author_sort Ferreira, João D.
collection PubMed
description BACKGROUND: Given the increasing amount of biomedical resources that are being annotated with concepts from more than one ontology and covering multiple domains of knowledge, it is important to devise mechanisms to compare these resources that take into account the various domains of annotation. For example, metabolic pathways are annotated with their enzymes and their metabolites, and thus similarity measures should compare them with respect to both of those domains simultaneously. RESULTS: In this paper, we propose two approaches to lift existing single-ontology semantic similarity measures into multi-domain measures. The aggregative approach compares domains independently and averages the various similarity values into a final score. The integrative approach integrates all the relevant ontologies into a single one, calculating similarity in the resulting multi-domain ontology using the single-ontology measure. CONCLUSIONS: We evaluated the two approaches in a multidisciplinary epidemiology dataset by evaluating the capacity of the similarity measures to predict new annotations based on the existing ones. The results show a promising increase in performance of the multi-domain measures over the single-ontology ones in the vast majority of the cases. These results show that multi-domain measures outperform single-domain ones, and should be considered by the community as a starting point to study more efficient multi-domain semantic similarity measures.
format Online
Article
Text
id pubmed-6538554
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65385542019-06-03 Multi-domain semantic similarity in biomedical research Ferreira, João D. Couto, Francisco M. BMC Bioinformatics Research BACKGROUND: Given the increasing amount of biomedical resources that are being annotated with concepts from more than one ontology and covering multiple domains of knowledge, it is important to devise mechanisms to compare these resources that take into account the various domains of annotation. For example, metabolic pathways are annotated with their enzymes and their metabolites, and thus similarity measures should compare them with respect to both of those domains simultaneously. RESULTS: In this paper, we propose two approaches to lift existing single-ontology semantic similarity measures into multi-domain measures. The aggregative approach compares domains independently and averages the various similarity values into a final score. The integrative approach integrates all the relevant ontologies into a single one, calculating similarity in the resulting multi-domain ontology using the single-ontology measure. CONCLUSIONS: We evaluated the two approaches in a multidisciplinary epidemiology dataset by evaluating the capacity of the similarity measures to predict new annotations based on the existing ones. The results show a promising increase in performance of the multi-domain measures over the single-ontology ones in the vast majority of the cases. These results show that multi-domain measures outperform single-domain ones, and should be considered by the community as a starting point to study more efficient multi-domain semantic similarity measures. BioMed Central 2019-05-29 /pmc/articles/PMC6538554/ /pubmed/31138117 http://dx.doi.org/10.1186/s12859-019-2810-9 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Ferreira, João D.
Couto, Francisco M.
Multi-domain semantic similarity in biomedical research
title Multi-domain semantic similarity in biomedical research
title_full Multi-domain semantic similarity in biomedical research
title_fullStr Multi-domain semantic similarity in biomedical research
title_full_unstemmed Multi-domain semantic similarity in biomedical research
title_short Multi-domain semantic similarity in biomedical research
title_sort multi-domain semantic similarity in biomedical research
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6538554/
https://www.ncbi.nlm.nih.gov/pubmed/31138117
http://dx.doi.org/10.1186/s12859-019-2810-9
work_keys_str_mv AT ferreirajoaod multidomainsemanticsimilarityinbiomedicalresearch
AT coutofranciscom multidomainsemanticsimilarityinbiomedicalresearch