Cargando…
Multi-domain semantic similarity in biomedical research
BACKGROUND: Given the increasing amount of biomedical resources that are being annotated with concepts from more than one ontology and covering multiple domains of knowledge, it is important to devise mechanisms to compare these resources that take into account the various domains of annotation. For...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6538554/ https://www.ncbi.nlm.nih.gov/pubmed/31138117 http://dx.doi.org/10.1186/s12859-019-2810-9 |
_version_ | 1783422187223384064 |
---|---|
author | Ferreira, João D. Couto, Francisco M. |
author_facet | Ferreira, João D. Couto, Francisco M. |
author_sort | Ferreira, João D. |
collection | PubMed |
description | BACKGROUND: Given the increasing amount of biomedical resources that are being annotated with concepts from more than one ontology and covering multiple domains of knowledge, it is important to devise mechanisms to compare these resources that take into account the various domains of annotation. For example, metabolic pathways are annotated with their enzymes and their metabolites, and thus similarity measures should compare them with respect to both of those domains simultaneously. RESULTS: In this paper, we propose two approaches to lift existing single-ontology semantic similarity measures into multi-domain measures. The aggregative approach compares domains independently and averages the various similarity values into a final score. The integrative approach integrates all the relevant ontologies into a single one, calculating similarity in the resulting multi-domain ontology using the single-ontology measure. CONCLUSIONS: We evaluated the two approaches in a multidisciplinary epidemiology dataset by evaluating the capacity of the similarity measures to predict new annotations based on the existing ones. The results show a promising increase in performance of the multi-domain measures over the single-ontology ones in the vast majority of the cases. These results show that multi-domain measures outperform single-domain ones, and should be considered by the community as a starting point to study more efficient multi-domain semantic similarity measures. |
format | Online Article Text |
id | pubmed-6538554 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-65385542019-06-03 Multi-domain semantic similarity in biomedical research Ferreira, João D. Couto, Francisco M. BMC Bioinformatics Research BACKGROUND: Given the increasing amount of biomedical resources that are being annotated with concepts from more than one ontology and covering multiple domains of knowledge, it is important to devise mechanisms to compare these resources that take into account the various domains of annotation. For example, metabolic pathways are annotated with their enzymes and their metabolites, and thus similarity measures should compare them with respect to both of those domains simultaneously. RESULTS: In this paper, we propose two approaches to lift existing single-ontology semantic similarity measures into multi-domain measures. The aggregative approach compares domains independently and averages the various similarity values into a final score. The integrative approach integrates all the relevant ontologies into a single one, calculating similarity in the resulting multi-domain ontology using the single-ontology measure. CONCLUSIONS: We evaluated the two approaches in a multidisciplinary epidemiology dataset by evaluating the capacity of the similarity measures to predict new annotations based on the existing ones. The results show a promising increase in performance of the multi-domain measures over the single-ontology ones in the vast majority of the cases. These results show that multi-domain measures outperform single-domain ones, and should be considered by the community as a starting point to study more efficient multi-domain semantic similarity measures. BioMed Central 2019-05-29 /pmc/articles/PMC6538554/ /pubmed/31138117 http://dx.doi.org/10.1186/s12859-019-2810-9 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Ferreira, João D. Couto, Francisco M. Multi-domain semantic similarity in biomedical research |
title | Multi-domain semantic similarity in biomedical research |
title_full | Multi-domain semantic similarity in biomedical research |
title_fullStr | Multi-domain semantic similarity in biomedical research |
title_full_unstemmed | Multi-domain semantic similarity in biomedical research |
title_short | Multi-domain semantic similarity in biomedical research |
title_sort | multi-domain semantic similarity in biomedical research |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6538554/ https://www.ncbi.nlm.nih.gov/pubmed/31138117 http://dx.doi.org/10.1186/s12859-019-2810-9 |
work_keys_str_mv | AT ferreirajoaod multidomainsemanticsimilarityinbiomedicalresearch AT coutofranciscom multidomainsemanticsimilarityinbiomedicalresearch |