Cargando…

Exploring and linking biomedical resources through multidimensional semantic spaces

BACKGROUND: The semantic integration of biomedical resources is still a challenging issue which is required for effective information processing and data analysis. The availability of comprehensive knowledge resources such as biomedical ontologies and integrated thesauri greatly facilitates this int...

Descripción completa

Detalles Bibliográficos
Autores principales: Berlanga, Rafael, Jiménez-Ruiz, Ernesto, Nebot, Victoria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3471347/
https://www.ncbi.nlm.nih.gov/pubmed/22373409
http://dx.doi.org/10.1186/1471-2105-13-S1-S6
_version_ 1782246411500257280
author Berlanga, Rafael
Jiménez-Ruiz, Ernesto
Nebot, Victoria
author_facet Berlanga, Rafael
Jiménez-Ruiz, Ernesto
Nebot, Victoria
author_sort Berlanga, Rafael
collection PubMed
description BACKGROUND: The semantic integration of biomedical resources is still a challenging issue which is required for effective information processing and data analysis. The availability of comprehensive knowledge resources such as biomedical ontologies and integrated thesauri greatly facilitates this integration effort by means of semantic annotation, which allows disparate data formats and contents to be expressed under a common semantic space. In this paper, we propose a multidimensional representation for such a semantic space, where dimensions regard the different perspectives in biomedical research (e.g., population, disease, anatomy and protein/genes). RESULTS: This paper presents a novel method for building multidimensional semantic spaces from semantically annotated biomedical data collections. This method consists of two main processes: knowledge and data normalization. The former one arranges the concepts provided by a reference knowledge resource (e.g., biomedical ontologies and thesauri) into a set of hierarchical dimensions for analysis purposes. The latter one reduces the annotation set associated to each collection item into a set of points of the multidimensional space. Additionally, we have developed a visual tool, called 3D-Browser, which implements OLAP-like operators over the generated multidimensional space. The method and the tool have been tested and evaluated in the context of the Health-e-Child (HeC) project. Automatic semantic annotation was applied to tag three collections of abstracts taken from PubMed, one for each target disease of the project, the Uniprot database, and the HeC patient record database. We adopted the UMLS Meta-thesaurus 2010AA as the reference knowledge resource. CONCLUSIONS: Current knowledge resources and semantic-aware technology make possible the integration of biomedical resources. Such an integration is performed through semantic annotation of the intended biomedical data resources. This paper shows how these annotations can be exploited for integration, exploration, and analysis tasks. Results over a real scenario demonstrate the viability and usefulness of the approach, as well as the quality of the generated multidimensional semantic spaces.
format Online
Article
Text
id pubmed-3471347
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34713472012-10-18 Exploring and linking biomedical resources through multidimensional semantic spaces Berlanga, Rafael Jiménez-Ruiz, Ernesto Nebot, Victoria BMC Bioinformatics Research BACKGROUND: The semantic integration of biomedical resources is still a challenging issue which is required for effective information processing and data analysis. The availability of comprehensive knowledge resources such as biomedical ontologies and integrated thesauri greatly facilitates this integration effort by means of semantic annotation, which allows disparate data formats and contents to be expressed under a common semantic space. In this paper, we propose a multidimensional representation for such a semantic space, where dimensions regard the different perspectives in biomedical research (e.g., population, disease, anatomy and protein/genes). RESULTS: This paper presents a novel method for building multidimensional semantic spaces from semantically annotated biomedical data collections. This method consists of two main processes: knowledge and data normalization. The former one arranges the concepts provided by a reference knowledge resource (e.g., biomedical ontologies and thesauri) into a set of hierarchical dimensions for analysis purposes. The latter one reduces the annotation set associated to each collection item into a set of points of the multidimensional space. Additionally, we have developed a visual tool, called 3D-Browser, which implements OLAP-like operators over the generated multidimensional space. The method and the tool have been tested and evaluated in the context of the Health-e-Child (HeC) project. Automatic semantic annotation was applied to tag three collections of abstracts taken from PubMed, one for each target disease of the project, the Uniprot database, and the HeC patient record database. We adopted the UMLS Meta-thesaurus 2010AA as the reference knowledge resource. CONCLUSIONS: Current knowledge resources and semantic-aware technology make possible the integration of biomedical resources. Such an integration is performed through semantic annotation of the intended biomedical data resources. This paper shows how these annotations can be exploited for integration, exploration, and analysis tasks. Results over a real scenario demonstrate the viability and usefulness of the approach, as well as the quality of the generated multidimensional semantic spaces. BioMed Central 2012-01-25 /pmc/articles/PMC3471347/ /pubmed/22373409 http://dx.doi.org/10.1186/1471-2105-13-S1-S6 Text en Copyright ©2012 Berlanga et al. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Berlanga, Rafael
Jiménez-Ruiz, Ernesto
Nebot, Victoria
Exploring and linking biomedical resources through multidimensional semantic spaces
title Exploring and linking biomedical resources through multidimensional semantic spaces
title_full Exploring and linking biomedical resources through multidimensional semantic spaces
title_fullStr Exploring and linking biomedical resources through multidimensional semantic spaces
title_full_unstemmed Exploring and linking biomedical resources through multidimensional semantic spaces
title_short Exploring and linking biomedical resources through multidimensional semantic spaces
title_sort exploring and linking biomedical resources through multidimensional semantic spaces
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3471347/
https://www.ncbi.nlm.nih.gov/pubmed/22373409
http://dx.doi.org/10.1186/1471-2105-13-S1-S6
work_keys_str_mv AT berlangarafael exploringandlinkingbiomedicalresourcesthroughmultidimensionalsemanticspaces
AT jimenezruizernesto exploringandlinkingbiomedicalresourcesthroughmultidimensionalsemanticspaces
AT nebotvictoria exploringandlinkingbiomedicalresourcesthroughmultidimensionalsemanticspaces