Cargando…

KA-SB: from data integration to large scale reasoning

BACKGROUND: The analysis of information in the biological domain is usually focused on the analysis of data from single on-line data sources. Unfortunately, studying a biological process requires having access to disperse, heterogeneous, autonomous data sources. In this context, an analysis of the i...

Descripción completa

Detalles Bibliográficos
Autores principales: Roldán-García, María del Mar, Navas-Delgado, Ismael, Kerzazi, Amine, Chniber, Othmane, Molina-Castro, Joaquín, Aldana-Montes, José F
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2755826/
https://www.ncbi.nlm.nih.gov/pubmed/19796402
http://dx.doi.org/10.1186/1471-2105-10-S10-S5
_version_ 1782172473729482752
author Roldán-García, María del Mar
Navas-Delgado, Ismael
Kerzazi, Amine
Chniber, Othmane
Molina-Castro, Joaquín
Aldana-Montes, José F
author_facet Roldán-García, María del Mar
Navas-Delgado, Ismael
Kerzazi, Amine
Chniber, Othmane
Molina-Castro, Joaquín
Aldana-Montes, José F
author_sort Roldán-García, María del Mar
collection PubMed
description BACKGROUND: The analysis of information in the biological domain is usually focused on the analysis of data from single on-line data sources. Unfortunately, studying a biological process requires having access to disperse, heterogeneous, autonomous data sources. In this context, an analysis of the information is not possible without the integration of such data. METHODS: KA-SB is a querying and analysis system for final users based on combining a data integration solution with a reasoner. Thus, the tool has been created with a process divided into two steps: 1) KOMF, the Khaos Ontology-based Mediator Framework, is used to retrieve information from heterogeneous and distributed databases; 2) the integrated information is crystallized in a (persistent and high performance) reasoner (DBOWL). This information could be further analyzed later (by means of querying and reasoning). RESULTS: In this paper we present a novel system that combines the use of a mediation system with the reasoning capabilities of a large scale reasoner to provide a way of finding new knowledge and of analyzing the integrated information from different databases, which is retrieved as a set of ontology instances. This tool uses a graphical query interface to build user queries easily, which shows a graphical representation of the ontology and allows users o build queries by clicking on the ontology concepts. CONCLUSION: These kinds of systems (based on KOMF) will provide users with very large amounts of information (interpreted as ontology instances once retrieved), which cannot be managed using traditional main memory-based reasoners. We propose a process for creating persistent and scalable knowledgebases from sets of OWL instances obtained by integrating heterogeneous data sources with KOMF. This process has been applied to develop a demo tool http://khaos.uma.es/KA-SB, which uses the BioPax Level 3 ontology as the integration schema, and integrates UNIPROT, KEGG, CHEBI, BRENDA and SABIORK databases.
format Text
id pubmed-2755826
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27558262009-10-03 KA-SB: from data integration to large scale reasoning Roldán-García, María del Mar Navas-Delgado, Ismael Kerzazi, Amine Chniber, Othmane Molina-Castro, Joaquín Aldana-Montes, José F BMC Bioinformatics Research BACKGROUND: The analysis of information in the biological domain is usually focused on the analysis of data from single on-line data sources. Unfortunately, studying a biological process requires having access to disperse, heterogeneous, autonomous data sources. In this context, an analysis of the information is not possible without the integration of such data. METHODS: KA-SB is a querying and analysis system for final users based on combining a data integration solution with a reasoner. Thus, the tool has been created with a process divided into two steps: 1) KOMF, the Khaos Ontology-based Mediator Framework, is used to retrieve information from heterogeneous and distributed databases; 2) the integrated information is crystallized in a (persistent and high performance) reasoner (DBOWL). This information could be further analyzed later (by means of querying and reasoning). RESULTS: In this paper we present a novel system that combines the use of a mediation system with the reasoning capabilities of a large scale reasoner to provide a way of finding new knowledge and of analyzing the integrated information from different databases, which is retrieved as a set of ontology instances. This tool uses a graphical query interface to build user queries easily, which shows a graphical representation of the ontology and allows users o build queries by clicking on the ontology concepts. CONCLUSION: These kinds of systems (based on KOMF) will provide users with very large amounts of information (interpreted as ontology instances once retrieved), which cannot be managed using traditional main memory-based reasoners. We propose a process for creating persistent and scalable knowledgebases from sets of OWL instances obtained by integrating heterogeneous data sources with KOMF. This process has been applied to develop a demo tool http://khaos.uma.es/KA-SB, which uses the BioPax Level 3 ontology as the integration schema, and integrates UNIPROT, KEGG, CHEBI, BRENDA and SABIORK databases. BioMed Central 2009-10-01 /pmc/articles/PMC2755826/ /pubmed/19796402 http://dx.doi.org/10.1186/1471-2105-10-S10-S5 Text en © Roldán-García et al; licensee BioMed Central Ltd. 2009 https://creativecommons.org/licenses/by/2.0/This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0 (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. https://creativecommons.org/licenses/by/2.0/ Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Roldán-García, María del Mar
Navas-Delgado, Ismael
Kerzazi, Amine
Chniber, Othmane
Molina-Castro, Joaquín
Aldana-Montes, José F
KA-SB: from data integration to large scale reasoning
title KA-SB: from data integration to large scale reasoning
title_full KA-SB: from data integration to large scale reasoning
title_fullStr KA-SB: from data integration to large scale reasoning
title_full_unstemmed KA-SB: from data integration to large scale reasoning
title_short KA-SB: from data integration to large scale reasoning
title_sort ka-sb: from data integration to large scale reasoning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2755826/
https://www.ncbi.nlm.nih.gov/pubmed/19796402
http://dx.doi.org/10.1186/1471-2105-10-S10-S5
work_keys_str_mv AT roldangarciamariadelmar kasbfromdataintegrationtolargescalereasoning
AT navasdelgadoismael kasbfromdataintegrationtolargescalereasoning
AT kerzaziamine kasbfromdataintegrationtolargescalereasoning
AT chniberothmane kasbfromdataintegrationtolargescalereasoning
AT molinacastrojoaquin kasbfromdataintegrationtolargescalereasoning
AT aldanamontesjosef kasbfromdataintegrationtolargescalereasoning