Cargando…
DASMiner: discovering and integrating data from DAS sources
BACKGROUND: DAS is a widely adopted protocol for providing syntactic interoperability among biological databases. The popularity of DAS is due to a simplified and elegant mechanism for data exchange that consists of sources exposing their RESTful interfaces for data access. As a growing number of DA...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2789070/ https://www.ncbi.nlm.nih.gov/pubmed/19919683 http://dx.doi.org/10.1186/1752-0509-3-109 |
_version_ | 1782175026368217088 |
---|---|
author | Veiga, Diogo FT Deus, Helena F Akdemir, Caner Vasconcelos, Ana Tereza R Almeida, Jonas S |
author_facet | Veiga, Diogo FT Deus, Helena F Akdemir, Caner Vasconcelos, Ana Tereza R Almeida, Jonas S |
author_sort | Veiga, Diogo FT |
collection | PubMed |
description | BACKGROUND: DAS is a widely adopted protocol for providing syntactic interoperability among biological databases. The popularity of DAS is due to a simplified and elegant mechanism for data exchange that consists of sources exposing their RESTful interfaces for data access. As a growing number of DAS services are available for molecular biology resources, there is an incentive to explore this protocol in order to advance data discovery and integration among these resources. RESULTS: We developed DASMiner, a Matlab toolkit for querying DAS data sources that enables creation of integrated biological models using the information available in DAS-compliant repositories. DASMiner is composed by a browser application and an API that work together to facilitate gathering of data from different DAS sources, which can be used for creating enriched datasets from multiple sources. The browser is used to formulate queries and navigate data contained in DAS sources. Users can execute queries against these sources in an intuitive fashion, without the need of knowing the specific DAS syntax for the particular source. Using the source's metadata provided by the DAS Registry, the browser's layout adapts to expose only the set of commands and coordinate systems supported by the specific source. For this reason, the browser can interrogate any DAS source, independently of the type of data being served. The API component of DASMiner may be used for programmatic access of DAS sources by programs in Matlab. Once the desired data is found during navigation, the query is exported in the format of an API call to be used within any Matlab application. We illustrate the use of DASMiner by creating integrative models of histone modification maps and protein-protein interaction networks. These enriched datasets were built by retrieving and integrating distributed genomic and proteomic DAS sources using the API. CONCLUSION: The support of the DAS protocol allows that hundreds of molecular biology databases to be treated as a federated, online collection of resources. DASMiner enables full exploration of these resources, and can be used to deploy applications and create integrated views of biological systems using the information deposited in DAS repositories. |
format | Text |
id | pubmed-2789070 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27890702009-12-05 DASMiner: discovering and integrating data from DAS sources Veiga, Diogo FT Deus, Helena F Akdemir, Caner Vasconcelos, Ana Tereza R Almeida, Jonas S BMC Syst Biol Software BACKGROUND: DAS is a widely adopted protocol for providing syntactic interoperability among biological databases. The popularity of DAS is due to a simplified and elegant mechanism for data exchange that consists of sources exposing their RESTful interfaces for data access. As a growing number of DAS services are available for molecular biology resources, there is an incentive to explore this protocol in order to advance data discovery and integration among these resources. RESULTS: We developed DASMiner, a Matlab toolkit for querying DAS data sources that enables creation of integrated biological models using the information available in DAS-compliant repositories. DASMiner is composed by a browser application and an API that work together to facilitate gathering of data from different DAS sources, which can be used for creating enriched datasets from multiple sources. The browser is used to formulate queries and navigate data contained in DAS sources. Users can execute queries against these sources in an intuitive fashion, without the need of knowing the specific DAS syntax for the particular source. Using the source's metadata provided by the DAS Registry, the browser's layout adapts to expose only the set of commands and coordinate systems supported by the specific source. For this reason, the browser can interrogate any DAS source, independently of the type of data being served. The API component of DASMiner may be used for programmatic access of DAS sources by programs in Matlab. Once the desired data is found during navigation, the query is exported in the format of an API call to be used within any Matlab application. We illustrate the use of DASMiner by creating integrative models of histone modification maps and protein-protein interaction networks. These enriched datasets were built by retrieving and integrating distributed genomic and proteomic DAS sources using the API. CONCLUSION: The support of the DAS protocol allows that hundreds of molecular biology databases to be treated as a federated, online collection of resources. DASMiner enables full exploration of these resources, and can be used to deploy applications and create integrated views of biological systems using the information deposited in DAS repositories. BioMed Central 2009-11-17 /pmc/articles/PMC2789070/ /pubmed/19919683 http://dx.doi.org/10.1186/1752-0509-3-109 Text en Copyright ©2009 Veiga et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Veiga, Diogo FT Deus, Helena F Akdemir, Caner Vasconcelos, Ana Tereza R Almeida, Jonas S DASMiner: discovering and integrating data from DAS sources |
title | DASMiner: discovering and integrating data from DAS sources |
title_full | DASMiner: discovering and integrating data from DAS sources |
title_fullStr | DASMiner: discovering and integrating data from DAS sources |
title_full_unstemmed | DASMiner: discovering and integrating data from DAS sources |
title_short | DASMiner: discovering and integrating data from DAS sources |
title_sort | dasminer: discovering and integrating data from das sources |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2789070/ https://www.ncbi.nlm.nih.gov/pubmed/19919683 http://dx.doi.org/10.1186/1752-0509-3-109 |
work_keys_str_mv | AT veigadiogoft dasminerdiscoveringandintegratingdatafromdassources AT deushelenaf dasminerdiscoveringandintegratingdatafromdassources AT akdemircaner dasminerdiscoveringandintegratingdatafromdassources AT vasconcelosanaterezar dasminerdiscoveringandintegratingdatafromdassources AT almeidajonass dasminerdiscoveringandintegratingdatafromdassources |