
Integrated querying and version control of context-specific biological networks

MOTIVATION: Biomolecular data stored in public databases is increasingly specialized to organisms, context/pathology and tissue type, potentially resulting in significant overhead for analyses. These networks are often specializations of generic interaction sets, presenting opportunities for reducin...


Bibliographic Details
Main Authors: Cowman, Tyler; Coşkun, Mustafa; Grama, Ananth; Koyutürk, Mehmet
Format: Online Article Text
Language: English
Published: Oxford University Press 2020
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7158887/
https://www.ncbi.nlm.nih.gov/pubmed/32294194
http://dx.doi.org/10.1093/database/baaa018
author Cowman, Tyler
Coşkun, Mustafa
Grama, Ananth
Koyutürk, Mehmet
collection PubMed
description MOTIVATION: Biomolecular data stored in public databases is increasingly specialized to organisms, context/pathology and tissue type, potentially resulting in significant overhead for analyses. These networks are often specializations of generic interaction sets, presenting opportunities for reducing storage and computational cost. Therefore, it is desirable to develop effective compression and storage techniques, along with efficient algorithms and a flexible query interface capable of operating on compressed data structures. Current graph databases offer varying levels of support for network integration. However, these solutions do not provide efficient methods for the storage and querying of versioned networks. RESULTS: We present VerTIoN, a framework consisting of novel data structures and associated query mechanisms for integrated querying of versioned context-specific biological networks. As a use case for our framework, we study network proximity queries in which the user can select and compose a combination of tissue-specific and generic networks. Using our compressed version tree data structure, in conjunction with state-of-the-art numerical techniques, we demonstrate real-time querying of large network databases. CONCLUSION: Our results show that it is possible to support flexible queries defined on heterogeneous networks composed at query time while drastically reducing response time for multiple simultaneous queries. The flexibility offered by VerTIoN in composing integrated network versions opens significant new avenues for the utilization of the ever-increasing volume of context-specific network data in a broad range of biomedical applications. AVAILABILITY AND IMPLEMENTATION: VerTIoN is implemented as a C++ library and is available at http://compbio.case.edu/omics/software/vertion and https://github.com/tjcowman/vertion. CONTACT: tyler.cowman@case.edu
format Online
Article
Text
id pubmed-7158887
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-7158887 2020-04-20 Integrated querying and version control of context-specific biological networks Cowman, Tyler Coşkun, Mustafa Grama, Ananth Koyutürk, Mehmet Database (Oxford) Original Article Oxford University Press 2020-04-15 /pmc/articles/PMC7158887/ /pubmed/32294194 http://dx.doi.org/10.1093/database/baaa018 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title Integrated querying and version control of context-specific biological networks
topic Original Article