Cargando…
Applying graph database technology for analyzing perturbed co-expression networks in cancer
Graph representations provide an elegant solution to capture and analyze complex molecular mechanisms in the cell. Co-expression networks are undirected graph representations of transcriptional co-behavior indicating (co-)regulations, functional modules or even physical interactions between the corr...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7731929/ https://www.ncbi.nlm.nih.gov/pubmed/33306799 http://dx.doi.org/10.1093/database/baaa110 |
_version_ | 1783621995938709504 |
---|---|
author | Simpson, Claire M Gnad, Florian |
author_facet | Simpson, Claire M Gnad, Florian |
author_sort | Simpson, Claire M |
collection | PubMed |
description | Graph representations provide an elegant solution to capture and analyze complex molecular mechanisms in the cell. Co-expression networks are undirected graph representations of transcriptional co-behavior indicating (co-)regulations, functional modules or even physical interactions between the corresponding gene products. The growing avalanche of available RNA sequencing (RNAseq) data fuels the construction of such networks, which are usually stored in relational databases like most other biological data. Inferring linkage by recursive multiple-join statements, however, is computationally expensive and complex to design in relational databases. In contrast, graph databases store and represent complex interconnected data as nodes, edges and properties, making it fast and intuitive to query and analyze relationships. While graph-based database technologies are on their way from a fringe domain to going mainstream, there are only a few studies reporting their application to biological data. We used the graph database management system Neo4j to store and analyze co-expression networks derived from RNAseq data from The Cancer Genome Atlas. Comparing co-expression in tumors versus healthy tissues in six cancer types revealed significant perturbation tracing back to erroneous or rewired gene regulation. Applying centrality, community detection and pathfinding graph algorithms uncovered the destruction or creation of central nodes, modules and relationships in co-expression networks of tumors. Given the speed, accuracy and straightforwardness of managing these densely connected networks, we conclude that graph databases are ready for entering the arena of biological data. |
format | Online Article Text |
id | pubmed-7731929 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-77319292020-12-16 Applying graph database technology for analyzing perturbed co-expression networks in cancer Simpson, Claire M Gnad, Florian Database (Oxford) Original Article Graph representations provide an elegant solution to capture and analyze complex molecular mechanisms in the cell. Co-expression networks are undirected graph representations of transcriptional co-behavior indicating (co-)regulations, functional modules or even physical interactions between the corresponding gene products. The growing avalanche of available RNA sequencing (RNAseq) data fuels the construction of such networks, which are usually stored in relational databases like most other biological data. Inferring linkage by recursive multiple-join statements, however, is computationally expensive and complex to design in relational databases. In contrast, graph databases store and represent complex interconnected data as nodes, edges and properties, making it fast and intuitive to query and analyze relationships. While graph-based database technologies are on their way from a fringe domain to going mainstream, there are only a few studies reporting their application to biological data. We used the graph database management system Neo4j to store and analyze co-expression networks derived from RNAseq data from The Cancer Genome Atlas. Comparing co-expression in tumors versus healthy tissues in six cancer types revealed significant perturbation tracing back to erroneous or rewired gene regulation. Applying centrality, community detection and pathfinding graph algorithms uncovered the destruction or creation of central nodes, modules and relationships in co-expression networks of tumors. Given the speed, accuracy and straightforwardness of managing these densely connected networks, we conclude that graph databases are ready for entering the arena of biological data. Oxford University Press 2020-12-11 /pmc/articles/PMC7731929/ /pubmed/33306799 http://dx.doi.org/10.1093/database/baaa110 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Simpson, Claire M Gnad, Florian Applying graph database technology for analyzing perturbed co-expression networks in cancer |
title | Applying graph database technology for analyzing perturbed co-expression networks in cancer |
title_full | Applying graph database technology for analyzing perturbed co-expression networks in cancer |
title_fullStr | Applying graph database technology for analyzing perturbed co-expression networks in cancer |
title_full_unstemmed | Applying graph database technology for analyzing perturbed co-expression networks in cancer |
title_short | Applying graph database technology for analyzing perturbed co-expression networks in cancer |
title_sort | applying graph database technology for analyzing perturbed co-expression networks in cancer |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7731929/ https://www.ncbi.nlm.nih.gov/pubmed/33306799 http://dx.doi.org/10.1093/database/baaa110 |
work_keys_str_mv | AT simpsonclairem applyinggraphdatabasetechnologyforanalyzingperturbedcoexpressionnetworksincancer AT gnadflorian applyinggraphdatabasetechnologyforanalyzingperturbedcoexpressionnetworksincancer |