Cargando…

Applying graph database technology for analyzing perturbed co-expression networks in cancer

Graph representations provide an elegant solution to capture and analyze complex molecular mechanisms in the cell. Co-expression networks are undirected graph representations of transcriptional co-behavior indicating (co-)regulations, functional modules or even physical interactions between the corr...

Descripción completa

Detalles Bibliográficos
Autores principales: Simpson, Claire M, Gnad, Florian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7731929/
https://www.ncbi.nlm.nih.gov/pubmed/33306799
http://dx.doi.org/10.1093/database/baaa110
_version_ 1783621995938709504
author Simpson, Claire M
Gnad, Florian
author_facet Simpson, Claire M
Gnad, Florian
author_sort Simpson, Claire M
collection PubMed
description Graph representations provide an elegant solution to capture and analyze complex molecular mechanisms in the cell. Co-expression networks are undirected graph representations of transcriptional co-behavior indicating (co-)regulations, functional modules or even physical interactions between the corresponding gene products. The growing avalanche of available RNA sequencing (RNAseq) data fuels the construction of such networks, which are usually stored in relational databases like most other biological data. Inferring linkage by recursive multiple-join statements, however, is computationally expensive and complex to design in relational databases. In contrast, graph databases store and represent complex interconnected data as nodes, edges and properties, making it fast and intuitive to query and analyze relationships. While graph-based database technologies are on their way from a fringe domain to going mainstream, there are only a few studies reporting their application to biological data. We used the graph database management system Neo4j to store and analyze co-expression networks derived from RNAseq data from The Cancer Genome Atlas. Comparing co-expression in tumors versus healthy tissues in six cancer types revealed significant perturbation tracing back to erroneous or rewired gene regulation. Applying centrality, community detection and pathfinding graph algorithms uncovered the destruction or creation of central nodes, modules and relationships in co-expression networks of tumors. Given the speed, accuracy and straightforwardness of managing these densely connected networks, we conclude that graph databases are ready for entering the arena of biological data.
format Online
Article
Text
id pubmed-7731929
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-77319292020-12-16 Applying graph database technology for analyzing perturbed co-expression networks in cancer Simpson, Claire M Gnad, Florian Database (Oxford) Original Article Graph representations provide an elegant solution to capture and analyze complex molecular mechanisms in the cell. Co-expression networks are undirected graph representations of transcriptional co-behavior indicating (co-)regulations, functional modules or even physical interactions between the corresponding gene products. The growing avalanche of available RNA sequencing (RNAseq) data fuels the construction of such networks, which are usually stored in relational databases like most other biological data. Inferring linkage by recursive multiple-join statements, however, is computationally expensive and complex to design in relational databases. In contrast, graph databases store and represent complex interconnected data as nodes, edges and properties, making it fast and intuitive to query and analyze relationships. While graph-based database technologies are on their way from a fringe domain to going mainstream, there are only a few studies reporting their application to biological data. We used the graph database management system Neo4j to store and analyze co-expression networks derived from RNAseq data from The Cancer Genome Atlas. Comparing co-expression in tumors versus healthy tissues in six cancer types revealed significant perturbation tracing back to erroneous or rewired gene regulation. Applying centrality, community detection and pathfinding graph algorithms uncovered the destruction or creation of central nodes, modules and relationships in co-expression networks of tumors. Given the speed, accuracy and straightforwardness of managing these densely connected networks, we conclude that graph databases are ready for entering the arena of biological data. Oxford University Press 2020-12-11 /pmc/articles/PMC7731929/ /pubmed/33306799 http://dx.doi.org/10.1093/database/baaa110 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Simpson, Claire M
Gnad, Florian
Applying graph database technology for analyzing perturbed co-expression networks in cancer
title Applying graph database technology for analyzing perturbed co-expression networks in cancer
title_full Applying graph database technology for analyzing perturbed co-expression networks in cancer
title_fullStr Applying graph database technology for analyzing perturbed co-expression networks in cancer
title_full_unstemmed Applying graph database technology for analyzing perturbed co-expression networks in cancer
title_short Applying graph database technology for analyzing perturbed co-expression networks in cancer
title_sort applying graph database technology for analyzing perturbed co-expression networks in cancer
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7731929/
https://www.ncbi.nlm.nih.gov/pubmed/33306799
http://dx.doi.org/10.1093/database/baaa110
work_keys_str_mv AT simpsonclairem applyinggraphdatabasetechnologyforanalyzingperturbedcoexpressionnetworksincancer
AT gnadflorian applyinggraphdatabasetechnologyforanalyzingperturbedcoexpressionnetworksincancer