Cargando…

Latent network-based representations for large-scale gene expression data analysis

BACKGROUND: With the recent advancements in high-throughput experimental procedures, biologists are gathering huge quantities of data. A main priority in bioinformatics and computational biology is to provide system level analytical tools capable of meeting an ever-growing production of high-through...

Descripción completa

Detalles Bibliográficos
Autores principales: Dhifli, Wajdi, Puig, Julia, Dispot, Aurélien, Elati, Mohamed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7394327/
https://www.ncbi.nlm.nih.gov/pubmed/30717663
http://dx.doi.org/10.1186/s12859-018-2481-y
_version_ 1783565210079985664
author Dhifli, Wajdi
Puig, Julia
Dispot, Aurélien
Elati, Mohamed
author_facet Dhifli, Wajdi
Puig, Julia
Dispot, Aurélien
Elati, Mohamed
author_sort Dhifli, Wajdi
collection PubMed
description BACKGROUND: With the recent advancements in high-throughput experimental procedures, biologists are gathering huge quantities of data. A main priority in bioinformatics and computational biology is to provide system level analytical tools capable of meeting an ever-growing production of high-throughput biological data while taking into account its biological context. In gene expression data analysis, genes have widely been considered as independent components. However, a systemic view shows that they act synergistically in living cells, forming functional complexes and more generally a biological system. RESULTS: In this paper, we propose LatNet, a signal transformation framework that, starting from an initial large-scale gene expression data, allows to generate new representations based on latent network-based relationships between the genes. LatNet aims to leverage system level relations between the genes as an underlying hidden structure to derive the new transformed latent signals. We present a concrete implementation of our framework, based on a gene regulatory network structure and two signal transformation approaches, to quantify latent network-based activity of regulators, as well as gene perturbation signals. The new gene/regulator signals are at the level of each sample of the input data and, thus, could directly be used instead of the initial expression signals for major bioinformatics analysis, including diagnosis and personalized medicine. CONCLUSION: Multiple patterns could be hidden or weakly observed in expression data. LatNet helps in uncovering latent signals that could emphasize hidden patterns based on the relations between the genes and, thus, enhancing the performance of gene expression-based analysis algorithms. We use LatNet for the analysis of real-world gene expression data of bladder cancer and we show the efficiency of our transformation framework as compared to using the initial expression data.
format Online
Article
Text
id pubmed-7394327
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73943272020-08-05 Latent network-based representations for large-scale gene expression data analysis Dhifli, Wajdi Puig, Julia Dispot, Aurélien Elati, Mohamed BMC Bioinformatics Research BACKGROUND: With the recent advancements in high-throughput experimental procedures, biologists are gathering huge quantities of data. A main priority in bioinformatics and computational biology is to provide system level analytical tools capable of meeting an ever-growing production of high-throughput biological data while taking into account its biological context. In gene expression data analysis, genes have widely been considered as independent components. However, a systemic view shows that they act synergistically in living cells, forming functional complexes and more generally a biological system. RESULTS: In this paper, we propose LatNet, a signal transformation framework that, starting from an initial large-scale gene expression data, allows to generate new representations based on latent network-based relationships between the genes. LatNet aims to leverage system level relations between the genes as an underlying hidden structure to derive the new transformed latent signals. We present a concrete implementation of our framework, based on a gene regulatory network structure and two signal transformation approaches, to quantify latent network-based activity of regulators, as well as gene perturbation signals. The new gene/regulator signals are at the level of each sample of the input data and, thus, could directly be used instead of the initial expression signals for major bioinformatics analysis, including diagnosis and personalized medicine. CONCLUSION: Multiple patterns could be hidden or weakly observed in expression data. LatNet helps in uncovering latent signals that could emphasize hidden patterns based on the relations between the genes and, thus, enhancing the performance of gene expression-based analysis algorithms. We use LatNet for the analysis of real-world gene expression data of bladder cancer and we show the efficiency of our transformation framework as compared to using the initial expression data. BioMed Central 2019-02-04 /pmc/articles/PMC7394327/ /pubmed/30717663 http://dx.doi.org/10.1186/s12859-018-2481-y Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Dhifli, Wajdi
Puig, Julia
Dispot, Aurélien
Elati, Mohamed
Latent network-based representations for large-scale gene expression data analysis
title Latent network-based representations for large-scale gene expression data analysis
title_full Latent network-based representations for large-scale gene expression data analysis
title_fullStr Latent network-based representations for large-scale gene expression data analysis
title_full_unstemmed Latent network-based representations for large-scale gene expression data analysis
title_short Latent network-based representations for large-scale gene expression data analysis
title_sort latent network-based representations for large-scale gene expression data analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7394327/
https://www.ncbi.nlm.nih.gov/pubmed/30717663
http://dx.doi.org/10.1186/s12859-018-2481-y
work_keys_str_mv AT dhifliwajdi latentnetworkbasedrepresentationsforlargescalegeneexpressiondataanalysis
AT puigjulia latentnetworkbasedrepresentationsforlargescalegeneexpressiondataanalysis
AT dispotaurelien latentnetworkbasedrepresentationsforlargescalegeneexpressiondataanalysis
AT elatimohamed latentnetworkbasedrepresentationsforlargescalegeneexpressiondataanalysis