Cargando…

Factor analysis for gene regulatory networks and transcription factor activity profiles

BACKGROUND: Most existing algorithms for the inference of the structure of gene regulatory networks from gene expression data assume that the activity levels of transcription factors (TFs) are proportional to their mRNA levels. This assumption is invalid for most biological systems. However, one mig...

Descripción completa

Detalles Bibliográficos
Autores principales:	Pournara, Iosifina, Wernisch, Lorenz
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2007
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1821042/ https://www.ncbi.nlm.nih.gov/pubmed/17319944 http://dx.doi.org/10.1186/1471-2105-8-61

_version_	1782132675142746112
author	Pournara, Iosifina Wernisch, Lorenz
author_facet	Pournara, Iosifina Wernisch, Lorenz
author_sort	Pournara, Iosifina
collection	PubMed
description	BACKGROUND: Most existing algorithms for the inference of the structure of gene regulatory networks from gene expression data assume that the activity levels of transcription factors (TFs) are proportional to their mRNA levels. This assumption is invalid for most biological systems. However, one might be able to reconstruct unobserved activity profiles of TFs from the expression profiles of target genes. A simple model is a two-layer network with unobserved TF variables in the first layer and observed gene expression variables in the second layer. TFs are connected to regulated genes by weighted edges. The weights, known as factor loadings, indicate the strength and direction of regulation. Of particular interest are methods that produce sparse networks, networks with few edges, since it is known that most genes are regulated by only a small number of TFs, and most TFs regulate only a small number of genes. RESULTS: In this paper, we explore the performance of five factor analysis algorithms, Bayesian as well as classical, on problems with biological context using both simulated and real data. Factor analysis (FA) models are used in order to describe a larger number of observed variables by a smaller number of unobserved variables, the factors, whereby all correlation between observed variables is explained by common factors. Bayesian FA methods allow one to infer sparse networks by enforcing sparsity through priors. In contrast, in the classical FA, matrix rotation methods are used to enforce sparsity and thus to increase the interpretability of the inferred factor loadings matrix. However, we also show that Bayesian FA models that do not impose sparsity through the priors can still be used for the reconstruction of a gene regulatory network if applied in conjunction with matrix rotation methods. Finally, we show the added advantage of merging the information derived from all algorithms in order to obtain a combined result. CONCLUSION: Most of the algorithms tested are successful in reconstructing the connectivity structure as well as the TF profiles. Moreover, we demonstrate that if the underlying network is sparse it is still possible to reconstruct hidden activity profiles of TFs to some degree without prior connectivity information.
format	Text
id	pubmed-1821042
institution	National Center for Biotechnology Information
language	English
publishDate	2007
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-18210422007-03-14 Factor analysis for gene regulatory networks and transcription factor activity profiles Pournara, Iosifina Wernisch, Lorenz BMC Bioinformatics Research Article BACKGROUND: Most existing algorithms for the inference of the structure of gene regulatory networks from gene expression data assume that the activity levels of transcription factors (TFs) are proportional to their mRNA levels. This assumption is invalid for most biological systems. However, one might be able to reconstruct unobserved activity profiles of TFs from the expression profiles of target genes. A simple model is a two-layer network with unobserved TF variables in the first layer and observed gene expression variables in the second layer. TFs are connected to regulated genes by weighted edges. The weights, known as factor loadings, indicate the strength and direction of regulation. Of particular interest are methods that produce sparse networks, networks with few edges, since it is known that most genes are regulated by only a small number of TFs, and most TFs regulate only a small number of genes. RESULTS: In this paper, we explore the performance of five factor analysis algorithms, Bayesian as well as classical, on problems with biological context using both simulated and real data. Factor analysis (FA) models are used in order to describe a larger number of observed variables by a smaller number of unobserved variables, the factors, whereby all correlation between observed variables is explained by common factors. Bayesian FA methods allow one to infer sparse networks by enforcing sparsity through priors. In contrast, in the classical FA, matrix rotation methods are used to enforce sparsity and thus to increase the interpretability of the inferred factor loadings matrix. However, we also show that Bayesian FA models that do not impose sparsity through the priors can still be used for the reconstruction of a gene regulatory network if applied in conjunction with matrix rotation methods. Finally, we show the added advantage of merging the information derived from all algorithms in order to obtain a combined result. CONCLUSION: Most of the algorithms tested are successful in reconstructing the connectivity structure as well as the TF profiles. Moreover, we demonstrate that if the underlying network is sparse it is still possible to reconstruct hidden activity profiles of TFs to some degree without prior connectivity information. BioMed Central 2007-02-23 /pmc/articles/PMC1821042/ /pubmed/17319944 http://dx.doi.org/10.1186/1471-2105-8-61 Text en Copyright © 2007 Pournara and Wernisch; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Pournara, Iosifina Wernisch, Lorenz Factor analysis for gene regulatory networks and transcription factor activity profiles
title	Factor analysis for gene regulatory networks and transcription factor activity profiles
title_full	Factor analysis for gene regulatory networks and transcription factor activity profiles
title_fullStr	Factor analysis for gene regulatory networks and transcription factor activity profiles
title_full_unstemmed	Factor analysis for gene regulatory networks and transcription factor activity profiles
title_short	Factor analysis for gene regulatory networks and transcription factor activity profiles
title_sort	factor analysis for gene regulatory networks and transcription factor activity profiles
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1821042/ https://www.ncbi.nlm.nih.gov/pubmed/17319944 http://dx.doi.org/10.1186/1471-2105-8-61
work_keys_str_mv	AT pournaraiosifina factoranalysisforgeneregulatorynetworksandtranscriptionfactoractivityprofiles AT wernischlorenz factoranalysisforgeneregulatorynetworksandtranscriptionfactoractivityprofiles

Factor analysis for gene regulatory networks and transcription factor activity profiles

Ejemplares similares