Cargando…

Predicting Spatial and Temporal Gene Expression Using an Integrative Model of Transcription Factor Occupancy and Chromatin State

Precise patterns of spatial and temporal gene expression are central to metazoan complexity and act as a driving force for embryonic development. While there has been substantial progress in dissecting and predicting cis-regulatory activity, our understanding of how information from multiple enhance...

Descripción completa

Detalles Bibliográficos
Autores principales: Wilczynski, Bartek, Liu, Ya-Hsin, Yeo, Zhen Xuan, Furlong, Eileen E. M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3516547/
https://www.ncbi.nlm.nih.gov/pubmed/23236268
http://dx.doi.org/10.1371/journal.pcbi.1002798
_version_ 1782252323032006656
author Wilczynski, Bartek
Liu, Ya-Hsin
Yeo, Zhen Xuan
Furlong, Eileen E. M.
author_facet Wilczynski, Bartek
Liu, Ya-Hsin
Yeo, Zhen Xuan
Furlong, Eileen E. M.
author_sort Wilczynski, Bartek
collection PubMed
description Precise patterns of spatial and temporal gene expression are central to metazoan complexity and act as a driving force for embryonic development. While there has been substantial progress in dissecting and predicting cis-regulatory activity, our understanding of how information from multiple enhancer elements converge to regulate a gene's expression remains elusive. This is in large part due to the number of different biological processes involved in mediating regulation as well as limited availability of experimental measurements for many of them. Here, we used a Bayesian approach to model diverse experimental regulatory data, leading to accurate predictions of both spatial and temporal aspects of gene expression. We integrated whole-embryo information on transcription factor recruitment to multiple cis-regulatory modules, insulator binding and histone modification status in the vicinity of individual gene loci, at a genome-wide scale during Drosophila development. The model uses Bayesian networks to represent the relation between transcription factor occupancy and enhancer activity in specific tissues and stages. All parameters are optimized in an Expectation Maximization procedure providing a model capable of predicting tissue- and stage-specific activity of new, previously unassayed genes. Performing the optimization with subsets of input data demonstrated that neither enhancer occupancy nor chromatin state alone can explain all gene expression patterns, but taken together allow for accurate predictions of spatio-temporal activity. Model predictions were validated using the expression patterns of more than 600 genes recently made available by the BDGP consortium, demonstrating an average 15-fold enrichment of genes expressed in the predicted tissue over a naïve model. We further validated the model by experimentally testing the expression of 20 predicted target genes of unknown expression, resulting in an accuracy of 95% for temporal predictions and 50% for spatial. While this is, to our knowledge, the first genome-wide approach to predict tissue-specific gene expression in metazoan development, our results suggest that integrative models of this type will become more prevalent in the future.
format Online
Article
Text
id pubmed-3516547
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35165472012-12-12 Predicting Spatial and Temporal Gene Expression Using an Integrative Model of Transcription Factor Occupancy and Chromatin State Wilczynski, Bartek Liu, Ya-Hsin Yeo, Zhen Xuan Furlong, Eileen E. M. PLoS Comput Biol Research Article Precise patterns of spatial and temporal gene expression are central to metazoan complexity and act as a driving force for embryonic development. While there has been substantial progress in dissecting and predicting cis-regulatory activity, our understanding of how information from multiple enhancer elements converge to regulate a gene's expression remains elusive. This is in large part due to the number of different biological processes involved in mediating regulation as well as limited availability of experimental measurements for many of them. Here, we used a Bayesian approach to model diverse experimental regulatory data, leading to accurate predictions of both spatial and temporal aspects of gene expression. We integrated whole-embryo information on transcription factor recruitment to multiple cis-regulatory modules, insulator binding and histone modification status in the vicinity of individual gene loci, at a genome-wide scale during Drosophila development. The model uses Bayesian networks to represent the relation between transcription factor occupancy and enhancer activity in specific tissues and stages. All parameters are optimized in an Expectation Maximization procedure providing a model capable of predicting tissue- and stage-specific activity of new, previously unassayed genes. Performing the optimization with subsets of input data demonstrated that neither enhancer occupancy nor chromatin state alone can explain all gene expression patterns, but taken together allow for accurate predictions of spatio-temporal activity. Model predictions were validated using the expression patterns of more than 600 genes recently made available by the BDGP consortium, demonstrating an average 15-fold enrichment of genes expressed in the predicted tissue over a naïve model. We further validated the model by experimentally testing the expression of 20 predicted target genes of unknown expression, resulting in an accuracy of 95% for temporal predictions and 50% for spatial. While this is, to our knowledge, the first genome-wide approach to predict tissue-specific gene expression in metazoan development, our results suggest that integrative models of this type will become more prevalent in the future. Public Library of Science 2012-12-06 /pmc/articles/PMC3516547/ /pubmed/23236268 http://dx.doi.org/10.1371/journal.pcbi.1002798 Text en © 2012 Wilczynski et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Wilczynski, Bartek
Liu, Ya-Hsin
Yeo, Zhen Xuan
Furlong, Eileen E. M.
Predicting Spatial and Temporal Gene Expression Using an Integrative Model of Transcription Factor Occupancy and Chromatin State
title Predicting Spatial and Temporal Gene Expression Using an Integrative Model of Transcription Factor Occupancy and Chromatin State
title_full Predicting Spatial and Temporal Gene Expression Using an Integrative Model of Transcription Factor Occupancy and Chromatin State
title_fullStr Predicting Spatial and Temporal Gene Expression Using an Integrative Model of Transcription Factor Occupancy and Chromatin State
title_full_unstemmed Predicting Spatial and Temporal Gene Expression Using an Integrative Model of Transcription Factor Occupancy and Chromatin State
title_short Predicting Spatial and Temporal Gene Expression Using an Integrative Model of Transcription Factor Occupancy and Chromatin State
title_sort predicting spatial and temporal gene expression using an integrative model of transcription factor occupancy and chromatin state
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3516547/
https://www.ncbi.nlm.nih.gov/pubmed/23236268
http://dx.doi.org/10.1371/journal.pcbi.1002798
work_keys_str_mv AT wilczynskibartek predictingspatialandtemporalgeneexpressionusinganintegrativemodeloftranscriptionfactoroccupancyandchromatinstate
AT liuyahsin predictingspatialandtemporalgeneexpressionusinganintegrativemodeloftranscriptionfactoroccupancyandchromatinstate
AT yeozhenxuan predictingspatialandtemporalgeneexpressionusinganintegrativemodeloftranscriptionfactoroccupancyandchromatinstate
AT furlongeileenem predictingspatialandtemporalgeneexpressionusinganintegrativemodeloftranscriptionfactoroccupancyandchromatinstate