Cargando…

Integrating motif, DNA accessibility and gene expression data to build regulatory maps in an organism

Characterization of cell type specific regulatory networks and elements is a major challenge in genomics, and emerging strategies frequently employ high-throughput genome-wide assays of transcription factor (TF) to DNA binding, histone modifications or chromatin state. However, these experiments rem...

Descripción completa

Detalles Bibliográficos
Autores principales: Blatti, Charles, Kazemian, Majid, Wolfe, Scot, Brodsky, Michael, Sinha, Saurabh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417154/
https://www.ncbi.nlm.nih.gov/pubmed/25791631
http://dx.doi.org/10.1093/nar/gkv195
_version_ 1782369317789106176
author Blatti, Charles
Kazemian, Majid
Wolfe, Scot
Brodsky, Michael
Sinha, Saurabh
author_facet Blatti, Charles
Kazemian, Majid
Wolfe, Scot
Brodsky, Michael
Sinha, Saurabh
author_sort Blatti, Charles
collection PubMed
description Characterization of cell type specific regulatory networks and elements is a major challenge in genomics, and emerging strategies frequently employ high-throughput genome-wide assays of transcription factor (TF) to DNA binding, histone modifications or chromatin state. However, these experiments remain too difficult/expensive for many laboratories to apply comprehensively to their system of interest. Here, we explore the potential of elucidating regulatory systems in varied cell types using computational techniques that rely on only data of gene expression, low-resolution chromatin accessibility, and TF–DNA binding specificities (‘motifs’). We show that static computational motif scans overlaid with chromatin accessibility data reasonably approximate experimentally measured TF–DNA binding. We demonstrate that predicted binding profiles and expression patterns of hundreds of TFs are sufficient to identify major regulators of ∼200 spatiotemporal expression domains in the Drosophila embryo. We are then able to learn reliable statistical models of enhancer activity for over 70 expression domains and apply those models to annotate domain specific enhancers genome-wide. Throughout this work, we apply our motif and accessibility based approach to comprehensively characterize the regulatory network of fruitfly embryonic development and show that the accuracy of our computational method compares favorably to approaches that rely on data from many experimental assays.
format Online
Article
Text
id pubmed-4417154
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-44171542015-06-02 Integrating motif, DNA accessibility and gene expression data to build regulatory maps in an organism Blatti, Charles Kazemian, Majid Wolfe, Scot Brodsky, Michael Sinha, Saurabh Nucleic Acids Res Gene regulation, Chromatin and Epigenetics Characterization of cell type specific regulatory networks and elements is a major challenge in genomics, and emerging strategies frequently employ high-throughput genome-wide assays of transcription factor (TF) to DNA binding, histone modifications or chromatin state. However, these experiments remain too difficult/expensive for many laboratories to apply comprehensively to their system of interest. Here, we explore the potential of elucidating regulatory systems in varied cell types using computational techniques that rely on only data of gene expression, low-resolution chromatin accessibility, and TF–DNA binding specificities (‘motifs’). We show that static computational motif scans overlaid with chromatin accessibility data reasonably approximate experimentally measured TF–DNA binding. We demonstrate that predicted binding profiles and expression patterns of hundreds of TFs are sufficient to identify major regulators of ∼200 spatiotemporal expression domains in the Drosophila embryo. We are then able to learn reliable statistical models of enhancer activity for over 70 expression domains and apply those models to annotate domain specific enhancers genome-wide. Throughout this work, we apply our motif and accessibility based approach to comprehensively characterize the regulatory network of fruitfly embryonic development and show that the accuracy of our computational method compares favorably to approaches that rely on data from many experimental assays. Oxford University Press 2015-04-30 2015-03-19 /pmc/articles/PMC4417154/ /pubmed/25791631 http://dx.doi.org/10.1093/nar/gkv195 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Gene regulation, Chromatin and Epigenetics
Blatti, Charles
Kazemian, Majid
Wolfe, Scot
Brodsky, Michael
Sinha, Saurabh
Integrating motif, DNA accessibility and gene expression data to build regulatory maps in an organism
title Integrating motif, DNA accessibility and gene expression data to build regulatory maps in an organism
title_full Integrating motif, DNA accessibility and gene expression data to build regulatory maps in an organism
title_fullStr Integrating motif, DNA accessibility and gene expression data to build regulatory maps in an organism
title_full_unstemmed Integrating motif, DNA accessibility and gene expression data to build regulatory maps in an organism
title_short Integrating motif, DNA accessibility and gene expression data to build regulatory maps in an organism
title_sort integrating motif, dna accessibility and gene expression data to build regulatory maps in an organism
topic Gene regulation, Chromatin and Epigenetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4417154/
https://www.ncbi.nlm.nih.gov/pubmed/25791631
http://dx.doi.org/10.1093/nar/gkv195
work_keys_str_mv AT blatticharles integratingmotifdnaaccessibilityandgeneexpressiondatatobuildregulatorymapsinanorganism
AT kazemianmajid integratingmotifdnaaccessibilityandgeneexpressiondatatobuildregulatorymapsinanorganism
AT wolfescot integratingmotifdnaaccessibilityandgeneexpressiondatatobuildregulatorymapsinanorganism
AT brodskymichael integratingmotifdnaaccessibilityandgeneexpressiondatatobuildregulatorymapsinanorganism
AT sinhasaurabh integratingmotifdnaaccessibilityandgeneexpressiondatatobuildregulatorymapsinanorganism