Cargando…

Simulations of Enhancer Evolution Provide Mechanistic Insights into Gene Regulation

There is growing interest in models of regulatory sequence evolution. However, existing models specifically designed for regulatory sequences consider the independent evolution of individual transcription factor (TF)–binding sites, ignoring that the function and evolution of a binding site depends o...

Descripción completa

Detalles Bibliográficos
Autores principales: Duque, Thyago, Samee, Md. Abul Hassan, Kazemian, Majid, Pham, Hannah N., Brodsky, Michael H., Sinha, Saurabh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3879441/
https://www.ncbi.nlm.nih.gov/pubmed/24097306
http://dx.doi.org/10.1093/molbev/mst170
_version_ 1782297985807286272
author Duque, Thyago
Samee, Md. Abul Hassan
Kazemian, Majid
Pham, Hannah N.
Brodsky, Michael H.
Sinha, Saurabh
author_facet Duque, Thyago
Samee, Md. Abul Hassan
Kazemian, Majid
Pham, Hannah N.
Brodsky, Michael H.
Sinha, Saurabh
author_sort Duque, Thyago
collection PubMed
description There is growing interest in models of regulatory sequence evolution. However, existing models specifically designed for regulatory sequences consider the independent evolution of individual transcription factor (TF)–binding sites, ignoring that the function and evolution of a binding site depends on its context, typically the cis-regulatory module (CRM) in which the site is located. Moreover, existing models do not account for the gene-specific roles of TF-binding sites, primarily because their roles often are not well understood. We introduce two models of regulatory sequence evolution that address some of the shortcomings of existing models and implement simulation frameworks based on them. One model simulates the evolution of an individual binding site in the context of a CRM, while the other evolves an entire CRM. Both models use a state-of-the art sequence-to-expression model to predict the effects of mutations on the regulatory output of the CRM and determine the strength of selection. We use the new framework to simulate the evolution of TF-binding sites in 37 well-studied CRMs belonging to the anterior–posterior patterning system in Drosophila embryos. We show that these simulations provide accurate fits to evolutionary data from 12 Drosophila genomes, which includes statistics of binding site conservation on relatively short evolutionary scales and site loss across larger divergence times. The new framework allows us, for the first time, to test hypotheses regarding the underlying cis-regulatory code by directly comparing the evolutionary implications of the hypothesis with the observed evolutionary dynamics of binding sites. Using this capability, we find that explicitly modeling self-cooperative DNA binding by the TF Caudal (CAD) provides significantly better fits than an otherwise identical evolutionary simulation that lacks this mechanistic aspect. This hypothesis is further supported by a statistical analysis of the distribution of intersite spacing between adjacent CAD sites. Experimental tests confirm direct homodimeric interaction between CAD molecules as well as self-cooperative DNA binding by CAD. We note that computational modeling of the D. melanogaster CRMs alone did not yield significant evidence to support CAD self-cooperativity. We thus demonstrate how specific mechanistic details encoded in CRMs can be revealed by modeling their evolution and fitting such models to multispecies data.
format Online
Article
Text
id pubmed-3879441
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-38794412014-01-03 Simulations of Enhancer Evolution Provide Mechanistic Insights into Gene Regulation Duque, Thyago Samee, Md. Abul Hassan Kazemian, Majid Pham, Hannah N. Brodsky, Michael H. Sinha, Saurabh Mol Biol Evol Methods There is growing interest in models of regulatory sequence evolution. However, existing models specifically designed for regulatory sequences consider the independent evolution of individual transcription factor (TF)–binding sites, ignoring that the function and evolution of a binding site depends on its context, typically the cis-regulatory module (CRM) in which the site is located. Moreover, existing models do not account for the gene-specific roles of TF-binding sites, primarily because their roles often are not well understood. We introduce two models of regulatory sequence evolution that address some of the shortcomings of existing models and implement simulation frameworks based on them. One model simulates the evolution of an individual binding site in the context of a CRM, while the other evolves an entire CRM. Both models use a state-of-the art sequence-to-expression model to predict the effects of mutations on the regulatory output of the CRM and determine the strength of selection. We use the new framework to simulate the evolution of TF-binding sites in 37 well-studied CRMs belonging to the anterior–posterior patterning system in Drosophila embryos. We show that these simulations provide accurate fits to evolutionary data from 12 Drosophila genomes, which includes statistics of binding site conservation on relatively short evolutionary scales and site loss across larger divergence times. The new framework allows us, for the first time, to test hypotheses regarding the underlying cis-regulatory code by directly comparing the evolutionary implications of the hypothesis with the observed evolutionary dynamics of binding sites. Using this capability, we find that explicitly modeling self-cooperative DNA binding by the TF Caudal (CAD) provides significantly better fits than an otherwise identical evolutionary simulation that lacks this mechanistic aspect. This hypothesis is further supported by a statistical analysis of the distribution of intersite spacing between adjacent CAD sites. Experimental tests confirm direct homodimeric interaction between CAD molecules as well as self-cooperative DNA binding by CAD. We note that computational modeling of the D. melanogaster CRMs alone did not yield significant evidence to support CAD self-cooperativity. We thus demonstrate how specific mechanistic details encoded in CRMs can be revealed by modeling their evolution and fitting such models to multispecies data. Oxford University Press 2014-01 2013-10-04 /pmc/articles/PMC3879441/ /pubmed/24097306 http://dx.doi.org/10.1093/molbev/mst170 Text en © The Author 2013. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods
Duque, Thyago
Samee, Md. Abul Hassan
Kazemian, Majid
Pham, Hannah N.
Brodsky, Michael H.
Sinha, Saurabh
Simulations of Enhancer Evolution Provide Mechanistic Insights into Gene Regulation
title Simulations of Enhancer Evolution Provide Mechanistic Insights into Gene Regulation
title_full Simulations of Enhancer Evolution Provide Mechanistic Insights into Gene Regulation
title_fullStr Simulations of Enhancer Evolution Provide Mechanistic Insights into Gene Regulation
title_full_unstemmed Simulations of Enhancer Evolution Provide Mechanistic Insights into Gene Regulation
title_short Simulations of Enhancer Evolution Provide Mechanistic Insights into Gene Regulation
title_sort simulations of enhancer evolution provide mechanistic insights into gene regulation
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3879441/
https://www.ncbi.nlm.nih.gov/pubmed/24097306
http://dx.doi.org/10.1093/molbev/mst170
work_keys_str_mv AT duquethyago simulationsofenhancerevolutionprovidemechanisticinsightsintogeneregulation
AT sameemdabulhassan simulationsofenhancerevolutionprovidemechanisticinsightsintogeneregulation
AT kazemianmajid simulationsofenhancerevolutionprovidemechanisticinsightsintogeneregulation
AT phamhannahn simulationsofenhancerevolutionprovidemechanisticinsightsintogeneregulation
AT brodskymichaelh simulationsofenhancerevolutionprovidemechanisticinsightsintogeneregulation
AT sinhasaurabh simulationsofenhancerevolutionprovidemechanisticinsightsintogeneregulation