Cargando…

Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome

A long-standing objective in modern biology is to characterize the molecular components that drive the development of an organism. At the heart of eukaryotic development lies gene regulation. On the molecular level, much of the research in this field has focused on the binding of transcription facto...

Descripción completa

Detalles Bibliográficos
Autores principales: Dresch, Jacqueline M., Zellers, Rowan G., Bork, Daniel K., Drewell, Robert A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4907338/
https://www.ncbi.nlm.nih.gov/pubmed/27330274
http://dx.doi.org/10.4137/GRSB.S38462
_version_ 1782437530059145216
author Dresch, Jacqueline M.
Zellers, Rowan G.
Bork, Daniel K.
Drewell, Robert A.
author_facet Dresch, Jacqueline M.
Zellers, Rowan G.
Bork, Daniel K.
Drewell, Robert A.
author_sort Dresch, Jacqueline M.
collection PubMed
description A long-standing objective in modern biology is to characterize the molecular components that drive the development of an organism. At the heart of eukaryotic development lies gene regulation. On the molecular level, much of the research in this field has focused on the binding of transcription factors (TFs) to regulatory regions in the genome known as cis-regulatory modules (CRMs). However, relatively little is known about the sequence-specific binding preferences of many TFs, especially with respect to the possible interdependencies between the nucleotides that make up binding sites. A particular limitation of many existing algorithms that aim to predict binding site sequences is that they do not allow for dependencies between nonadjacent nucleotides. In this study, we use a recently developed computational algorithm, MARZ, to compare binding site sequences using 32 distinct models in a systematic and unbiased approach to explore nucleotide dependencies within binding sites for 15 distinct TFs known to be critical to Drosophila development. Our results indicate that many of these proteins have varying levels of nucleotide interdependencies within their DNA recognition sequences, and that, in some cases, models that account for these dependencies greatly outperform traditional models that are used to predict binding sites. We also directly compare the ability of different models to identify the known KRUPPEL TF binding sites in CRMs and demonstrate that a more complex model that accounts for nucleotide interdependencies performs better when compared with simple models. This ability to identify TFs with critical nucleotide interdependencies in their binding sites will lead to a deeper understanding of how these molecular characteristics contribute to the architecture of CRMs and the precise regulation of transcription during organismal development.
format Online
Article
Text
id pubmed-4907338
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-49073382016-06-17 Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome Dresch, Jacqueline M. Zellers, Rowan G. Bork, Daniel K. Drewell, Robert A. Gene Regul Syst Bio Original Research A long-standing objective in modern biology is to characterize the molecular components that drive the development of an organism. At the heart of eukaryotic development lies gene regulation. On the molecular level, much of the research in this field has focused on the binding of transcription factors (TFs) to regulatory regions in the genome known as cis-regulatory modules (CRMs). However, relatively little is known about the sequence-specific binding preferences of many TFs, especially with respect to the possible interdependencies between the nucleotides that make up binding sites. A particular limitation of many existing algorithms that aim to predict binding site sequences is that they do not allow for dependencies between nonadjacent nucleotides. In this study, we use a recently developed computational algorithm, MARZ, to compare binding site sequences using 32 distinct models in a systematic and unbiased approach to explore nucleotide dependencies within binding sites for 15 distinct TFs known to be critical to Drosophila development. Our results indicate that many of these proteins have varying levels of nucleotide interdependencies within their DNA recognition sequences, and that, in some cases, models that account for these dependencies greatly outperform traditional models that are used to predict binding sites. We also directly compare the ability of different models to identify the known KRUPPEL TF binding sites in CRMs and demonstrate that a more complex model that accounts for nucleotide interdependencies performs better when compared with simple models. This ability to identify TFs with critical nucleotide interdependencies in their binding sites will lead to a deeper understanding of how these molecular characteristics contribute to the architecture of CRMs and the precise regulation of transcription during organismal development. Libertas Academica 2016-06-12 /pmc/articles/PMC4907338/ /pubmed/27330274 http://dx.doi.org/10.4137/GRSB.S38462 Text en © 2016 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 License.
spellingShingle Original Research
Dresch, Jacqueline M.
Zellers, Rowan G.
Bork, Daniel K.
Drewell, Robert A.
Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome
title Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome
title_full Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome
title_fullStr Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome
title_full_unstemmed Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome
title_short Nucleotide Interdependency in Transcription Factor Binding Sites in the Drosophila Genome
title_sort nucleotide interdependency in transcription factor binding sites in the drosophila genome
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4907338/
https://www.ncbi.nlm.nih.gov/pubmed/27330274
http://dx.doi.org/10.4137/GRSB.S38462
work_keys_str_mv AT dreschjacquelinem nucleotideinterdependencyintranscriptionfactorbindingsitesinthedrosophilagenome
AT zellersrowang nucleotideinterdependencyintranscriptionfactorbindingsitesinthedrosophilagenome
AT borkdanielk nucleotideinterdependencyintranscriptionfactorbindingsitesinthedrosophilagenome
AT drewellroberta nucleotideinterdependencyintranscriptionfactorbindingsitesinthedrosophilagenome