Cargando…

Molecular and structural considerations of TF-DNA binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the LysR-type transcriptional regulator family as a study model

BACKGROUND: The goal of most programs developed to find transcription factor binding sites (TFBSs) is the identification of discrete sequence motifs that are significantly over-represented in a given set of sequences where a transcription factor (TF) is expected to bind. These programs assume that t...

Descripción completa

Detalles Bibliográficos
Autores principales: Oliver, Patricia, Peralta-Gil, Martín, Tabche, María-Luisa, Merino, Enrique
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5002191/
https://www.ncbi.nlm.nih.gov/pubmed/27567672
http://dx.doi.org/10.1186/s12864-016-3025-3
_version_ 1782450534966362112
author Oliver, Patricia
Peralta-Gil, Martín
Tabche, María-Luisa
Merino, Enrique
author_facet Oliver, Patricia
Peralta-Gil, Martín
Tabche, María-Luisa
Merino, Enrique
author_sort Oliver, Patricia
collection PubMed
description BACKGROUND: The goal of most programs developed to find transcription factor binding sites (TFBSs) is the identification of discrete sequence motifs that are significantly over-represented in a given set of sequences where a transcription factor (TF) is expected to bind. These programs assume that the nucleotide conservation of a specific motif is indicative of a selective pressure required for the recognition of a TF for its corresponding TFBS. Despite their extensive use, the accuracies reached with these programs remain low. In many cases, true TFBSs are excluded from the identification process, especially when they correspond to low-affinity but important binding sites of regulatory systems. RESULTS: We developed a computational protocol based on molecular and structural criteria to perform biologically meaningful and accurate phylogenetic footprinting analyses. Our protocol considers fundamental aspects of the TF-DNA binding process, such as: i) the active homodimeric conformations of TFs that impose symmetric structures on the TFBSs, ii) the cooperative binding of TFs, iii) the effects of the presence or absence of co-inducers, iv) the proximity between two TFBSs or one TFBS and a promoter that leads to very long spurious motifs, v) the presence of AT-rich sequences not recognized by the TF but that are required for DNA flexibility, and vi) the dynamic order in which the different binding events take place to determine a regulatory response (i.e., activation or repression). In our protocol, the abovementioned criteria were used to analyze a profile of consensus motifs generated from canonical Phylogenetic Footprinting Analyses using a set of analysis windows of incremental sizes. To evaluate the performance of our protocol, we analyzed six members of the LysR-type TF family in Gammaproteobacteria. CONCLUSIONS: The identification of TFBSs based exclusively on the significance of the over-representation of motifs in a set of sequences might lead to inaccurate results. The consideration of different molecular and structural properties of the regulatory systems benefits the identification of TFBSs and enables the development of elaborate, biologically meaningful and precise regulatory models that offer a more integrated view of the dynamics of the regulatory process of transcription. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3025-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5002191
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50021912016-08-28 Molecular and structural considerations of TF-DNA binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the LysR-type transcriptional regulator family as a study model Oliver, Patricia Peralta-Gil, Martín Tabche, María-Luisa Merino, Enrique BMC Genomics Methodology Article BACKGROUND: The goal of most programs developed to find transcription factor binding sites (TFBSs) is the identification of discrete sequence motifs that are significantly over-represented in a given set of sequences where a transcription factor (TF) is expected to bind. These programs assume that the nucleotide conservation of a specific motif is indicative of a selective pressure required for the recognition of a TF for its corresponding TFBS. Despite their extensive use, the accuracies reached with these programs remain low. In many cases, true TFBSs are excluded from the identification process, especially when they correspond to low-affinity but important binding sites of regulatory systems. RESULTS: We developed a computational protocol based on molecular and structural criteria to perform biologically meaningful and accurate phylogenetic footprinting analyses. Our protocol considers fundamental aspects of the TF-DNA binding process, such as: i) the active homodimeric conformations of TFs that impose symmetric structures on the TFBSs, ii) the cooperative binding of TFs, iii) the effects of the presence or absence of co-inducers, iv) the proximity between two TFBSs or one TFBS and a promoter that leads to very long spurious motifs, v) the presence of AT-rich sequences not recognized by the TF but that are required for DNA flexibility, and vi) the dynamic order in which the different binding events take place to determine a regulatory response (i.e., activation or repression). In our protocol, the abovementioned criteria were used to analyze a profile of consensus motifs generated from canonical Phylogenetic Footprinting Analyses using a set of analysis windows of incremental sizes. To evaluate the performance of our protocol, we analyzed six members of the LysR-type TF family in Gammaproteobacteria. CONCLUSIONS: The identification of TFBSs based exclusively on the significance of the over-representation of motifs in a set of sequences might lead to inaccurate results. The consideration of different molecular and structural properties of the regulatory systems benefits the identification of TFBSs and enables the development of elaborate, biologically meaningful and precise regulatory models that offer a more integrated view of the dynamics of the regulatory process of transcription. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3025-3) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-27 /pmc/articles/PMC5002191/ /pubmed/27567672 http://dx.doi.org/10.1186/s12864-016-3025-3 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Oliver, Patricia
Peralta-Gil, Martín
Tabche, María-Luisa
Merino, Enrique
Molecular and structural considerations of TF-DNA binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the LysR-type transcriptional regulator family as a study model
title Molecular and structural considerations of TF-DNA binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the LysR-type transcriptional regulator family as a study model
title_full Molecular and structural considerations of TF-DNA binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the LysR-type transcriptional regulator family as a study model
title_fullStr Molecular and structural considerations of TF-DNA binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the LysR-type transcriptional regulator family as a study model
title_full_unstemmed Molecular and structural considerations of TF-DNA binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the LysR-type transcriptional regulator family as a study model
title_short Molecular and structural considerations of TF-DNA binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the LysR-type transcriptional regulator family as a study model
title_sort molecular and structural considerations of tf-dna binding for the generation of biologically meaningful and accurate phylogenetic footprinting analysis: the lysr-type transcriptional regulator family as a study model
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5002191/
https://www.ncbi.nlm.nih.gov/pubmed/27567672
http://dx.doi.org/10.1186/s12864-016-3025-3
work_keys_str_mv AT oliverpatricia molecularandstructuralconsiderationsoftfdnabindingforthegenerationofbiologicallymeaningfulandaccuratephylogeneticfootprintinganalysisthelysrtypetranscriptionalregulatorfamilyasastudymodel
AT peraltagilmartin molecularandstructuralconsiderationsoftfdnabindingforthegenerationofbiologicallymeaningfulandaccuratephylogeneticfootprintinganalysisthelysrtypetranscriptionalregulatorfamilyasastudymodel
AT tabchemarialuisa molecularandstructuralconsiderationsoftfdnabindingforthegenerationofbiologicallymeaningfulandaccuratephylogeneticfootprintinganalysisthelysrtypetranscriptionalregulatorfamilyasastudymodel
AT merinoenrique molecularandstructuralconsiderationsoftfdnabindingforthegenerationofbiologicallymeaningfulandaccuratephylogeneticfootprintinganalysisthelysrtypetranscriptionalregulatorfamilyasastudymodel