Cargando…

ConBind: motif-aware cross-species alignment for the identification of functional transcription factor binding sites

Eukaryotic gene expression is regulated by transcription factors (TFs) binding to promoter as well as distal enhancers. TFs recognize short, but specific binding sites (TFBSs) that are located within the promoter and enhancer regions. Functionally relevant TFBSs are often highly conserved during evo...

Descripción completa

Detalles Bibliográficos
Autores principales: Lelieveld, Stefan H., Schütte, Judith, Dijkstra, Maurits J.J., Bawono, Punto, Kinston, Sarah J., Göttgens, Berthold, Heringa, Jaap, Bonzanni, Nicola
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4856970/
https://www.ncbi.nlm.nih.gov/pubmed/26721389
http://dx.doi.org/10.1093/nar/gkv1518
_version_ 1782430574088028160
author Lelieveld, Stefan H.
Schütte, Judith
Dijkstra, Maurits J.J.
Bawono, Punto
Kinston, Sarah J.
Göttgens, Berthold
Heringa, Jaap
Bonzanni, Nicola
author_facet Lelieveld, Stefan H.
Schütte, Judith
Dijkstra, Maurits J.J.
Bawono, Punto
Kinston, Sarah J.
Göttgens, Berthold
Heringa, Jaap
Bonzanni, Nicola
author_sort Lelieveld, Stefan H.
collection PubMed
description Eukaryotic gene expression is regulated by transcription factors (TFs) binding to promoter as well as distal enhancers. TFs recognize short, but specific binding sites (TFBSs) that are located within the promoter and enhancer regions. Functionally relevant TFBSs are often highly conserved during evolution leaving a strong phylogenetic signal. While multiple sequence alignment (MSA) is a potent tool to detect the phylogenetic signal, the current MSA implementations are optimized to align the maximum number of identical nucleotides. This approach might result in the omission of conserved motifs that contain interchangeable nucleotides such as the ETS motif (IUPAC code: GGAW). Here, we introduce ConBind, a novel method to enhance alignment of short motifs, even if their mutual sequence similarity is only partial. ConBind improves the identification of conserved TFBSs by improving the alignment accuracy of TFBS families within orthologous DNA sequences. Functional validation of the Gfi1b + 13 enhancer reveals that ConBind identifies additional functionally important ETS binding sites that were missed by all other tested alignment tools. In addition to the analysis of known regulatory regions, our web tool is useful for the analysis of TFBSs on so far unknown DNA regions identified through ChIP-sequencing.
format Online
Article
Text
id pubmed-4856970
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-48569702016-05-09 ConBind: motif-aware cross-species alignment for the identification of functional transcription factor binding sites Lelieveld, Stefan H. Schütte, Judith Dijkstra, Maurits J.J. Bawono, Punto Kinston, Sarah J. Göttgens, Berthold Heringa, Jaap Bonzanni, Nicola Nucleic Acids Res Methods Online Eukaryotic gene expression is regulated by transcription factors (TFs) binding to promoter as well as distal enhancers. TFs recognize short, but specific binding sites (TFBSs) that are located within the promoter and enhancer regions. Functionally relevant TFBSs are often highly conserved during evolution leaving a strong phylogenetic signal. While multiple sequence alignment (MSA) is a potent tool to detect the phylogenetic signal, the current MSA implementations are optimized to align the maximum number of identical nucleotides. This approach might result in the omission of conserved motifs that contain interchangeable nucleotides such as the ETS motif (IUPAC code: GGAW). Here, we introduce ConBind, a novel method to enhance alignment of short motifs, even if their mutual sequence similarity is only partial. ConBind improves the identification of conserved TFBSs by improving the alignment accuracy of TFBS families within orthologous DNA sequences. Functional validation of the Gfi1b + 13 enhancer reveals that ConBind identifies additional functionally important ETS binding sites that were missed by all other tested alignment tools. In addition to the analysis of known regulatory regions, our web tool is useful for the analysis of TFBSs on so far unknown DNA regions identified through ChIP-sequencing. Oxford University Press 2016-05-05 2015-12-31 /pmc/articles/PMC4856970/ /pubmed/26721389 http://dx.doi.org/10.1093/nar/gkv1518 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Lelieveld, Stefan H.
Schütte, Judith
Dijkstra, Maurits J.J.
Bawono, Punto
Kinston, Sarah J.
Göttgens, Berthold
Heringa, Jaap
Bonzanni, Nicola
ConBind: motif-aware cross-species alignment for the identification of functional transcription factor binding sites
title ConBind: motif-aware cross-species alignment for the identification of functional transcription factor binding sites
title_full ConBind: motif-aware cross-species alignment for the identification of functional transcription factor binding sites
title_fullStr ConBind: motif-aware cross-species alignment for the identification of functional transcription factor binding sites
title_full_unstemmed ConBind: motif-aware cross-species alignment for the identification of functional transcription factor binding sites
title_short ConBind: motif-aware cross-species alignment for the identification of functional transcription factor binding sites
title_sort conbind: motif-aware cross-species alignment for the identification of functional transcription factor binding sites
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4856970/
https://www.ncbi.nlm.nih.gov/pubmed/26721389
http://dx.doi.org/10.1093/nar/gkv1518
work_keys_str_mv AT lelieveldstefanh conbindmotifawarecrossspeciesalignmentfortheidentificationoffunctionaltranscriptionfactorbindingsites
AT schuttejudith conbindmotifawarecrossspeciesalignmentfortheidentificationoffunctionaltranscriptionfactorbindingsites
AT dijkstramauritsjj conbindmotifawarecrossspeciesalignmentfortheidentificationoffunctionaltranscriptionfactorbindingsites
AT bawonopunto conbindmotifawarecrossspeciesalignmentfortheidentificationoffunctionaltranscriptionfactorbindingsites
AT kinstonsarahj conbindmotifawarecrossspeciesalignmentfortheidentificationoffunctionaltranscriptionfactorbindingsites
AT gottgensberthold conbindmotifawarecrossspeciesalignmentfortheidentificationoffunctionaltranscriptionfactorbindingsites
AT heringajaap conbindmotifawarecrossspeciesalignmentfortheidentificationoffunctionaltranscriptionfactorbindingsites
AT bonzanninicola conbindmotifawarecrossspeciesalignmentfortheidentificationoffunctionaltranscriptionfactorbindingsites