Cargando…

Phylogeny based discovery of regulatory elements

BACKGROUND: Algorithms that locate evolutionarily conserved sequences have become powerful tools for finding functional DNA elements, including transcription factor binding sites; however, most methods do not take advantage of an explicit model for the constrained evolution of functional DNA sequenc...

Descripción completa

Detalles Bibliográficos
Autores principales: Gertz, Jason, Fay, Justin C, Cohen, Barak A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1525002/
https://www.ncbi.nlm.nih.gov/pubmed/16716228
http://dx.doi.org/10.1186/1471-2105-7-266
_version_ 1782128876610125824
author Gertz, Jason
Fay, Justin C
Cohen, Barak A
author_facet Gertz, Jason
Fay, Justin C
Cohen, Barak A
author_sort Gertz, Jason
collection PubMed
description BACKGROUND: Algorithms that locate evolutionarily conserved sequences have become powerful tools for finding functional DNA elements, including transcription factor binding sites; however, most methods do not take advantage of an explicit model for the constrained evolution of functional DNA sequences. RESULTS: We developed a probabilistic framework that combines an HKY85 model, which assigns probabilities to different base substitutions between species, and weight matrix models of transcription factor binding sites, which describe the probabilities of observing particular nucleotides at specific positions in the binding site. The method incorporates the phylogenies of the species under consideration and takes into account the position specific variation of transcription factor binding sites. Using our framework we assessed the suitability of alignments of genomic sequences from commonly used species as substrates for comparative genomic approaches to regulatory motif finding. We then applied this technique to Saccharomyces cerevisiae and related species by examining all possible six base pair DNA sequences (hexamers) and identifying sequences that are conserved in a significant number of promoters. By combining similar conserved hexamers we reconstructed known cis-regulatory motifs and made predictions of previously unidentified motifs. We tested one prediction experimentally, finding it to be a regulatory element involved in the transcriptional response to glucose. CONCLUSION: The experimental validation of a regulatory element prediction missed by other large-scale motif finding studies demonstrates that our approach is a useful addition to the current suite of tools for finding regulatory motifs.
format Text
id pubmed-1525002
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15250022006-08-01 Phylogeny based discovery of regulatory elements Gertz, Jason Fay, Justin C Cohen, Barak A BMC Bioinformatics Research Article BACKGROUND: Algorithms that locate evolutionarily conserved sequences have become powerful tools for finding functional DNA elements, including transcription factor binding sites; however, most methods do not take advantage of an explicit model for the constrained evolution of functional DNA sequences. RESULTS: We developed a probabilistic framework that combines an HKY85 model, which assigns probabilities to different base substitutions between species, and weight matrix models of transcription factor binding sites, which describe the probabilities of observing particular nucleotides at specific positions in the binding site. The method incorporates the phylogenies of the species under consideration and takes into account the position specific variation of transcription factor binding sites. Using our framework we assessed the suitability of alignments of genomic sequences from commonly used species as substrates for comparative genomic approaches to regulatory motif finding. We then applied this technique to Saccharomyces cerevisiae and related species by examining all possible six base pair DNA sequences (hexamers) and identifying sequences that are conserved in a significant number of promoters. By combining similar conserved hexamers we reconstructed known cis-regulatory motifs and made predictions of previously unidentified motifs. We tested one prediction experimentally, finding it to be a regulatory element involved in the transcriptional response to glucose. CONCLUSION: The experimental validation of a regulatory element prediction missed by other large-scale motif finding studies demonstrates that our approach is a useful addition to the current suite of tools for finding regulatory motifs. BioMed Central 2006-05-22 /pmc/articles/PMC1525002/ /pubmed/16716228 http://dx.doi.org/10.1186/1471-2105-7-266 Text en Copyright © 2006 Gertz et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Gertz, Jason
Fay, Justin C
Cohen, Barak A
Phylogeny based discovery of regulatory elements
title Phylogeny based discovery of regulatory elements
title_full Phylogeny based discovery of regulatory elements
title_fullStr Phylogeny based discovery of regulatory elements
title_full_unstemmed Phylogeny based discovery of regulatory elements
title_short Phylogeny based discovery of regulatory elements
title_sort phylogeny based discovery of regulatory elements
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1525002/
https://www.ncbi.nlm.nih.gov/pubmed/16716228
http://dx.doi.org/10.1186/1471-2105-7-266
work_keys_str_mv AT gertzjason phylogenybaseddiscoveryofregulatoryelements
AT fayjustinc phylogenybaseddiscoveryofregulatoryelements
AT cohenbaraka phylogenybaseddiscoveryofregulatoryelements