Cargando…
Phylogeny based discovery of regulatory elements
BACKGROUND: Algorithms that locate evolutionarily conserved sequences have become powerful tools for finding functional DNA elements, including transcription factor binding sites; however, most methods do not take advantage of an explicit model for the constrained evolution of functional DNA sequenc...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1525002/ https://www.ncbi.nlm.nih.gov/pubmed/16716228 http://dx.doi.org/10.1186/1471-2105-7-266 |
_version_ | 1782128876610125824 |
---|---|
author | Gertz, Jason Fay, Justin C Cohen, Barak A |
author_facet | Gertz, Jason Fay, Justin C Cohen, Barak A |
author_sort | Gertz, Jason |
collection | PubMed |
description | BACKGROUND: Algorithms that locate evolutionarily conserved sequences have become powerful tools for finding functional DNA elements, including transcription factor binding sites; however, most methods do not take advantage of an explicit model for the constrained evolution of functional DNA sequences. RESULTS: We developed a probabilistic framework that combines an HKY85 model, which assigns probabilities to different base substitutions between species, and weight matrix models of transcription factor binding sites, which describe the probabilities of observing particular nucleotides at specific positions in the binding site. The method incorporates the phylogenies of the species under consideration and takes into account the position specific variation of transcription factor binding sites. Using our framework we assessed the suitability of alignments of genomic sequences from commonly used species as substrates for comparative genomic approaches to regulatory motif finding. We then applied this technique to Saccharomyces cerevisiae and related species by examining all possible six base pair DNA sequences (hexamers) and identifying sequences that are conserved in a significant number of promoters. By combining similar conserved hexamers we reconstructed known cis-regulatory motifs and made predictions of previously unidentified motifs. We tested one prediction experimentally, finding it to be a regulatory element involved in the transcriptional response to glucose. CONCLUSION: The experimental validation of a regulatory element prediction missed by other large-scale motif finding studies demonstrates that our approach is a useful addition to the current suite of tools for finding regulatory motifs. |
format | Text |
id | pubmed-1525002 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-15250022006-08-01 Phylogeny based discovery of regulatory elements Gertz, Jason Fay, Justin C Cohen, Barak A BMC Bioinformatics Research Article BACKGROUND: Algorithms that locate evolutionarily conserved sequences have become powerful tools for finding functional DNA elements, including transcription factor binding sites; however, most methods do not take advantage of an explicit model for the constrained evolution of functional DNA sequences. RESULTS: We developed a probabilistic framework that combines an HKY85 model, which assigns probabilities to different base substitutions between species, and weight matrix models of transcription factor binding sites, which describe the probabilities of observing particular nucleotides at specific positions in the binding site. The method incorporates the phylogenies of the species under consideration and takes into account the position specific variation of transcription factor binding sites. Using our framework we assessed the suitability of alignments of genomic sequences from commonly used species as substrates for comparative genomic approaches to regulatory motif finding. We then applied this technique to Saccharomyces cerevisiae and related species by examining all possible six base pair DNA sequences (hexamers) and identifying sequences that are conserved in a significant number of promoters. By combining similar conserved hexamers we reconstructed known cis-regulatory motifs and made predictions of previously unidentified motifs. We tested one prediction experimentally, finding it to be a regulatory element involved in the transcriptional response to glucose. CONCLUSION: The experimental validation of a regulatory element prediction missed by other large-scale motif finding studies demonstrates that our approach is a useful addition to the current suite of tools for finding regulatory motifs. BioMed Central 2006-05-22 /pmc/articles/PMC1525002/ /pubmed/16716228 http://dx.doi.org/10.1186/1471-2105-7-266 Text en Copyright © 2006 Gertz et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Gertz, Jason Fay, Justin C Cohen, Barak A Phylogeny based discovery of regulatory elements |
title | Phylogeny based discovery of regulatory elements |
title_full | Phylogeny based discovery of regulatory elements |
title_fullStr | Phylogeny based discovery of regulatory elements |
title_full_unstemmed | Phylogeny based discovery of regulatory elements |
title_short | Phylogeny based discovery of regulatory elements |
title_sort | phylogeny based discovery of regulatory elements |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1525002/ https://www.ncbi.nlm.nih.gov/pubmed/16716228 http://dx.doi.org/10.1186/1471-2105-7-266 |
work_keys_str_mv | AT gertzjason phylogenybaseddiscoveryofregulatoryelements AT fayjustinc phylogenybaseddiscoveryofregulatoryelements AT cohenbaraka phylogenybaseddiscoveryofregulatoryelements |