Cargando…

CSMET: Comparative Genomic Motif Detection via Multi-Resolution Phylogenetic Shadowing

Functional turnover of transcription factor binding sites (TFBSs), such as whole-motif loss or gain, are common events during genome evolution. Conventional probabilistic phylogenetic shadowing methods model the evolution of genomes only at nucleotide level, and lack the ability to capture the evolu...

Descripción completa

Detalles Bibliográficos
Autores principales: Ray, Pradipta, Shringarpure, Suyash, Kolar, Mladen, Xing, Eric P.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2396503/
https://www.ncbi.nlm.nih.gov/pubmed/18535663
http://dx.doi.org/10.1371/journal.pcbi.1000090
_version_ 1782155570024808448
author Ray, Pradipta
Shringarpure, Suyash
Kolar, Mladen
Xing, Eric P.
author_facet Ray, Pradipta
Shringarpure, Suyash
Kolar, Mladen
Xing, Eric P.
author_sort Ray, Pradipta
collection PubMed
description Functional turnover of transcription factor binding sites (TFBSs), such as whole-motif loss or gain, are common events during genome evolution. Conventional probabilistic phylogenetic shadowing methods model the evolution of genomes only at nucleotide level, and lack the ability to capture the evolutionary dynamics of functional turnover of aligned sequence entities. As a result, comparative genomic search of non-conserved motifs across evolutionarily related taxa remains a difficult challenge, especially in higher eukaryotes, where the cis-regulatory regions containing motifs can be long and divergent; existing methods rely heavily on specialized pattern-driven heuristic search or sampling algorithms, which can be difficult to generalize and hard to interpret based on phylogenetic principles. We propose a new method: Conditional Shadowing via Multi-resolution Evolutionary Trees, or CSMET, which uses a context-dependent probabilistic graphical model that allows aligned sites from different taxa in a multiple alignment to be modeled by either a background or an appropriate motif phylogeny conditioning on the functional specifications of each taxon. The functional specifications themselves are the output of a phylogeny which models the evolution not of individual nucleotides, but of the overall functionality (e.g., functional retention or loss) of the aligned sequence segments over lineages. Combining this method with a hidden Markov model that autocorrelates evolutionary rates on successive sites in the genome, CSMET offers a principled way to take into consideration lineage-specific evolution of TFBSs during motif detection, and a readily computable analytical form of the posterior distribution of motifs under TFBS turnover. On both simulated and real Drosophila cis-regulatory modules, CSMET outperforms other state-of-the-art comparative genomic motif finders.
format Text
id pubmed-2396503
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-23965032008-06-06 CSMET: Comparative Genomic Motif Detection via Multi-Resolution Phylogenetic Shadowing Ray, Pradipta Shringarpure, Suyash Kolar, Mladen Xing, Eric P. PLoS Comput Biol Research Article Functional turnover of transcription factor binding sites (TFBSs), such as whole-motif loss or gain, are common events during genome evolution. Conventional probabilistic phylogenetic shadowing methods model the evolution of genomes only at nucleotide level, and lack the ability to capture the evolutionary dynamics of functional turnover of aligned sequence entities. As a result, comparative genomic search of non-conserved motifs across evolutionarily related taxa remains a difficult challenge, especially in higher eukaryotes, where the cis-regulatory regions containing motifs can be long and divergent; existing methods rely heavily on specialized pattern-driven heuristic search or sampling algorithms, which can be difficult to generalize and hard to interpret based on phylogenetic principles. We propose a new method: Conditional Shadowing via Multi-resolution Evolutionary Trees, or CSMET, which uses a context-dependent probabilistic graphical model that allows aligned sites from different taxa in a multiple alignment to be modeled by either a background or an appropriate motif phylogeny conditioning on the functional specifications of each taxon. The functional specifications themselves are the output of a phylogeny which models the evolution not of individual nucleotides, but of the overall functionality (e.g., functional retention or loss) of the aligned sequence segments over lineages. Combining this method with a hidden Markov model that autocorrelates evolutionary rates on successive sites in the genome, CSMET offers a principled way to take into consideration lineage-specific evolution of TFBSs during motif detection, and a readily computable analytical form of the posterior distribution of motifs under TFBS turnover. On both simulated and real Drosophila cis-regulatory modules, CSMET outperforms other state-of-the-art comparative genomic motif finders. Public Library of Science 2008-06-06 /pmc/articles/PMC2396503/ /pubmed/18535663 http://dx.doi.org/10.1371/journal.pcbi.1000090 Text en Ray et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ray, Pradipta
Shringarpure, Suyash
Kolar, Mladen
Xing, Eric P.
CSMET: Comparative Genomic Motif Detection via Multi-Resolution Phylogenetic Shadowing
title CSMET: Comparative Genomic Motif Detection via Multi-Resolution Phylogenetic Shadowing
title_full CSMET: Comparative Genomic Motif Detection via Multi-Resolution Phylogenetic Shadowing
title_fullStr CSMET: Comparative Genomic Motif Detection via Multi-Resolution Phylogenetic Shadowing
title_full_unstemmed CSMET: Comparative Genomic Motif Detection via Multi-Resolution Phylogenetic Shadowing
title_short CSMET: Comparative Genomic Motif Detection via Multi-Resolution Phylogenetic Shadowing
title_sort csmet: comparative genomic motif detection via multi-resolution phylogenetic shadowing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2396503/
https://www.ncbi.nlm.nih.gov/pubmed/18535663
http://dx.doi.org/10.1371/journal.pcbi.1000090
work_keys_str_mv AT raypradipta csmetcomparativegenomicmotifdetectionviamultiresolutionphylogeneticshadowing
AT shringarpuresuyash csmetcomparativegenomicmotifdetectionviamultiresolutionphylogeneticshadowing
AT kolarmladen csmetcomparativegenomicmotifdetectionviamultiresolutionphylogeneticshadowing
AT xingericp csmetcomparativegenomicmotifdetectionviamultiresolutionphylogeneticshadowing