Cargando…
MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules
The discovery and analysis of cis-regulatory modules (CRMs) in metazoan genomes is crucial for understanding the transcriptional control of development and many other biological processes. Cross-species sequence comparison holds much promise for improving computational prediction of CRMs, for elucid...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2065892/ https://www.ncbi.nlm.nih.gov/pubmed/17997594 http://dx.doi.org/10.1371/journal.pcbi.0030216 |
_version_ | 1782137655363895296 |
---|---|
author | Sinha, Saurabh He, Xin |
author_facet | Sinha, Saurabh He, Xin |
author_sort | Sinha, Saurabh |
collection | PubMed |
description | The discovery and analysis of cis-regulatory modules (CRMs) in metazoan genomes is crucial for understanding the transcriptional control of development and many other biological processes. Cross-species sequence comparison holds much promise for improving computational prediction of CRMs, for elucidating their binding site composition, and for understanding how they evolve. Current methods for analyzing orthologous CRMs from multiple species rely upon sequence alignments produced by off-the-shelf alignment algorithms, which do not exploit the presence of binding sites in the sequences. We present here a unified probabilistic framework, called MORPH, that integrates the alignment task with binding site predictions, allowing more robust CRM analysis in two species. The framework sums over all possible alignments of two sequences, thus accounting for alignment ambiguities in a natural way. We perform extensive tests on orthologous CRMs from two moderately diverged species Drosophila melanogaster and D. mojavensis, to demonstrate the advantages of the new approach. We show that it can overcome certain computational artifacts of traditional alignment tools and provide a different, likely more accurate, picture of cis-regulatory evolution than that obtained from existing methods. The burgeoning field of cis-regulatory evolution, which is amply supported by the availability of many related genomes, is currently thwarted by the lack of accurate alignments of regulatory regions. Our work will fill in this void and enable more reliable analysis of CRM evolution. |
format | Text |
id | pubmed-2065892 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-20658922007-11-29 MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules Sinha, Saurabh He, Xin PLoS Comput Biol Research Article The discovery and analysis of cis-regulatory modules (CRMs) in metazoan genomes is crucial for understanding the transcriptional control of development and many other biological processes. Cross-species sequence comparison holds much promise for improving computational prediction of CRMs, for elucidating their binding site composition, and for understanding how they evolve. Current methods for analyzing orthologous CRMs from multiple species rely upon sequence alignments produced by off-the-shelf alignment algorithms, which do not exploit the presence of binding sites in the sequences. We present here a unified probabilistic framework, called MORPH, that integrates the alignment task with binding site predictions, allowing more robust CRM analysis in two species. The framework sums over all possible alignments of two sequences, thus accounting for alignment ambiguities in a natural way. We perform extensive tests on orthologous CRMs from two moderately diverged species Drosophila melanogaster and D. mojavensis, to demonstrate the advantages of the new approach. We show that it can overcome certain computational artifacts of traditional alignment tools and provide a different, likely more accurate, picture of cis-regulatory evolution than that obtained from existing methods. The burgeoning field of cis-regulatory evolution, which is amply supported by the availability of many related genomes, is currently thwarted by the lack of accurate alignments of regulatory regions. Our work will fill in this void and enable more reliable analysis of CRM evolution. Public Library of Science 2007-11 2007-11-09 /pmc/articles/PMC2065892/ /pubmed/17997594 http://dx.doi.org/10.1371/journal.pcbi.0030216 Text en © 2007 Sinha and He. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Sinha, Saurabh He, Xin MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules |
title | MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules |
title_full | MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules |
title_fullStr | MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules |
title_full_unstemmed | MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules |
title_short | MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules |
title_sort | morph: probabilistic alignment combined with hidden markov models of cis-regulatory modules |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2065892/ https://www.ncbi.nlm.nih.gov/pubmed/17997594 http://dx.doi.org/10.1371/journal.pcbi.0030216 |
work_keys_str_mv | AT sinhasaurabh morphprobabilisticalignmentcombinedwithhiddenmarkovmodelsofcisregulatorymodules AT hexin morphprobabilisticalignmentcombinedwithhiddenmarkovmodelsofcisregulatorymodules |