Cargando…

MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules

The discovery and analysis of cis-regulatory modules (CRMs) in metazoan genomes is crucial for understanding the transcriptional control of development and many other biological processes. Cross-species sequence comparison holds much promise for improving computational prediction of CRMs, for elucid...

Descripción completa

Detalles Bibliográficos
Autores principales: Sinha, Saurabh, He, Xin
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2065892/
https://www.ncbi.nlm.nih.gov/pubmed/17997594
http://dx.doi.org/10.1371/journal.pcbi.0030216
_version_ 1782137655363895296
author Sinha, Saurabh
He, Xin
author_facet Sinha, Saurabh
He, Xin
author_sort Sinha, Saurabh
collection PubMed
description The discovery and analysis of cis-regulatory modules (CRMs) in metazoan genomes is crucial for understanding the transcriptional control of development and many other biological processes. Cross-species sequence comparison holds much promise for improving computational prediction of CRMs, for elucidating their binding site composition, and for understanding how they evolve. Current methods for analyzing orthologous CRMs from multiple species rely upon sequence alignments produced by off-the-shelf alignment algorithms, which do not exploit the presence of binding sites in the sequences. We present here a unified probabilistic framework, called MORPH, that integrates the alignment task with binding site predictions, allowing more robust CRM analysis in two species. The framework sums over all possible alignments of two sequences, thus accounting for alignment ambiguities in a natural way. We perform extensive tests on orthologous CRMs from two moderately diverged species Drosophila melanogaster and D. mojavensis, to demonstrate the advantages of the new approach. We show that it can overcome certain computational artifacts of traditional alignment tools and provide a different, likely more accurate, picture of cis-regulatory evolution than that obtained from existing methods. The burgeoning field of cis-regulatory evolution, which is amply supported by the availability of many related genomes, is currently thwarted by the lack of accurate alignments of regulatory regions. Our work will fill in this void and enable more reliable analysis of CRM evolution.
format Text
id pubmed-2065892
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-20658922007-11-29 MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules Sinha, Saurabh He, Xin PLoS Comput Biol Research Article The discovery and analysis of cis-regulatory modules (CRMs) in metazoan genomes is crucial for understanding the transcriptional control of development and many other biological processes. Cross-species sequence comparison holds much promise for improving computational prediction of CRMs, for elucidating their binding site composition, and for understanding how they evolve. Current methods for analyzing orthologous CRMs from multiple species rely upon sequence alignments produced by off-the-shelf alignment algorithms, which do not exploit the presence of binding sites in the sequences. We present here a unified probabilistic framework, called MORPH, that integrates the alignment task with binding site predictions, allowing more robust CRM analysis in two species. The framework sums over all possible alignments of two sequences, thus accounting for alignment ambiguities in a natural way. We perform extensive tests on orthologous CRMs from two moderately diverged species Drosophila melanogaster and D. mojavensis, to demonstrate the advantages of the new approach. We show that it can overcome certain computational artifacts of traditional alignment tools and provide a different, likely more accurate, picture of cis-regulatory evolution than that obtained from existing methods. The burgeoning field of cis-regulatory evolution, which is amply supported by the availability of many related genomes, is currently thwarted by the lack of accurate alignments of regulatory regions. Our work will fill in this void and enable more reliable analysis of CRM evolution. Public Library of Science 2007-11 2007-11-09 /pmc/articles/PMC2065892/ /pubmed/17997594 http://dx.doi.org/10.1371/journal.pcbi.0030216 Text en © 2007 Sinha and He. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Sinha, Saurabh
He, Xin
MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules
title MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules
title_full MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules
title_fullStr MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules
title_full_unstemmed MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules
title_short MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules
title_sort morph: probabilistic alignment combined with hidden markov models of cis-regulatory modules
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2065892/
https://www.ncbi.nlm.nih.gov/pubmed/17997594
http://dx.doi.org/10.1371/journal.pcbi.0030216
work_keys_str_mv AT sinhasaurabh morphprobabilisticalignmentcombinedwithhiddenmarkovmodelsofcisregulatorymodules
AT hexin morphprobabilisticalignmentcombinedwithhiddenmarkovmodelsofcisregulatorymodules