Cargando…

A probabilistic model for the evolution of RNA structure

BACKGROUND: For the purposes of finding and aligning noncoding RNA gene- and cis-regulatory elements in multiple-genome datasets, it is useful to be able to derive multi-sequence stochastic grammars (and hence multiple alignment algorithms) systematically, starting from hypotheses about the various...

Descripción completa

Detalles Bibliográficos
Autor principal: Holmes, Ian
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC534097/
https://www.ncbi.nlm.nih.gov/pubmed/15507142
http://dx.doi.org/10.1186/1471-2105-5-166
_version_ 1782121997986168832
author Holmes, Ian
author_facet Holmes, Ian
author_sort Holmes, Ian
collection PubMed
description BACKGROUND: For the purposes of finding and aligning noncoding RNA gene- and cis-regulatory elements in multiple-genome datasets, it is useful to be able to derive multi-sequence stochastic grammars (and hence multiple alignment algorithms) systematically, starting from hypotheses about the various kinds of random mutation event and their rates. RESULTS: Here, we consider a highly simplified evolutionary model for RNA, called "The TKF91 Structure Tree" (following Thorne, Kishino and Felsenstein's 1991 model of sequence evolution with indels), which we have implemented for pairwise alignment as proof of principle for such an approach. The model, its strengths and its weaknesses are discussed with reference to four examples of functional ncRNA sequences: a riboswitch (guanine), a zipcode (nanos), a splicing factor (U4) and a ribozyme (RNase P). As shown by our visualisations of posterior probability matrices, the selected examples illustrate three different signatures of natural selection that are highly characteristic of ncRNA: (i) co-ordinated basepair substitutions, (ii) co-ordinated basepair indels and (iii) whole-stem indels. CONCLUSIONS: Although all three types of mutation "event" are built into our model, events of type (i) and (ii) are found to be better modeled than events of type (iii). Nevertheless, we hypothesise from the model's performance on pairwise alignments that it would form an adequate basis for a prototype multiple alignment and genefinding tool.
format Text
id pubmed-534097
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5340972004-11-28 A probabilistic model for the evolution of RNA structure Holmes, Ian BMC Bioinformatics Methodology Article BACKGROUND: For the purposes of finding and aligning noncoding RNA gene- and cis-regulatory elements in multiple-genome datasets, it is useful to be able to derive multi-sequence stochastic grammars (and hence multiple alignment algorithms) systematically, starting from hypotheses about the various kinds of random mutation event and their rates. RESULTS: Here, we consider a highly simplified evolutionary model for RNA, called "The TKF91 Structure Tree" (following Thorne, Kishino and Felsenstein's 1991 model of sequence evolution with indels), which we have implemented for pairwise alignment as proof of principle for such an approach. The model, its strengths and its weaknesses are discussed with reference to four examples of functional ncRNA sequences: a riboswitch (guanine), a zipcode (nanos), a splicing factor (U4) and a ribozyme (RNase P). As shown by our visualisations of posterior probability matrices, the selected examples illustrate three different signatures of natural selection that are highly characteristic of ncRNA: (i) co-ordinated basepair substitutions, (ii) co-ordinated basepair indels and (iii) whole-stem indels. CONCLUSIONS: Although all three types of mutation "event" are built into our model, events of type (i) and (ii) are found to be better modeled than events of type (iii). Nevertheless, we hypothesise from the model's performance on pairwise alignments that it would form an adequate basis for a prototype multiple alignment and genefinding tool. BioMed Central 2004-10-26 /pmc/articles/PMC534097/ /pubmed/15507142 http://dx.doi.org/10.1186/1471-2105-5-166 Text en Copyright © 2004 Holmes; licensee BioMed Central Ltd.
spellingShingle Methodology Article
Holmes, Ian
A probabilistic model for the evolution of RNA structure
title A probabilistic model for the evolution of RNA structure
title_full A probabilistic model for the evolution of RNA structure
title_fullStr A probabilistic model for the evolution of RNA structure
title_full_unstemmed A probabilistic model for the evolution of RNA structure
title_short A probabilistic model for the evolution of RNA structure
title_sort probabilistic model for the evolution of rna structure
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC534097/
https://www.ncbi.nlm.nih.gov/pubmed/15507142
http://dx.doi.org/10.1186/1471-2105-5-166
work_keys_str_mv AT holmesian aprobabilisticmodelfortheevolutionofrnastructure
AT holmesian probabilisticmodelfortheevolutionofrnastructure