Cargando…
Sampled ensemble neutrality as a feature to classify potential structured RNAs
BACKGROUND: Structured RNAs have many biological functions ranging from catalysis of chemical reactions to gene regulation. Yet, many homologous structured RNAs display most of their conservation at the secondary or tertiary structure level. As a result, strategies for structured RNA discovery rely...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4333902/ https://www.ncbi.nlm.nih.gov/pubmed/25649229 http://dx.doi.org/10.1186/s12864-014-1203-8 |
_version_ | 1782358122893934592 |
---|---|
author | Pei, Shermin Anthony, Jon S Meyer, Michelle M |
author_facet | Pei, Shermin Anthony, Jon S Meyer, Michelle M |
author_sort | Pei, Shermin |
collection | PubMed |
description | BACKGROUND: Structured RNAs have many biological functions ranging from catalysis of chemical reactions to gene regulation. Yet, many homologous structured RNAs display most of their conservation at the secondary or tertiary structure level. As a result, strategies for structured RNA discovery rely heavily on identification of sequences sharing a common stable secondary structure. However, correctly distinguishing structured RNAs from surrounding genomic sequence remains challenging, especially during de novo discovery. RNA also has a long history as a computational model for evolution due to the direct link between genotype (sequence) and phenotype (structure). From these studies it is clear that evolved RNA structures, like protein structures, can be considered robust to point mutations. In this context, an RNA sequence is considered robust if its neutrality (extent to which single mutant neighbors maintain the same secondary structure) is greater than that expected for an artificial sequence with the same minimum free energy structure. RESULTS: In this work, we bring concepts from evolutionary biology to bear on the structured RNA de novo discovery process. We hypothesize that alignments corresponding to structured RNAs should consist of neutral sequences. We evaluate several measures of neutrality for their ability to distinguish between alignments of structured RNA sequences drawn from Rfam and various decoy alignments. We also introduce a new measure of RNA structural neutrality, the structure ensemble neutrality (SEN). SEN seeks to increase the biological relevance of existing neutrality measures in two ways. First, it uses information from an alignment of homologous sequences to identify a conserved biologically relevant structure for comparison. Second, it only counts base-pairs of the original structure that are absent in the comparison structure and does not penalize the formation of additional base-pairs. CONCLUSION: We find that several measures of neutrality are effective at separating structured RNAs from decoy sequences, including both shuffled alignments and flanking genomic sequence. Furthermore, as an independent feature classifier to identify structured RNAs, SEN yields comparable performance to current approaches that consider a variety of features including stability and sequence identity. Finally, SEN outperforms other measures of neutrality at detecting mutational robustness in bacterial regulatory RNA structures. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-014-1203-8) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4333902 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43339022015-02-20 Sampled ensemble neutrality as a feature to classify potential structured RNAs Pei, Shermin Anthony, Jon S Meyer, Michelle M BMC Genomics Research Article BACKGROUND: Structured RNAs have many biological functions ranging from catalysis of chemical reactions to gene regulation. Yet, many homologous structured RNAs display most of their conservation at the secondary or tertiary structure level. As a result, strategies for structured RNA discovery rely heavily on identification of sequences sharing a common stable secondary structure. However, correctly distinguishing structured RNAs from surrounding genomic sequence remains challenging, especially during de novo discovery. RNA also has a long history as a computational model for evolution due to the direct link between genotype (sequence) and phenotype (structure). From these studies it is clear that evolved RNA structures, like protein structures, can be considered robust to point mutations. In this context, an RNA sequence is considered robust if its neutrality (extent to which single mutant neighbors maintain the same secondary structure) is greater than that expected for an artificial sequence with the same minimum free energy structure. RESULTS: In this work, we bring concepts from evolutionary biology to bear on the structured RNA de novo discovery process. We hypothesize that alignments corresponding to structured RNAs should consist of neutral sequences. We evaluate several measures of neutrality for their ability to distinguish between alignments of structured RNA sequences drawn from Rfam and various decoy alignments. We also introduce a new measure of RNA structural neutrality, the structure ensemble neutrality (SEN). SEN seeks to increase the biological relevance of existing neutrality measures in two ways. First, it uses information from an alignment of homologous sequences to identify a conserved biologically relevant structure for comparison. Second, it only counts base-pairs of the original structure that are absent in the comparison structure and does not penalize the formation of additional base-pairs. CONCLUSION: We find that several measures of neutrality are effective at separating structured RNAs from decoy sequences, including both shuffled alignments and flanking genomic sequence. Furthermore, as an independent feature classifier to identify structured RNAs, SEN yields comparable performance to current approaches that consider a variety of features including stability and sequence identity. Finally, SEN outperforms other measures of neutrality at detecting mutational robustness in bacterial regulatory RNA structures. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-014-1203-8) contains supplementary material, which is available to authorized users. BioMed Central 2015-02-05 /pmc/articles/PMC4333902/ /pubmed/25649229 http://dx.doi.org/10.1186/s12864-014-1203-8 Text en © Pei et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Pei, Shermin Anthony, Jon S Meyer, Michelle M Sampled ensemble neutrality as a feature to classify potential structured RNAs |
title | Sampled ensemble neutrality as a feature to classify potential structured RNAs |
title_full | Sampled ensemble neutrality as a feature to classify potential structured RNAs |
title_fullStr | Sampled ensemble neutrality as a feature to classify potential structured RNAs |
title_full_unstemmed | Sampled ensemble neutrality as a feature to classify potential structured RNAs |
title_short | Sampled ensemble neutrality as a feature to classify potential structured RNAs |
title_sort | sampled ensemble neutrality as a feature to classify potential structured rnas |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4333902/ https://www.ncbi.nlm.nih.gov/pubmed/25649229 http://dx.doi.org/10.1186/s12864-014-1203-8 |
work_keys_str_mv | AT peishermin sampledensembleneutralityasafeaturetoclassifypotentialstructuredrnas AT anthonyjons sampledensembleneutralityasafeaturetoclassifypotentialstructuredrnas AT meyermichellem sampledensembleneutralityasafeaturetoclassifypotentialstructuredrnas |