Cargando…

Sampled ensemble neutrality as a feature to classify potential structured RNAs

BACKGROUND: Structured RNAs have many biological functions ranging from catalysis of chemical reactions to gene regulation. Yet, many homologous structured RNAs display most of their conservation at the secondary or tertiary structure level. As a result, strategies for structured RNA discovery rely...

Descripción completa

Detalles Bibliográficos
Autores principales: Pei, Shermin, Anthony, Jon S, Meyer, Michelle M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4333902/
https://www.ncbi.nlm.nih.gov/pubmed/25649229
http://dx.doi.org/10.1186/s12864-014-1203-8
_version_ 1782358122893934592
author Pei, Shermin
Anthony, Jon S
Meyer, Michelle M
author_facet Pei, Shermin
Anthony, Jon S
Meyer, Michelle M
author_sort Pei, Shermin
collection PubMed
description BACKGROUND: Structured RNAs have many biological functions ranging from catalysis of chemical reactions to gene regulation. Yet, many homologous structured RNAs display most of their conservation at the secondary or tertiary structure level. As a result, strategies for structured RNA discovery rely heavily on identification of sequences sharing a common stable secondary structure. However, correctly distinguishing structured RNAs from surrounding genomic sequence remains challenging, especially during de novo discovery. RNA also has a long history as a computational model for evolution due to the direct link between genotype (sequence) and phenotype (structure). From these studies it is clear that evolved RNA structures, like protein structures, can be considered robust to point mutations. In this context, an RNA sequence is considered robust if its neutrality (extent to which single mutant neighbors maintain the same secondary structure) is greater than that expected for an artificial sequence with the same minimum free energy structure. RESULTS: In this work, we bring concepts from evolutionary biology to bear on the structured RNA de novo discovery process. We hypothesize that alignments corresponding to structured RNAs should consist of neutral sequences. We evaluate several measures of neutrality for their ability to distinguish between alignments of structured RNA sequences drawn from Rfam and various decoy alignments. We also introduce a new measure of RNA structural neutrality, the structure ensemble neutrality (SEN). SEN seeks to increase the biological relevance of existing neutrality measures in two ways. First, it uses information from an alignment of homologous sequences to identify a conserved biologically relevant structure for comparison. Second, it only counts base-pairs of the original structure that are absent in the comparison structure and does not penalize the formation of additional base-pairs. CONCLUSION: We find that several measures of neutrality are effective at separating structured RNAs from decoy sequences, including both shuffled alignments and flanking genomic sequence. Furthermore, as an independent feature classifier to identify structured RNAs, SEN yields comparable performance to current approaches that consider a variety of features including stability and sequence identity. Finally, SEN outperforms other measures of neutrality at detecting mutational robustness in bacterial regulatory RNA structures. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-014-1203-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4333902
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43339022015-02-20 Sampled ensemble neutrality as a feature to classify potential structured RNAs Pei, Shermin Anthony, Jon S Meyer, Michelle M BMC Genomics Research Article BACKGROUND: Structured RNAs have many biological functions ranging from catalysis of chemical reactions to gene regulation. Yet, many homologous structured RNAs display most of their conservation at the secondary or tertiary structure level. As a result, strategies for structured RNA discovery rely heavily on identification of sequences sharing a common stable secondary structure. However, correctly distinguishing structured RNAs from surrounding genomic sequence remains challenging, especially during de novo discovery. RNA also has a long history as a computational model for evolution due to the direct link between genotype (sequence) and phenotype (structure). From these studies it is clear that evolved RNA structures, like protein structures, can be considered robust to point mutations. In this context, an RNA sequence is considered robust if its neutrality (extent to which single mutant neighbors maintain the same secondary structure) is greater than that expected for an artificial sequence with the same minimum free energy structure. RESULTS: In this work, we bring concepts from evolutionary biology to bear on the structured RNA de novo discovery process. We hypothesize that alignments corresponding to structured RNAs should consist of neutral sequences. We evaluate several measures of neutrality for their ability to distinguish between alignments of structured RNA sequences drawn from Rfam and various decoy alignments. We also introduce a new measure of RNA structural neutrality, the structure ensemble neutrality (SEN). SEN seeks to increase the biological relevance of existing neutrality measures in two ways. First, it uses information from an alignment of homologous sequences to identify a conserved biologically relevant structure for comparison. Second, it only counts base-pairs of the original structure that are absent in the comparison structure and does not penalize the formation of additional base-pairs. CONCLUSION: We find that several measures of neutrality are effective at separating structured RNAs from decoy sequences, including both shuffled alignments and flanking genomic sequence. Furthermore, as an independent feature classifier to identify structured RNAs, SEN yields comparable performance to current approaches that consider a variety of features including stability and sequence identity. Finally, SEN outperforms other measures of neutrality at detecting mutational robustness in bacterial regulatory RNA structures. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-014-1203-8) contains supplementary material, which is available to authorized users. BioMed Central 2015-02-05 /pmc/articles/PMC4333902/ /pubmed/25649229 http://dx.doi.org/10.1186/s12864-014-1203-8 Text en © Pei et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Pei, Shermin
Anthony, Jon S
Meyer, Michelle M
Sampled ensemble neutrality as a feature to classify potential structured RNAs
title Sampled ensemble neutrality as a feature to classify potential structured RNAs
title_full Sampled ensemble neutrality as a feature to classify potential structured RNAs
title_fullStr Sampled ensemble neutrality as a feature to classify potential structured RNAs
title_full_unstemmed Sampled ensemble neutrality as a feature to classify potential structured RNAs
title_short Sampled ensemble neutrality as a feature to classify potential structured RNAs
title_sort sampled ensemble neutrality as a feature to classify potential structured rnas
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4333902/
https://www.ncbi.nlm.nih.gov/pubmed/25649229
http://dx.doi.org/10.1186/s12864-014-1203-8
work_keys_str_mv AT peishermin sampledensembleneutralityasafeaturetoclassifypotentialstructuredrnas
AT anthonyjons sampledensembleneutralityasafeaturetoclassifypotentialstructuredrnas
AT meyermichellem sampledensembleneutralityasafeaturetoclassifypotentialstructuredrnas