Cargando…
A novel approach to represent and compare RNA secondary structures
Structural information is crucial in ribonucleic acid (RNA) analysis and functional annotation; nevertheless, how to include such structural data is still a debated problem. Dot-bracket notation is the most common and simple representation for RNA secondary structures but its simplicity leads also t...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4041456/ https://www.ncbi.nlm.nih.gov/pubmed/24753415 http://dx.doi.org/10.1093/nar/gku283 |
_version_ | 1782318680928944128 |
---|---|
author | Mattei, Eugenio Ausiello, Gabriele Ferrè, Fabrizio Helmer-Citterich, Manuela |
author_facet | Mattei, Eugenio Ausiello, Gabriele Ferrè, Fabrizio Helmer-Citterich, Manuela |
author_sort | Mattei, Eugenio |
collection | PubMed |
description | Structural information is crucial in ribonucleic acid (RNA) analysis and functional annotation; nevertheless, how to include such structural data is still a debated problem. Dot-bracket notation is the most common and simple representation for RNA secondary structures but its simplicity leads also to ambiguity requiring further processing steps to dissolve. Here we present BEAR (Brand nEw Alphabet for RNA), a new context-aware structural encoding represented by a string of characters. Each character in BEAR encodes for a specific secondary structure element (loop, stem, bulge and internal loop) with specific length. Furthermore, exploiting this informative and yet simple encoding in multiple alignments of related RNAs, we captured how much structural variation is tolerated in RNA families and convert it into transition rates among secondary structure elements. This allowed us to compute a substitution matrix for secondary structure elements called MBR (Matrix of BEAR-encoded RNA secondary structures), of which we tested the ability in aligning RNA secondary structures. We propose BEAR and the MBR as powerful resources for the RNA secondary structure analysis, comparison and classification, motif finding and phylogeny. |
format | Online Article Text |
id | pubmed-4041456 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-40414562014-06-11 A novel approach to represent and compare RNA secondary structures Mattei, Eugenio Ausiello, Gabriele Ferrè, Fabrizio Helmer-Citterich, Manuela Nucleic Acids Res Computational Biology Structural information is crucial in ribonucleic acid (RNA) analysis and functional annotation; nevertheless, how to include such structural data is still a debated problem. Dot-bracket notation is the most common and simple representation for RNA secondary structures but its simplicity leads also to ambiguity requiring further processing steps to dissolve. Here we present BEAR (Brand nEw Alphabet for RNA), a new context-aware structural encoding represented by a string of characters. Each character in BEAR encodes for a specific secondary structure element (loop, stem, bulge and internal loop) with specific length. Furthermore, exploiting this informative and yet simple encoding in multiple alignments of related RNAs, we captured how much structural variation is tolerated in RNA families and convert it into transition rates among secondary structure elements. This allowed us to compute a substitution matrix for secondary structure elements called MBR (Matrix of BEAR-encoded RNA secondary structures), of which we tested the ability in aligning RNA secondary structures. We propose BEAR and the MBR as powerful resources for the RNA secondary structure analysis, comparison and classification, motif finding and phylogeny. Oxford University Press 2014-06-01 2014-04-21 /pmc/articles/PMC4041456/ /pubmed/24753415 http://dx.doi.org/10.1093/nar/gku283 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Computational Biology Mattei, Eugenio Ausiello, Gabriele Ferrè, Fabrizio Helmer-Citterich, Manuela A novel approach to represent and compare RNA secondary structures |
title | A novel approach to represent and compare RNA secondary structures |
title_full | A novel approach to represent and compare RNA secondary structures |
title_fullStr | A novel approach to represent and compare RNA secondary structures |
title_full_unstemmed | A novel approach to represent and compare RNA secondary structures |
title_short | A novel approach to represent and compare RNA secondary structures |
title_sort | novel approach to represent and compare rna secondary structures |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4041456/ https://www.ncbi.nlm.nih.gov/pubmed/24753415 http://dx.doi.org/10.1093/nar/gku283 |
work_keys_str_mv | AT matteieugenio anovelapproachtorepresentandcomparernasecondarystructures AT ausiellogabriele anovelapproachtorepresentandcomparernasecondarystructures AT ferrefabrizio anovelapproachtorepresentandcomparernasecondarystructures AT helmercitterichmanuela anovelapproachtorepresentandcomparernasecondarystructures AT matteieugenio novelapproachtorepresentandcomparernasecondarystructures AT ausiellogabriele novelapproachtorepresentandcomparernasecondarystructures AT ferrefabrizio novelapproachtorepresentandcomparernasecondarystructures AT helmercitterichmanuela novelapproachtorepresentandcomparernasecondarystructures |