Cargando…
Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory
BACKGROUND: The nearest neighbor model and associated dynamic programming algorithms allow for the efficient estimation of the RNA secondary structure Boltzmann ensemble. However because a given RNA secondary structure only contains a fraction of the possible helices that could form from a given seq...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5836418/ https://www.ncbi.nlm.nih.gov/pubmed/29506466 http://dx.doi.org/10.1186/s12859-018-2078-5 |
_version_ | 1783303962607222784 |
---|---|
author | Lin, Luan McKerrow, Wilson H. Richards, Bryce Phonsom, Chukiat Lawrence, Charles E. |
author_facet | Lin, Luan McKerrow, Wilson H. Richards, Bryce Phonsom, Chukiat Lawrence, Charles E. |
author_sort | Lin, Luan |
collection | PubMed |
description | BACKGROUND: The nearest neighbor model and associated dynamic programming algorithms allow for the efficient estimation of the RNA secondary structure Boltzmann ensemble. However because a given RNA secondary structure only contains a fraction of the possible helices that could form from a given sequence, the Boltzmann ensemble is multimodal. Several methods exist for clustering structures and finding those modes. However less focus is given to exploring the underlying reasons for this multimodality: the presence of conflicting basepairs. Information theory, or more specifically mutual information, provides a method to identify those basepairs that are key to the secondary structure. RESULTS: To this end we find most informative basepairs and visualize the effect of these basepairs on the secondary structure. Knowing whether a most informative basepair is present tells us not only the status of the particular pair but also provides a large amount of information about which other pairs are present or not present. We find that a few basepairs account for a large amount of the structural uncertainty. The identification of these pairs indicates small changes to sequence or stability that will have a large effect on structure. CONCLUSION: We provide a novel algorithm that uses mutual information to identify the key basepairs that lead to a multimodal Boltzmann distribution. We then visualize the effect of these pairs on the overall Boltzmann ensemble. |
format | Online Article Text |
id | pubmed-5836418 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-58364182018-03-07 Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory Lin, Luan McKerrow, Wilson H. Richards, Bryce Phonsom, Chukiat Lawrence, Charles E. BMC Bioinformatics Methodology Article BACKGROUND: The nearest neighbor model and associated dynamic programming algorithms allow for the efficient estimation of the RNA secondary structure Boltzmann ensemble. However because a given RNA secondary structure only contains a fraction of the possible helices that could form from a given sequence, the Boltzmann ensemble is multimodal. Several methods exist for clustering structures and finding those modes. However less focus is given to exploring the underlying reasons for this multimodality: the presence of conflicting basepairs. Information theory, or more specifically mutual information, provides a method to identify those basepairs that are key to the secondary structure. RESULTS: To this end we find most informative basepairs and visualize the effect of these basepairs on the secondary structure. Knowing whether a most informative basepair is present tells us not only the status of the particular pair but also provides a large amount of information about which other pairs are present or not present. We find that a few basepairs account for a large amount of the structural uncertainty. The identification of these pairs indicates small changes to sequence or stability that will have a large effect on structure. CONCLUSION: We provide a novel algorithm that uses mutual information to identify the key basepairs that lead to a multimodal Boltzmann distribution. We then visualize the effect of these pairs on the overall Boltzmann ensemble. BioMed Central 2018-03-05 /pmc/articles/PMC5836418/ /pubmed/29506466 http://dx.doi.org/10.1186/s12859-018-2078-5 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Lin, Luan McKerrow, Wilson H. Richards, Bryce Phonsom, Chukiat Lawrence, Charles E. Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory |
title | Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory |
title_full | Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory |
title_fullStr | Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory |
title_full_unstemmed | Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory |
title_short | Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory |
title_sort | characterization and visualization of rna secondary structure boltzmann ensemble via information theory |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5836418/ https://www.ncbi.nlm.nih.gov/pubmed/29506466 http://dx.doi.org/10.1186/s12859-018-2078-5 |
work_keys_str_mv | AT linluan characterizationandvisualizationofrnasecondarystructureboltzmannensembleviainformationtheory AT mckerrowwilsonh characterizationandvisualizationofrnasecondarystructureboltzmannensembleviainformationtheory AT richardsbryce characterizationandvisualizationofrnasecondarystructureboltzmannensembleviainformationtheory AT phonsomchukiat characterizationandvisualizationofrnasecondarystructureboltzmannensembleviainformationtheory AT lawrencecharlese characterizationandvisualizationofrnasecondarystructureboltzmannensembleviainformationtheory |