Cargando…

Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory

BACKGROUND: The nearest neighbor model and associated dynamic programming algorithms allow for the efficient estimation of the RNA secondary structure Boltzmann ensemble. However because a given RNA secondary structure only contains a fraction of the possible helices that could form from a given seq...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Luan, McKerrow, Wilson H., Richards, Bryce, Phonsom, Chukiat, Lawrence, Charles E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5836418/
https://www.ncbi.nlm.nih.gov/pubmed/29506466
http://dx.doi.org/10.1186/s12859-018-2078-5
_version_ 1783303962607222784
author Lin, Luan
McKerrow, Wilson H.
Richards, Bryce
Phonsom, Chukiat
Lawrence, Charles E.
author_facet Lin, Luan
McKerrow, Wilson H.
Richards, Bryce
Phonsom, Chukiat
Lawrence, Charles E.
author_sort Lin, Luan
collection PubMed
description BACKGROUND: The nearest neighbor model and associated dynamic programming algorithms allow for the efficient estimation of the RNA secondary structure Boltzmann ensemble. However because a given RNA secondary structure only contains a fraction of the possible helices that could form from a given sequence, the Boltzmann ensemble is multimodal. Several methods exist for clustering structures and finding those modes. However less focus is given to exploring the underlying reasons for this multimodality: the presence of conflicting basepairs. Information theory, or more specifically mutual information, provides a method to identify those basepairs that are key to the secondary structure. RESULTS: To this end we find most informative basepairs and visualize the effect of these basepairs on the secondary structure. Knowing whether a most informative basepair is present tells us not only the status of the particular pair but also provides a large amount of information about which other pairs are present or not present. We find that a few basepairs account for a large amount of the structural uncertainty. The identification of these pairs indicates small changes to sequence or stability that will have a large effect on structure. CONCLUSION: We provide a novel algorithm that uses mutual information to identify the key basepairs that lead to a multimodal Boltzmann distribution. We then visualize the effect of these pairs on the overall Boltzmann ensemble.
format Online
Article
Text
id pubmed-5836418
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58364182018-03-07 Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory Lin, Luan McKerrow, Wilson H. Richards, Bryce Phonsom, Chukiat Lawrence, Charles E. BMC Bioinformatics Methodology Article BACKGROUND: The nearest neighbor model and associated dynamic programming algorithms allow for the efficient estimation of the RNA secondary structure Boltzmann ensemble. However because a given RNA secondary structure only contains a fraction of the possible helices that could form from a given sequence, the Boltzmann ensemble is multimodal. Several methods exist for clustering structures and finding those modes. However less focus is given to exploring the underlying reasons for this multimodality: the presence of conflicting basepairs. Information theory, or more specifically mutual information, provides a method to identify those basepairs that are key to the secondary structure. RESULTS: To this end we find most informative basepairs and visualize the effect of these basepairs on the secondary structure. Knowing whether a most informative basepair is present tells us not only the status of the particular pair but also provides a large amount of information about which other pairs are present or not present. We find that a few basepairs account for a large amount of the structural uncertainty. The identification of these pairs indicates small changes to sequence or stability that will have a large effect on structure. CONCLUSION: We provide a novel algorithm that uses mutual information to identify the key basepairs that lead to a multimodal Boltzmann distribution. We then visualize the effect of these pairs on the overall Boltzmann ensemble. BioMed Central 2018-03-05 /pmc/articles/PMC5836418/ /pubmed/29506466 http://dx.doi.org/10.1186/s12859-018-2078-5 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Lin, Luan
McKerrow, Wilson H.
Richards, Bryce
Phonsom, Chukiat
Lawrence, Charles E.
Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory
title Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory
title_full Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory
title_fullStr Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory
title_full_unstemmed Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory
title_short Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory
title_sort characterization and visualization of rna secondary structure boltzmann ensemble via information theory
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5836418/
https://www.ncbi.nlm.nih.gov/pubmed/29506466
http://dx.doi.org/10.1186/s12859-018-2078-5
work_keys_str_mv AT linluan characterizationandvisualizationofrnasecondarystructureboltzmannensembleviainformationtheory
AT mckerrowwilsonh characterizationandvisualizationofrnasecondarystructureboltzmannensembleviainformationtheory
AT richardsbryce characterizationandvisualizationofrnasecondarystructureboltzmannensembleviainformationtheory
AT phonsomchukiat characterizationandvisualizationofrnasecondarystructureboltzmannensembleviainformationtheory
AT lawrencecharlese characterizationandvisualizationofrnasecondarystructureboltzmannensembleviainformationtheory