Cargando…

RNAslider: a faster engine for consecutive windows folding and its application to the analysis of genomic folding asymmetry

BACKGROUND: Scanning large genomes with a sliding window in search of locally stable RNA structures is a well motivated problem in bioinformatics. Given a predefined window size L and an RNA sequence S of size N (L < N), the consecutive windows folding problem is to compute the minimal free energ...

Descripción completa

Detalles Bibliográficos
Autores principales: Horesh, Yair, Wexler, Ydo, Lebenthal, Ilana, Ziv-Ukelson, Michal, Unger, Ron
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2682796/
https://www.ncbi.nlm.nih.gov/pubmed/19257906
http://dx.doi.org/10.1186/1471-2105-10-76
_version_ 1782167092750974976
author Horesh, Yair
Wexler, Ydo
Lebenthal, Ilana
Ziv-Ukelson, Michal
Unger, Ron
author_facet Horesh, Yair
Wexler, Ydo
Lebenthal, Ilana
Ziv-Ukelson, Michal
Unger, Ron
author_sort Horesh, Yair
collection PubMed
description BACKGROUND: Scanning large genomes with a sliding window in search of locally stable RNA structures is a well motivated problem in bioinformatics. Given a predefined window size L and an RNA sequence S of size N (L < N), the consecutive windows folding problem is to compute the minimal free energy (MFE) for the folding of each of the L-sized substrings of S. The consecutive windows folding problem can be naively solved in O(NL(3)) by applying any of the classical cubic-time RNA folding algorithms to each of the N-L windows of size L. Recently an O(NL(2)) solution for this problem has been described. RESULTS: Here, we describe and implement an O(NLψ(L)) engine for the consecutive windows folding problem, where ψ(L) is shown to converge to O(1) under the assumption of a standard probabilistic polymer folding model, yielding an O(L) speedup which is experimentally confirmed. Using this tool, we note an intriguing directionality (5'-3' vs. 3'-5') folding bias, i.e. that the minimal free energy (MFE) of folding is higher in the native direction of the DNA than in the reverse direction of various genomic regions in several organisms including regions of the genomes that do not encode proteins or ncRNA. This bias largely emerges from the genomic dinucleotide bias which affects the MFE, however we see some variations in the folding bias in the different genomic regions when normalized to the dinucleotide bias. We also present results from calculating the MFE landscape of a mouse chromosome 1, characterizing the MFE of the long ncRNA molecules that reside in this chromosome. CONCLUSION: The efficient consecutive windows folding engine described in this paper allows for genome wide scans for ncRNA molecules as well as large-scale statistics. This is implemented here as a software tool, called RNAslider, and applied to the scanning of long chromosomes, leading to the observation of features that are visible only on a large scale.
format Text
id pubmed-2682796
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26827962009-05-18 RNAslider: a faster engine for consecutive windows folding and its application to the analysis of genomic folding asymmetry Horesh, Yair Wexler, Ydo Lebenthal, Ilana Ziv-Ukelson, Michal Unger, Ron BMC Bioinformatics Methodology Article BACKGROUND: Scanning large genomes with a sliding window in search of locally stable RNA structures is a well motivated problem in bioinformatics. Given a predefined window size L and an RNA sequence S of size N (L < N), the consecutive windows folding problem is to compute the minimal free energy (MFE) for the folding of each of the L-sized substrings of S. The consecutive windows folding problem can be naively solved in O(NL(3)) by applying any of the classical cubic-time RNA folding algorithms to each of the N-L windows of size L. Recently an O(NL(2)) solution for this problem has been described. RESULTS: Here, we describe and implement an O(NLψ(L)) engine for the consecutive windows folding problem, where ψ(L) is shown to converge to O(1) under the assumption of a standard probabilistic polymer folding model, yielding an O(L) speedup which is experimentally confirmed. Using this tool, we note an intriguing directionality (5'-3' vs. 3'-5') folding bias, i.e. that the minimal free energy (MFE) of folding is higher in the native direction of the DNA than in the reverse direction of various genomic regions in several organisms including regions of the genomes that do not encode proteins or ncRNA. This bias largely emerges from the genomic dinucleotide bias which affects the MFE, however we see some variations in the folding bias in the different genomic regions when normalized to the dinucleotide bias. We also present results from calculating the MFE landscape of a mouse chromosome 1, characterizing the MFE of the long ncRNA molecules that reside in this chromosome. CONCLUSION: The efficient consecutive windows folding engine described in this paper allows for genome wide scans for ncRNA molecules as well as large-scale statistics. This is implemented here as a software tool, called RNAslider, and applied to the scanning of long chromosomes, leading to the observation of features that are visible only on a large scale. BioMed Central 2009-03-04 /pmc/articles/PMC2682796/ /pubmed/19257906 http://dx.doi.org/10.1186/1471-2105-10-76 Text en Copyright © 2009 Horesh et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Horesh, Yair
Wexler, Ydo
Lebenthal, Ilana
Ziv-Ukelson, Michal
Unger, Ron
RNAslider: a faster engine for consecutive windows folding and its application to the analysis of genomic folding asymmetry
title RNAslider: a faster engine for consecutive windows folding and its application to the analysis of genomic folding asymmetry
title_full RNAslider: a faster engine for consecutive windows folding and its application to the analysis of genomic folding asymmetry
title_fullStr RNAslider: a faster engine for consecutive windows folding and its application to the analysis of genomic folding asymmetry
title_full_unstemmed RNAslider: a faster engine for consecutive windows folding and its application to the analysis of genomic folding asymmetry
title_short RNAslider: a faster engine for consecutive windows folding and its application to the analysis of genomic folding asymmetry
title_sort rnaslider: a faster engine for consecutive windows folding and its application to the analysis of genomic folding asymmetry
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2682796/
https://www.ncbi.nlm.nih.gov/pubmed/19257906
http://dx.doi.org/10.1186/1471-2105-10-76
work_keys_str_mv AT horeshyair rnasliderafasterengineforconsecutivewindowsfoldinganditsapplicationtotheanalysisofgenomicfoldingasymmetry
AT wexlerydo rnasliderafasterengineforconsecutivewindowsfoldinganditsapplicationtotheanalysisofgenomicfoldingasymmetry
AT lebenthalilana rnasliderafasterengineforconsecutivewindowsfoldinganditsapplicationtotheanalysisofgenomicfoldingasymmetry
AT zivukelsonmichal rnasliderafasterengineforconsecutivewindowsfoldinganditsapplicationtotheanalysisofgenomicfoldingasymmetry
AT ungerron rnasliderafasterengineforconsecutivewindowsfoldinganditsapplicationtotheanalysisofgenomicfoldingasymmetry