Cargando…

RIsearch: fast RNA–RNA interaction search using a simplified nearest-neighbor energy model

Motivation: Regulatory, non-coding RNAs often function by forming a duplex with other RNAs. It is therefore of interest to predict putative RNA–RNA duplexes in silico on a genome-wide scale. Current computational methods for predicting these interactions range from fast complementary-based searches...

Descripción completa

Detalles Bibliográficos
Autores principales: Wenzel, Anne, Akbaşli, Erdinç, Gorodkin, Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3476332/
https://www.ncbi.nlm.nih.gov/pubmed/22923300
http://dx.doi.org/10.1093/bioinformatics/bts519
_version_ 1782247082215604224
author Wenzel, Anne
Akbaşli, Erdinç
Gorodkin, Jan
author_facet Wenzel, Anne
Akbaşli, Erdinç
Gorodkin, Jan
author_sort Wenzel, Anne
collection PubMed
description Motivation: Regulatory, non-coding RNAs often function by forming a duplex with other RNAs. It is therefore of interest to predict putative RNA–RNA duplexes in silico on a genome-wide scale. Current computational methods for predicting these interactions range from fast complementary-based searches to those that take intramolecular binding into account. Together these methods constitute a trade-off between speed and accuracy, while leaving room for improvement within the context of genome-wide screens. A fast pre-filtering of putative duplexes would therefore be desirable. Results: We present RIsearch, an implementation of a simplified Turner energy model for fast computation of hybridization, which significantly reduces runtime while maintaining accuracy. Its time complexity for sequences of lengths m and n is [Image: see text] with a much smaller pre-factor than other tools. We show that this energy model is an accurate approximation of the full energy model for near-complementary RNA–RNA duplexes. RIsearch uses a Smith–Waterman-like algorithm using a dinucleotide scoring matrix which approximates the Turner nearest-neighbor energies. We show in benchmarks that we achieve a speed improvement of at least 2.4× compared with RNAplex, the currently fastest method for searching near-complementary regions. RIsearch shows a prediction accuracy similar to RNAplex on two datasets of known bacterial short RNA (sRNA)–messenger RNA (mRNA) and eukaryotic microRNA (miRNA)–mRNA interactions. Using RIsearch as a pre-filter in genome-wide screens reduces the number of binding site candidates reported by miRNA target prediction programs, such as TargetScanS and miRanda, by up to 70%. Likewise, substantial filtering was performed on bacterial RNA–RNA interaction data. Availability: The source code for RIsearch is available at: http://rth.dk/resources/risearch. Contact: gorodkin@rth.dk Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3476332
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-34763322012-12-12 RIsearch: fast RNA–RNA interaction search using a simplified nearest-neighbor energy model Wenzel, Anne Akbaşli, Erdinç Gorodkin, Jan Bioinformatics Original Papers Motivation: Regulatory, non-coding RNAs often function by forming a duplex with other RNAs. It is therefore of interest to predict putative RNA–RNA duplexes in silico on a genome-wide scale. Current computational methods for predicting these interactions range from fast complementary-based searches to those that take intramolecular binding into account. Together these methods constitute a trade-off between speed and accuracy, while leaving room for improvement within the context of genome-wide screens. A fast pre-filtering of putative duplexes would therefore be desirable. Results: We present RIsearch, an implementation of a simplified Turner energy model for fast computation of hybridization, which significantly reduces runtime while maintaining accuracy. Its time complexity for sequences of lengths m and n is [Image: see text] with a much smaller pre-factor than other tools. We show that this energy model is an accurate approximation of the full energy model for near-complementary RNA–RNA duplexes. RIsearch uses a Smith–Waterman-like algorithm using a dinucleotide scoring matrix which approximates the Turner nearest-neighbor energies. We show in benchmarks that we achieve a speed improvement of at least 2.4× compared with RNAplex, the currently fastest method for searching near-complementary regions. RIsearch shows a prediction accuracy similar to RNAplex on two datasets of known bacterial short RNA (sRNA)–messenger RNA (mRNA) and eukaryotic microRNA (miRNA)–mRNA interactions. Using RIsearch as a pre-filter in genome-wide screens reduces the number of binding site candidates reported by miRNA target prediction programs, such as TargetScanS and miRanda, by up to 70%. Likewise, substantial filtering was performed on bacterial RNA–RNA interaction data. Availability: The source code for RIsearch is available at: http://rth.dk/resources/risearch. Contact: gorodkin@rth.dk Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-11-01 2012-08-24 /pmc/articles/PMC3476332/ /pubmed/22923300 http://dx.doi.org/10.1093/bioinformatics/bts519 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Wenzel, Anne
Akbaşli, Erdinç
Gorodkin, Jan
RIsearch: fast RNA–RNA interaction search using a simplified nearest-neighbor energy model
title RIsearch: fast RNA–RNA interaction search using a simplified nearest-neighbor energy model
title_full RIsearch: fast RNA–RNA interaction search using a simplified nearest-neighbor energy model
title_fullStr RIsearch: fast RNA–RNA interaction search using a simplified nearest-neighbor energy model
title_full_unstemmed RIsearch: fast RNA–RNA interaction search using a simplified nearest-neighbor energy model
title_short RIsearch: fast RNA–RNA interaction search using a simplified nearest-neighbor energy model
title_sort risearch: fast rna–rna interaction search using a simplified nearest-neighbor energy model
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3476332/
https://www.ncbi.nlm.nih.gov/pubmed/22923300
http://dx.doi.org/10.1093/bioinformatics/bts519
work_keys_str_mv AT wenzelanne risearchfastrnarnainteractionsearchusingasimplifiednearestneighborenergymodel
AT akbaslierdinc risearchfastrnarnainteractionsearchusingasimplifiednearestneighborenergymodel
AT gorodkinjan risearchfastrnarnainteractionsearchusingasimplifiednearestneighborenergymodel