Cargando…

Frnakenstein: multiple target inverse RNA folding

BACKGROUND: RNA secondary structure prediction, or folding, is a classic problem in bioinformatics: given a sequence of nucleotides, the aim is to predict the base pairs formed in its three dimensional conformation. The inverse problem of designing a sequence folding into a particular target structu...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lyngsø, Rune B, Anderson, James WJ, Sizikova, Elena, Badugu, Amarendra, Hyland, Tomas, Hein, Jotun
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2012
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534541/ https://www.ncbi.nlm.nih.gov/pubmed/23043260 http://dx.doi.org/10.1186/1471-2105-13-260

_version_	1782475350987505664
author	Lyngsø, Rune B Anderson, James WJ Sizikova, Elena Badugu, Amarendra Hyland, Tomas Hein, Jotun
author_facet	Lyngsø, Rune B Anderson, James WJ Sizikova, Elena Badugu, Amarendra Hyland, Tomas Hein, Jotun
author_sort	Lyngsø, Rune B
collection	PubMed
description	BACKGROUND: RNA secondary structure prediction, or folding, is a classic problem in bioinformatics: given a sequence of nucleotides, the aim is to predict the base pairs formed in its three dimensional conformation. The inverse problem of designing a sequence folding into a particular target structure has only more recently received notable interest. With a growing appreciation and understanding of the functional and structural properties of RNA motifs, and a growing interest in utilising biomolecules in nano-scale designs, the interest in the inverse RNA folding problem is bound to increase. However, whereas the RNA folding problem from an algorithmic viewpoint has an elegant and efficient solution, the inverse RNA folding problem appears to be hard. RESULTS: In this paper we present a genetic algorithm approach to solve the inverse folding problem. The main aims of the development was to address the hitherto mostly ignored extension of solving the inverse folding problem, the multi-target inverse folding problem, while simultaneously designing a method with superior performance when measured on the quality of designed sequences. The genetic algorithm has been implemented as a Python program called Frnakenstein. It was benchmarked against four existing methods and several data sets totalling 769 real and predicted single structure targets, and on 292 two structure targets. It performed as well as or better at finding sequences which folded in silico into the target structure than all existing methods, without the heavy bias towards CG base pairs that was observed for all other top performing methods. On the two structure targets it also performed well, generating a perfect design for about 80% of the targets. CONCLUSIONS: Our method illustrates that successful designs for the inverse RNA folding problem does not necessarily have to rely on heavy biases in base pair and unpaired base distributions. The design problem seems to become more difficult on larger structures when the target structures are real structures, while no deterioration was observed for predicted structures. Design for two structure targets is considerably more difficult, but far from impossible, demonstrating the feasibility of automated design of artificial riboswitches. The Python implementation is available at http://www.stats.ox.ac.uk/research/genome/software/frnakenstein.
format	Online Article Text
id	pubmed-3534541
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-35345412013-01-03 Frnakenstein: multiple target inverse RNA folding Lyngsø, Rune B Anderson, James WJ Sizikova, Elena Badugu, Amarendra Hyland, Tomas Hein, Jotun BMC Bioinformatics Research Article BACKGROUND: RNA secondary structure prediction, or folding, is a classic problem in bioinformatics: given a sequence of nucleotides, the aim is to predict the base pairs formed in its three dimensional conformation. The inverse problem of designing a sequence folding into a particular target structure has only more recently received notable interest. With a growing appreciation and understanding of the functional and structural properties of RNA motifs, and a growing interest in utilising biomolecules in nano-scale designs, the interest in the inverse RNA folding problem is bound to increase. However, whereas the RNA folding problem from an algorithmic viewpoint has an elegant and efficient solution, the inverse RNA folding problem appears to be hard. RESULTS: In this paper we present a genetic algorithm approach to solve the inverse folding problem. The main aims of the development was to address the hitherto mostly ignored extension of solving the inverse folding problem, the multi-target inverse folding problem, while simultaneously designing a method with superior performance when measured on the quality of designed sequences. The genetic algorithm has been implemented as a Python program called Frnakenstein. It was benchmarked against four existing methods and several data sets totalling 769 real and predicted single structure targets, and on 292 two structure targets. It performed as well as or better at finding sequences which folded in silico into the target structure than all existing methods, without the heavy bias towards CG base pairs that was observed for all other top performing methods. On the two structure targets it also performed well, generating a perfect design for about 80% of the targets. CONCLUSIONS: Our method illustrates that successful designs for the inverse RNA folding problem does not necessarily have to rely on heavy biases in base pair and unpaired base distributions. The design problem seems to become more difficult on larger structures when the target structures are real structures, while no deterioration was observed for predicted structures. Design for two structure targets is considerably more difficult, but far from impossible, demonstrating the feasibility of automated design of artificial riboswitches. The Python implementation is available at http://www.stats.ox.ac.uk/research/genome/software/frnakenstein. BioMed Central 2012-10-09 /pmc/articles/PMC3534541/ /pubmed/23043260 http://dx.doi.org/10.1186/1471-2105-13-260 Text en Copyright ©2012 Lyngsøet al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Lyngsø, Rune B Anderson, James WJ Sizikova, Elena Badugu, Amarendra Hyland, Tomas Hein, Jotun Frnakenstein: multiple target inverse RNA folding
title	Frnakenstein: multiple target inverse RNA folding
title_full	Frnakenstein: multiple target inverse RNA folding
title_fullStr	Frnakenstein: multiple target inverse RNA folding
title_full_unstemmed	Frnakenstein: multiple target inverse RNA folding
title_short	Frnakenstein: multiple target inverse RNA folding
title_sort	frnakenstein: multiple target inverse rna folding
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534541/ https://www.ncbi.nlm.nih.gov/pubmed/23043260 http://dx.doi.org/10.1186/1471-2105-13-260
work_keys_str_mv	AT lyngsøruneb frnakensteinmultipletargetinversernafolding AT andersonjameswj frnakensteinmultipletargetinversernafolding AT sizikovaelena frnakensteinmultipletargetinversernafolding AT baduguamarendra frnakensteinmultipletargetinversernafolding AT hylandtomas frnakensteinmultipletargetinversernafolding AT heinjotun frnakensteinmultipletargetinversernafolding

Frnakenstein: multiple target inverse RNA folding

Ejemplares similares