Cargando…

A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution

Motivations: The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding prop...

Descripción completa

Detalles Bibliográficos
Autores principales: Reinharz, Vladimir, Ponty, Yann, Waldispühl, Jérôme
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694657/
https://www.ncbi.nlm.nih.gov/pubmed/23812999
http://dx.doi.org/10.1093/bioinformatics/btt217
_version_ 1782274882960097280
author Reinharz, Vladimir
Ponty, Yann
Waldispühl, Jérôme
author_facet Reinharz, Vladimir
Ponty, Yann
Waldispühl, Jérôme
author_sort Reinharz, Vladimir
collection PubMed
description Motivations: The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding properties) and more importantly do not allow the user to control explicitly the nucleotide distribution such as the GC-content in their sequences. However, the latter is an important criterion for large-scale applications as it could presumably be used to design sequences with better transcription rates and/or structural plasticity. Results: In this article, we introduce IncaRNAtion, a novel algorithm to design RNA sequences folding into target secondary structures with a predefined nucleotide distribution. IncaRNAtion uses a global sampling approach and weighted sampling techniques. We show that our approach is fast (i.e. running time comparable or better than local search methods), seedless (we remove the bias of the seed in local search heuristics) and successfully generates high-quality sequences (i.e. thermodynamically stable) for any GC-content. To complete this study, we develop a hybrid method combining our global sampling approach with local search strategies. Remarkably, our glocal methodology overcomes both local and global approaches for sampling sequences with a specific GC-content and target structure. Availability: IncaRNAtion is available at csb.cs.mcgill.ca/incarnation/ Contact: jeromew@cs.mcgill.ca or yann.ponty@lix.polytechnique.fr Supplementary Information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3694657
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36946572013-06-27 A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution Reinharz, Vladimir Ponty, Yann Waldispühl, Jérôme Bioinformatics Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany Motivations: The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding properties) and more importantly do not allow the user to control explicitly the nucleotide distribution such as the GC-content in their sequences. However, the latter is an important criterion for large-scale applications as it could presumably be used to design sequences with better transcription rates and/or structural plasticity. Results: In this article, we introduce IncaRNAtion, a novel algorithm to design RNA sequences folding into target secondary structures with a predefined nucleotide distribution. IncaRNAtion uses a global sampling approach and weighted sampling techniques. We show that our approach is fast (i.e. running time comparable or better than local search methods), seedless (we remove the bias of the seed in local search heuristics) and successfully generates high-quality sequences (i.e. thermodynamically stable) for any GC-content. To complete this study, we develop a hybrid method combining our global sampling approach with local search strategies. Remarkably, our glocal methodology overcomes both local and global approaches for sampling sequences with a specific GC-content and target structure. Availability: IncaRNAtion is available at csb.cs.mcgill.ca/incarnation/ Contact: jeromew@cs.mcgill.ca or yann.ponty@lix.polytechnique.fr Supplementary Information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-07-01 2013-06-19 /pmc/articles/PMC3694657/ /pubmed/23812999 http://dx.doi.org/10.1093/bioinformatics/btt217 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany
Reinharz, Vladimir
Ponty, Yann
Waldispühl, Jérôme
A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution
title A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution
title_full A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution
title_fullStr A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution
title_full_unstemmed A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution
title_short A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution
title_sort weighted sampling algorithm for the design of rna sequences with targeted secondary structure and nucleotide distribution
topic Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694657/
https://www.ncbi.nlm.nih.gov/pubmed/23812999
http://dx.doi.org/10.1093/bioinformatics/btt217
work_keys_str_mv AT reinharzvladimir aweightedsamplingalgorithmforthedesignofrnasequenceswithtargetedsecondarystructureandnucleotidedistribution
AT pontyyann aweightedsamplingalgorithmforthedesignofrnasequenceswithtargetedsecondarystructureandnucleotidedistribution
AT waldispuhljerome aweightedsamplingalgorithmforthedesignofrnasequenceswithtargetedsecondarystructureandnucleotidedistribution
AT reinharzvladimir weightedsamplingalgorithmforthedesignofrnasequenceswithtargetedsecondarystructureandnucleotidedistribution
AT pontyyann weightedsamplingalgorithmforthedesignofrnasequenceswithtargetedsecondarystructureandnucleotidedistribution
AT waldispuhljerome weightedsamplingalgorithmforthedesignofrnasequenceswithtargetedsecondarystructureandnucleotidedistribution