Cargando…
A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution
Motivations: The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding prop...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694657/ https://www.ncbi.nlm.nih.gov/pubmed/23812999 http://dx.doi.org/10.1093/bioinformatics/btt217 |
_version_ | 1782274882960097280 |
---|---|
author | Reinharz, Vladimir Ponty, Yann Waldispühl, Jérôme |
author_facet | Reinharz, Vladimir Ponty, Yann Waldispühl, Jérôme |
author_sort | Reinharz, Vladimir |
collection | PubMed |
description | Motivations: The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding properties) and more importantly do not allow the user to control explicitly the nucleotide distribution such as the GC-content in their sequences. However, the latter is an important criterion for large-scale applications as it could presumably be used to design sequences with better transcription rates and/or structural plasticity. Results: In this article, we introduce IncaRNAtion, a novel algorithm to design RNA sequences folding into target secondary structures with a predefined nucleotide distribution. IncaRNAtion uses a global sampling approach and weighted sampling techniques. We show that our approach is fast (i.e. running time comparable or better than local search methods), seedless (we remove the bias of the seed in local search heuristics) and successfully generates high-quality sequences (i.e. thermodynamically stable) for any GC-content. To complete this study, we develop a hybrid method combining our global sampling approach with local search strategies. Remarkably, our glocal methodology overcomes both local and global approaches for sampling sequences with a specific GC-content and target structure. Availability: IncaRNAtion is available at csb.cs.mcgill.ca/incarnation/ Contact: jeromew@cs.mcgill.ca or yann.ponty@lix.polytechnique.fr Supplementary Information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-3694657 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-36946572013-06-27 A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution Reinharz, Vladimir Ponty, Yann Waldispühl, Jérôme Bioinformatics Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany Motivations: The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding properties) and more importantly do not allow the user to control explicitly the nucleotide distribution such as the GC-content in their sequences. However, the latter is an important criterion for large-scale applications as it could presumably be used to design sequences with better transcription rates and/or structural plasticity. Results: In this article, we introduce IncaRNAtion, a novel algorithm to design RNA sequences folding into target secondary structures with a predefined nucleotide distribution. IncaRNAtion uses a global sampling approach and weighted sampling techniques. We show that our approach is fast (i.e. running time comparable or better than local search methods), seedless (we remove the bias of the seed in local search heuristics) and successfully generates high-quality sequences (i.e. thermodynamically stable) for any GC-content. To complete this study, we develop a hybrid method combining our global sampling approach with local search strategies. Remarkably, our glocal methodology overcomes both local and global approaches for sampling sequences with a specific GC-content and target structure. Availability: IncaRNAtion is available at csb.cs.mcgill.ca/incarnation/ Contact: jeromew@cs.mcgill.ca or yann.ponty@lix.polytechnique.fr Supplementary Information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-07-01 2013-06-19 /pmc/articles/PMC3694657/ /pubmed/23812999 http://dx.doi.org/10.1093/bioinformatics/btt217 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany Reinharz, Vladimir Ponty, Yann Waldispühl, Jérôme A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution |
title | A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution |
title_full | A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution |
title_fullStr | A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution |
title_full_unstemmed | A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution |
title_short | A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution |
title_sort | weighted sampling algorithm for the design of rna sequences with targeted secondary structure and nucleotide distribution |
topic | Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694657/ https://www.ncbi.nlm.nih.gov/pubmed/23812999 http://dx.doi.org/10.1093/bioinformatics/btt217 |
work_keys_str_mv | AT reinharzvladimir aweightedsamplingalgorithmforthedesignofrnasequenceswithtargetedsecondarystructureandnucleotidedistribution AT pontyyann aweightedsamplingalgorithmforthedesignofrnasequenceswithtargetedsecondarystructureandnucleotidedistribution AT waldispuhljerome aweightedsamplingalgorithmforthedesignofrnasequenceswithtargetedsecondarystructureandnucleotidedistribution AT reinharzvladimir weightedsamplingalgorithmforthedesignofrnasequenceswithtargetedsecondarystructureandnucleotidedistribution AT pontyyann weightedsamplingalgorithmforthedesignofrnasequenceswithtargetedsecondarystructureandnucleotidedistribution AT waldispuhljerome weightedsamplingalgorithmforthedesignofrnasequenceswithtargetedsecondarystructureandnucleotidedistribution |