Cargando…
Computational generation and screening of RNA motifs in large nucleotide sequence pools
Although identification of active motifs in large random sequence pools is central to RNA in vitro selection, no systematic computational equivalent of this process has yet been developed. We develop a computational approach that combines target pool generation, motif scanning and motif screening us...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2910066/ https://www.ncbi.nlm.nih.gov/pubmed/20448026 http://dx.doi.org/10.1093/nar/gkq282 |
_version_ | 1782184363021041664 |
---|---|
author | Kim, Namhee Izzo, Joseph A. Elmetwaly, Shereef Gan, Hin Hark Schlick, Tamar |
author_facet | Kim, Namhee Izzo, Joseph A. Elmetwaly, Shereef Gan, Hin Hark Schlick, Tamar |
author_sort | Kim, Namhee |
collection | PubMed |
description | Although identification of active motifs in large random sequence pools is central to RNA in vitro selection, no systematic computational equivalent of this process has yet been developed. We develop a computational approach that combines target pool generation, motif scanning and motif screening using secondary structure analysis for applications to 10(12)–10(14)-sequence pools; large pool sizes are made possible using program redesign and supercomputing resources. We use the new protocol to search for aptamer and ribozyme motifs in pools up to experimental pool size (10(14) sequences). We show that motif scanning, structure matching and flanking sequence analysis, respectively, reduce the initial sequence pool by 6–8, 1–2 and 1 orders of magnitude, consistent with the rare occurrence of active motifs in random pools. The final yields match the theoretical yields from probability theory for simple motifs and overestimate experimental yields, which constitute lower bounds, for aptamers because screening analyses beyond secondary structure information are not considered systematically. We also show that designed pools using our nucleotide transition probability matrices can produce higher yields for RNA ligase motifs than random pools. Our methods for generating, analyzing and designing large pools can help improve RNA design via simulation of aspects of in vitro selection. |
format | Text |
id | pubmed-2910066 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-29100662010-07-27 Computational generation and screening of RNA motifs in large nucleotide sequence pools Kim, Namhee Izzo, Joseph A. Elmetwaly, Shereef Gan, Hin Hark Schlick, Tamar Nucleic Acids Res Methods Online Although identification of active motifs in large random sequence pools is central to RNA in vitro selection, no systematic computational equivalent of this process has yet been developed. We develop a computational approach that combines target pool generation, motif scanning and motif screening using secondary structure analysis for applications to 10(12)–10(14)-sequence pools; large pool sizes are made possible using program redesign and supercomputing resources. We use the new protocol to search for aptamer and ribozyme motifs in pools up to experimental pool size (10(14) sequences). We show that motif scanning, structure matching and flanking sequence analysis, respectively, reduce the initial sequence pool by 6–8, 1–2 and 1 orders of magnitude, consistent with the rare occurrence of active motifs in random pools. The final yields match the theoretical yields from probability theory for simple motifs and overestimate experimental yields, which constitute lower bounds, for aptamers because screening analyses beyond secondary structure information are not considered systematically. We also show that designed pools using our nucleotide transition probability matrices can produce higher yields for RNA ligase motifs than random pools. Our methods for generating, analyzing and designing large pools can help improve RNA design via simulation of aspects of in vitro selection. Oxford University Press 2010-07 2010-05-06 /pmc/articles/PMC2910066/ /pubmed/20448026 http://dx.doi.org/10.1093/nar/gkq282 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Kim, Namhee Izzo, Joseph A. Elmetwaly, Shereef Gan, Hin Hark Schlick, Tamar Computational generation and screening of RNA motifs in large nucleotide sequence pools |
title | Computational generation and screening of RNA motifs in large nucleotide sequence pools |
title_full | Computational generation and screening of RNA motifs in large nucleotide sequence pools |
title_fullStr | Computational generation and screening of RNA motifs in large nucleotide sequence pools |
title_full_unstemmed | Computational generation and screening of RNA motifs in large nucleotide sequence pools |
title_short | Computational generation and screening of RNA motifs in large nucleotide sequence pools |
title_sort | computational generation and screening of rna motifs in large nucleotide sequence pools |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2910066/ https://www.ncbi.nlm.nih.gov/pubmed/20448026 http://dx.doi.org/10.1093/nar/gkq282 |
work_keys_str_mv | AT kimnamhee computationalgenerationandscreeningofrnamotifsinlargenucleotidesequencepools AT izzojosepha computationalgenerationandscreeningofrnamotifsinlargenucleotidesequencepools AT elmetwalyshereef computationalgenerationandscreeningofrnamotifsinlargenucleotidesequencepools AT ganhinhark computationalgenerationandscreeningofrnamotifsinlargenucleotidesequencepools AT schlicktamar computationalgenerationandscreeningofrnamotifsinlargenucleotidesequencepools |