Cargando…

Computational generation and screening of RNA motifs in large nucleotide sequence pools

Although identification of active motifs in large random sequence pools is central to RNA in vitro selection, no systematic computational equivalent of this process has yet been developed. We develop a computational approach that combines target pool generation, motif scanning and motif screening us...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Namhee, Izzo, Joseph A., Elmetwaly, Shereef, Gan, Hin Hark, Schlick, Tamar
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2910066/
https://www.ncbi.nlm.nih.gov/pubmed/20448026
http://dx.doi.org/10.1093/nar/gkq282
_version_ 1782184363021041664
author Kim, Namhee
Izzo, Joseph A.
Elmetwaly, Shereef
Gan, Hin Hark
Schlick, Tamar
author_facet Kim, Namhee
Izzo, Joseph A.
Elmetwaly, Shereef
Gan, Hin Hark
Schlick, Tamar
author_sort Kim, Namhee
collection PubMed
description Although identification of active motifs in large random sequence pools is central to RNA in vitro selection, no systematic computational equivalent of this process has yet been developed. We develop a computational approach that combines target pool generation, motif scanning and motif screening using secondary structure analysis for applications to 10(12)–10(14)-sequence pools; large pool sizes are made possible using program redesign and supercomputing resources. We use the new protocol to search for aptamer and ribozyme motifs in pools up to experimental pool size (10(14) sequences). We show that motif scanning, structure matching and flanking sequence analysis, respectively, reduce the initial sequence pool by 6–8, 1–2 and 1 orders of magnitude, consistent with the rare occurrence of active motifs in random pools. The final yields match the theoretical yields from probability theory for simple motifs and overestimate experimental yields, which constitute lower bounds, for aptamers because screening analyses beyond secondary structure information are not considered systematically. We also show that designed pools using our nucleotide transition probability matrices can produce higher yields for RNA ligase motifs than random pools. Our methods for generating, analyzing and designing large pools can help improve RNA design via simulation of aspects of in vitro selection.
format Text
id pubmed-2910066
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-29100662010-07-27 Computational generation and screening of RNA motifs in large nucleotide sequence pools Kim, Namhee Izzo, Joseph A. Elmetwaly, Shereef Gan, Hin Hark Schlick, Tamar Nucleic Acids Res Methods Online Although identification of active motifs in large random sequence pools is central to RNA in vitro selection, no systematic computational equivalent of this process has yet been developed. We develop a computational approach that combines target pool generation, motif scanning and motif screening using secondary structure analysis for applications to 10(12)–10(14)-sequence pools; large pool sizes are made possible using program redesign and supercomputing resources. We use the new protocol to search for aptamer and ribozyme motifs in pools up to experimental pool size (10(14) sequences). We show that motif scanning, structure matching and flanking sequence analysis, respectively, reduce the initial sequence pool by 6–8, 1–2 and 1 orders of magnitude, consistent with the rare occurrence of active motifs in random pools. The final yields match the theoretical yields from probability theory for simple motifs and overestimate experimental yields, which constitute lower bounds, for aptamers because screening analyses beyond secondary structure information are not considered systematically. We also show that designed pools using our nucleotide transition probability matrices can produce higher yields for RNA ligase motifs than random pools. Our methods for generating, analyzing and designing large pools can help improve RNA design via simulation of aspects of in vitro selection. Oxford University Press 2010-07 2010-05-06 /pmc/articles/PMC2910066/ /pubmed/20448026 http://dx.doi.org/10.1093/nar/gkq282 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Kim, Namhee
Izzo, Joseph A.
Elmetwaly, Shereef
Gan, Hin Hark
Schlick, Tamar
Computational generation and screening of RNA motifs in large nucleotide sequence pools
title Computational generation and screening of RNA motifs in large nucleotide sequence pools
title_full Computational generation and screening of RNA motifs in large nucleotide sequence pools
title_fullStr Computational generation and screening of RNA motifs in large nucleotide sequence pools
title_full_unstemmed Computational generation and screening of RNA motifs in large nucleotide sequence pools
title_short Computational generation and screening of RNA motifs in large nucleotide sequence pools
title_sort computational generation and screening of rna motifs in large nucleotide sequence pools
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2910066/
https://www.ncbi.nlm.nih.gov/pubmed/20448026
http://dx.doi.org/10.1093/nar/gkq282
work_keys_str_mv AT kimnamhee computationalgenerationandscreeningofrnamotifsinlargenucleotidesequencepools
AT izzojosepha computationalgenerationandscreeningofrnamotifsinlargenucleotidesequencepools
AT elmetwalyshereef computationalgenerationandscreeningofrnamotifsinlargenucleotidesequencepools
AT ganhinhark computationalgenerationandscreeningofrnamotifsinlargenucleotidesequencepools
AT schlicktamar computationalgenerationandscreeningofrnamotifsinlargenucleotidesequencepools