Cargando…

Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape

Mapping the landscape of possible macromolecular polymer sequences to their fitness in performing biological functions is a challenge across the biosciences. A paradigm is the case of aptamers, nucleic acids that can be selected to bind particular target molecules. We have characterized the sequence...

Descripción completa

Detalles Bibliográficos
Autores principales: Knight, Christopher G., Platt, Mark, Rowe, William, Wedge, David C., Khan, Farid, Day, Philip J. R., McShea, Andy, Knowles, Joshua, Kell, Douglas B.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2615635/
https://www.ncbi.nlm.nih.gov/pubmed/19029139
http://dx.doi.org/10.1093/nar/gkn899
_version_ 1782163348745355264
author Knight, Christopher G.
Platt, Mark
Rowe, William
Wedge, David C.
Khan, Farid
Day, Philip J. R.
McShea, Andy
Knowles, Joshua
Kell, Douglas B.
author_facet Knight, Christopher G.
Platt, Mark
Rowe, William
Wedge, David C.
Khan, Farid
Day, Philip J. R.
McShea, Andy
Knowles, Joshua
Kell, Douglas B.
author_sort Knight, Christopher G.
collection PubMed
description Mapping the landscape of possible macromolecular polymer sequences to their fitness in performing biological functions is a challenge across the biosciences. A paradigm is the case of aptamers, nucleic acids that can be selected to bind particular target molecules. We have characterized the sequence-fitness landscape for aptamers binding allophycocyanin (APC) protein via a novel Closed Loop Aptameric Directed Evolution (CLADE) approach. In contrast to the conventional SELEX methodology, selection and mutation of aptamer sequences was carried out in silico, with explicit fitness assays for 44 131 aptamers of known sequence using DNA microarrays in vitro. We capture the landscape using a predictive machine learning model linking sequence features and function and validate this model using 5500 entirely separate test sequences, which give a very high observed versus predicted correlation of 0.87. This approach reveals a complex sequence-fitness mapping, and hypotheses for the physical basis of aptameric binding; it also enables rapid design of novel aptamers with desired binding properties. We demonstrate an extension to the approach by incorporating prior knowledge into CLADE, resulting in some of the tightest binding sequences.
format Text
id pubmed-2615635
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-26156352009-03-30 Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape Knight, Christopher G. Platt, Mark Rowe, William Wedge, David C. Khan, Farid Day, Philip J. R. McShea, Andy Knowles, Joshua Kell, Douglas B. Nucleic Acids Res Methods Online Mapping the landscape of possible macromolecular polymer sequences to their fitness in performing biological functions is a challenge across the biosciences. A paradigm is the case of aptamers, nucleic acids that can be selected to bind particular target molecules. We have characterized the sequence-fitness landscape for aptamers binding allophycocyanin (APC) protein via a novel Closed Loop Aptameric Directed Evolution (CLADE) approach. In contrast to the conventional SELEX methodology, selection and mutation of aptamer sequences was carried out in silico, with explicit fitness assays for 44 131 aptamers of known sequence using DNA microarrays in vitro. We capture the landscape using a predictive machine learning model linking sequence features and function and validate this model using 5500 entirely separate test sequences, which give a very high observed versus predicted correlation of 0.87. This approach reveals a complex sequence-fitness mapping, and hypotheses for the physical basis of aptameric binding; it also enables rapid design of novel aptamers with desired binding properties. We demonstrate an extension to the approach by incorporating prior knowledge into CLADE, resulting in some of the tightest binding sequences. Oxford University Press 2009-01 2008-11-23 /pmc/articles/PMC2615635/ /pubmed/19029139 http://dx.doi.org/10.1093/nar/gkn899 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Knight, Christopher G.
Platt, Mark
Rowe, William
Wedge, David C.
Khan, Farid
Day, Philip J. R.
McShea, Andy
Knowles, Joshua
Kell, Douglas B.
Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape
title Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape
title_full Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape
title_fullStr Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape
title_full_unstemmed Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape
title_short Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape
title_sort array-based evolution of dna aptamers allows modelling of an explicit sequence-fitness landscape
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2615635/
https://www.ncbi.nlm.nih.gov/pubmed/19029139
http://dx.doi.org/10.1093/nar/gkn899
work_keys_str_mv AT knightchristopherg arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape
AT plattmark arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape
AT rowewilliam arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape
AT wedgedavidc arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape
AT khanfarid arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape
AT dayphilipjr arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape
AT mcsheaandy arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape
AT knowlesjoshua arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape
AT kelldouglasb arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape