Cargando…
Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape
Mapping the landscape of possible macromolecular polymer sequences to their fitness in performing biological functions is a challenge across the biosciences. A paradigm is the case of aptamers, nucleic acids that can be selected to bind particular target molecules. We have characterized the sequence...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2615635/ https://www.ncbi.nlm.nih.gov/pubmed/19029139 http://dx.doi.org/10.1093/nar/gkn899 |
_version_ | 1782163348745355264 |
---|---|
author | Knight, Christopher G. Platt, Mark Rowe, William Wedge, David C. Khan, Farid Day, Philip J. R. McShea, Andy Knowles, Joshua Kell, Douglas B. |
author_facet | Knight, Christopher G. Platt, Mark Rowe, William Wedge, David C. Khan, Farid Day, Philip J. R. McShea, Andy Knowles, Joshua Kell, Douglas B. |
author_sort | Knight, Christopher G. |
collection | PubMed |
description | Mapping the landscape of possible macromolecular polymer sequences to their fitness in performing biological functions is a challenge across the biosciences. A paradigm is the case of aptamers, nucleic acids that can be selected to bind particular target molecules. We have characterized the sequence-fitness landscape for aptamers binding allophycocyanin (APC) protein via a novel Closed Loop Aptameric Directed Evolution (CLADE) approach. In contrast to the conventional SELEX methodology, selection and mutation of aptamer sequences was carried out in silico, with explicit fitness assays for 44 131 aptamers of known sequence using DNA microarrays in vitro. We capture the landscape using a predictive machine learning model linking sequence features and function and validate this model using 5500 entirely separate test sequences, which give a very high observed versus predicted correlation of 0.87. This approach reveals a complex sequence-fitness mapping, and hypotheses for the physical basis of aptameric binding; it also enables rapid design of novel aptamers with desired binding properties. We demonstrate an extension to the approach by incorporating prior knowledge into CLADE, resulting in some of the tightest binding sequences. |
format | Text |
id | pubmed-2615635 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-26156352009-03-30 Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape Knight, Christopher G. Platt, Mark Rowe, William Wedge, David C. Khan, Farid Day, Philip J. R. McShea, Andy Knowles, Joshua Kell, Douglas B. Nucleic Acids Res Methods Online Mapping the landscape of possible macromolecular polymer sequences to their fitness in performing biological functions is a challenge across the biosciences. A paradigm is the case of aptamers, nucleic acids that can be selected to bind particular target molecules. We have characterized the sequence-fitness landscape for aptamers binding allophycocyanin (APC) protein via a novel Closed Loop Aptameric Directed Evolution (CLADE) approach. In contrast to the conventional SELEX methodology, selection and mutation of aptamer sequences was carried out in silico, with explicit fitness assays for 44 131 aptamers of known sequence using DNA microarrays in vitro. We capture the landscape using a predictive machine learning model linking sequence features and function and validate this model using 5500 entirely separate test sequences, which give a very high observed versus predicted correlation of 0.87. This approach reveals a complex sequence-fitness mapping, and hypotheses for the physical basis of aptameric binding; it also enables rapid design of novel aptamers with desired binding properties. We demonstrate an extension to the approach by incorporating prior knowledge into CLADE, resulting in some of the tightest binding sequences. Oxford University Press 2009-01 2008-11-23 /pmc/articles/PMC2615635/ /pubmed/19029139 http://dx.doi.org/10.1093/nar/gkn899 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Knight, Christopher G. Platt, Mark Rowe, William Wedge, David C. Khan, Farid Day, Philip J. R. McShea, Andy Knowles, Joshua Kell, Douglas B. Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape |
title | Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape |
title_full | Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape |
title_fullStr | Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape |
title_full_unstemmed | Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape |
title_short | Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape |
title_sort | array-based evolution of dna aptamers allows modelling of an explicit sequence-fitness landscape |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2615635/ https://www.ncbi.nlm.nih.gov/pubmed/19029139 http://dx.doi.org/10.1093/nar/gkn899 |
work_keys_str_mv | AT knightchristopherg arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape AT plattmark arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape AT rowewilliam arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape AT wedgedavidc arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape AT khanfarid arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape AT dayphilipjr arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape AT mcsheaandy arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape AT knowlesjoshua arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape AT kelldouglasb arraybasedevolutionofdnaaptamersallowsmodellingofanexplicitsequencefitnesslandscape |