Cargando…
Machine learning guided aptamer refinement and discovery
Aptamers are single-stranded nucleic acid ligands that bind to target molecules with high affinity and specificity. They are typically discovered by searching large libraries for sequences with desirable binding properties. These libraries, however, are practically constrained to a fraction of the t...
Autores principales: | , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8062585/ https://www.ncbi.nlm.nih.gov/pubmed/33888692 http://dx.doi.org/10.1038/s41467-021-22555-9 |
_version_ | 1783681796447141888 |
---|---|
author | Bashir, Ali Yang, Qin Wang, Jinpeng Hoyer, Stephan Chou, Wenchuan McLean, Cory Davis, Geoff Gong, Qiang Armstrong, Zan Jang, Junghoon Kang, Hui Pawlosky, Annalisa Scott, Alexander Dahl, George E. Berndl, Marc Dimon, Michelle Ferguson, B. Scott |
author_facet | Bashir, Ali Yang, Qin Wang, Jinpeng Hoyer, Stephan Chou, Wenchuan McLean, Cory Davis, Geoff Gong, Qiang Armstrong, Zan Jang, Junghoon Kang, Hui Pawlosky, Annalisa Scott, Alexander Dahl, George E. Berndl, Marc Dimon, Michelle Ferguson, B. Scott |
author_sort | Bashir, Ali |
collection | PubMed |
description | Aptamers are single-stranded nucleic acid ligands that bind to target molecules with high affinity and specificity. They are typically discovered by searching large libraries for sequences with desirable binding properties. These libraries, however, are practically constrained to a fraction of the theoretical sequence space. Machine learning provides an opportunity to intelligently navigate this space to identify high-performing aptamers. Here, we propose an approach that employs particle display (PD) to partition a library of aptamers by affinity, and uses such data to train machine learning models to predict affinity in silico. Our model predicted high-affinity DNA aptamers from experimental candidates at a rate 11-fold higher than random perturbation and generated novel, high-affinity aptamers at a greater rate than observed by PD alone. Our approach also facilitated the design of truncated aptamers 70% shorter and with higher binding affinity (1.5 nM) than the best experimental candidate. This work demonstrates how combining machine learning and physical approaches can be used to expedite the discovery of better diagnostic and therapeutic agents. |
format | Online Article Text |
id | pubmed-8062585 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-80625852021-05-11 Machine learning guided aptamer refinement and discovery Bashir, Ali Yang, Qin Wang, Jinpeng Hoyer, Stephan Chou, Wenchuan McLean, Cory Davis, Geoff Gong, Qiang Armstrong, Zan Jang, Junghoon Kang, Hui Pawlosky, Annalisa Scott, Alexander Dahl, George E. Berndl, Marc Dimon, Michelle Ferguson, B. Scott Nat Commun Article Aptamers are single-stranded nucleic acid ligands that bind to target molecules with high affinity and specificity. They are typically discovered by searching large libraries for sequences with desirable binding properties. These libraries, however, are practically constrained to a fraction of the theoretical sequence space. Machine learning provides an opportunity to intelligently navigate this space to identify high-performing aptamers. Here, we propose an approach that employs particle display (PD) to partition a library of aptamers by affinity, and uses such data to train machine learning models to predict affinity in silico. Our model predicted high-affinity DNA aptamers from experimental candidates at a rate 11-fold higher than random perturbation and generated novel, high-affinity aptamers at a greater rate than observed by PD alone. Our approach also facilitated the design of truncated aptamers 70% shorter and with higher binding affinity (1.5 nM) than the best experimental candidate. This work demonstrates how combining machine learning and physical approaches can be used to expedite the discovery of better diagnostic and therapeutic agents. Nature Publishing Group UK 2021-04-22 /pmc/articles/PMC8062585/ /pubmed/33888692 http://dx.doi.org/10.1038/s41467-021-22555-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Bashir, Ali Yang, Qin Wang, Jinpeng Hoyer, Stephan Chou, Wenchuan McLean, Cory Davis, Geoff Gong, Qiang Armstrong, Zan Jang, Junghoon Kang, Hui Pawlosky, Annalisa Scott, Alexander Dahl, George E. Berndl, Marc Dimon, Michelle Ferguson, B. Scott Machine learning guided aptamer refinement and discovery |
title | Machine learning guided aptamer refinement and discovery |
title_full | Machine learning guided aptamer refinement and discovery |
title_fullStr | Machine learning guided aptamer refinement and discovery |
title_full_unstemmed | Machine learning guided aptamer refinement and discovery |
title_short | Machine learning guided aptamer refinement and discovery |
title_sort | machine learning guided aptamer refinement and discovery |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8062585/ https://www.ncbi.nlm.nih.gov/pubmed/33888692 http://dx.doi.org/10.1038/s41467-021-22555-9 |
work_keys_str_mv | AT bashirali machinelearningguidedaptamerrefinementanddiscovery AT yangqin machinelearningguidedaptamerrefinementanddiscovery AT wangjinpeng machinelearningguidedaptamerrefinementanddiscovery AT hoyerstephan machinelearningguidedaptamerrefinementanddiscovery AT chouwenchuan machinelearningguidedaptamerrefinementanddiscovery AT mcleancory machinelearningguidedaptamerrefinementanddiscovery AT davisgeoff machinelearningguidedaptamerrefinementanddiscovery AT gongqiang machinelearningguidedaptamerrefinementanddiscovery AT armstrongzan machinelearningguidedaptamerrefinementanddiscovery AT jangjunghoon machinelearningguidedaptamerrefinementanddiscovery AT kanghui machinelearningguidedaptamerrefinementanddiscovery AT pawloskyannalisa machinelearningguidedaptamerrefinementanddiscovery AT scottalexander machinelearningguidedaptamerrefinementanddiscovery AT dahlgeorgee machinelearningguidedaptamerrefinementanddiscovery AT berndlmarc machinelearningguidedaptamerrefinementanddiscovery AT dimonmichelle machinelearningguidedaptamerrefinementanddiscovery AT fergusonbscott machinelearningguidedaptamerrefinementanddiscovery |