Cargando…

Generating experimentally unrelated target molecule-binding highly functionalized nucleic-acid polymers using machine learning

In vitro selection queries large combinatorial libraries for sequence-defined polymers with target binding and reaction catalysis activity. While the total sequence space of these libraries can extend beyond 10(22) sequences, practical considerations limit starting sequences to ≤~10(15) distinct mol...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Jonathan C., Chen, Jonathan P., Shen, Max W., Wornow, Michael, Bae, Minwoo, Yeh, Wei-Hsi, Hsu, Alvin, Liu, David R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9352670/
https://www.ncbi.nlm.nih.gov/pubmed/35927274
http://dx.doi.org/10.1038/s41467-022-31955-4
_version_ 1784762699941412864
author Chen, Jonathan C.
Chen, Jonathan P.
Shen, Max W.
Wornow, Michael
Bae, Minwoo
Yeh, Wei-Hsi
Hsu, Alvin
Liu, David R.
author_facet Chen, Jonathan C.
Chen, Jonathan P.
Shen, Max W.
Wornow, Michael
Bae, Minwoo
Yeh, Wei-Hsi
Hsu, Alvin
Liu, David R.
author_sort Chen, Jonathan C.
collection PubMed
description In vitro selection queries large combinatorial libraries for sequence-defined polymers with target binding and reaction catalysis activity. While the total sequence space of these libraries can extend beyond 10(22) sequences, practical considerations limit starting sequences to ≤~10(15) distinct molecules. Selection-induced sequence convergence and limited sequencing depth further constrain experimentally observable sequence space. To address these limitations, we integrate experimental and machine learning approaches to explore regions of sequence space unrelated to experimentally derived variants. We perform in vitro selections to discover highly side-chain-functionalized nucleic acid polymers (HFNAPs) with potent affinities for a target small molecule (daunomycin K(D) = 5–65 nM). We then use the selection data to train a conditional variational autoencoder (CVAE) machine learning model to generate diverse and unique HFNAP sequences with high daunomycin affinities (K(D) = 9–26 nM), even though they are unrelated in sequence to experimental polymers. Coupling in vitro selection with a machine learning model thus enables direct generation of active variants, demonstrating a new approach to the discovery of functional biopolymers.
format Online
Article
Text
id pubmed-9352670
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-93526702022-08-06 Generating experimentally unrelated target molecule-binding highly functionalized nucleic-acid polymers using machine learning Chen, Jonathan C. Chen, Jonathan P. Shen, Max W. Wornow, Michael Bae, Minwoo Yeh, Wei-Hsi Hsu, Alvin Liu, David R. Nat Commun Article In vitro selection queries large combinatorial libraries for sequence-defined polymers with target binding and reaction catalysis activity. While the total sequence space of these libraries can extend beyond 10(22) sequences, practical considerations limit starting sequences to ≤~10(15) distinct molecules. Selection-induced sequence convergence and limited sequencing depth further constrain experimentally observable sequence space. To address these limitations, we integrate experimental and machine learning approaches to explore regions of sequence space unrelated to experimentally derived variants. We perform in vitro selections to discover highly side-chain-functionalized nucleic acid polymers (HFNAPs) with potent affinities for a target small molecule (daunomycin K(D) = 5–65 nM). We then use the selection data to train a conditional variational autoencoder (CVAE) machine learning model to generate diverse and unique HFNAP sequences with high daunomycin affinities (K(D) = 9–26 nM), even though they are unrelated in sequence to experimental polymers. Coupling in vitro selection with a machine learning model thus enables direct generation of active variants, demonstrating a new approach to the discovery of functional biopolymers. Nature Publishing Group UK 2022-08-04 /pmc/articles/PMC9352670/ /pubmed/35927274 http://dx.doi.org/10.1038/s41467-022-31955-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Chen, Jonathan C.
Chen, Jonathan P.
Shen, Max W.
Wornow, Michael
Bae, Minwoo
Yeh, Wei-Hsi
Hsu, Alvin
Liu, David R.
Generating experimentally unrelated target molecule-binding highly functionalized nucleic-acid polymers using machine learning
title Generating experimentally unrelated target molecule-binding highly functionalized nucleic-acid polymers using machine learning
title_full Generating experimentally unrelated target molecule-binding highly functionalized nucleic-acid polymers using machine learning
title_fullStr Generating experimentally unrelated target molecule-binding highly functionalized nucleic-acid polymers using machine learning
title_full_unstemmed Generating experimentally unrelated target molecule-binding highly functionalized nucleic-acid polymers using machine learning
title_short Generating experimentally unrelated target molecule-binding highly functionalized nucleic-acid polymers using machine learning
title_sort generating experimentally unrelated target molecule-binding highly functionalized nucleic-acid polymers using machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9352670/
https://www.ncbi.nlm.nih.gov/pubmed/35927274
http://dx.doi.org/10.1038/s41467-022-31955-4
work_keys_str_mv AT chenjonathanc generatingexperimentallyunrelatedtargetmoleculebindinghighlyfunctionalizednucleicacidpolymersusingmachinelearning
AT chenjonathanp generatingexperimentallyunrelatedtargetmoleculebindinghighlyfunctionalizednucleicacidpolymersusingmachinelearning
AT shenmaxw generatingexperimentallyunrelatedtargetmoleculebindinghighlyfunctionalizednucleicacidpolymersusingmachinelearning
AT wornowmichael generatingexperimentallyunrelatedtargetmoleculebindinghighlyfunctionalizednucleicacidpolymersusingmachinelearning
AT baeminwoo generatingexperimentallyunrelatedtargetmoleculebindinghighlyfunctionalizednucleicacidpolymersusingmachinelearning
AT yehweihsi generatingexperimentallyunrelatedtargetmoleculebindinghighlyfunctionalizednucleicacidpolymersusingmachinelearning
AT hsualvin generatingexperimentallyunrelatedtargetmoleculebindinghighlyfunctionalizednucleicacidpolymersusingmachinelearning
AT liudavidr generatingexperimentallyunrelatedtargetmoleculebindinghighlyfunctionalizednucleicacidpolymersusingmachinelearning