
DNA-Encoded Solid-Phase Synthesis: Encoding Language Design and Complex Oligomer Library Synthesis

The promise of exploiting combinatorial synthesis for small molecule discovery remains unfulfilled due primarily to the “structure elucidation problem”: the back-end mass spectrometric analysis that significantly restricts one-bead-one-compound (OBOC) library complexity. The very m...

Bibliographic Details
Main Authors: MacConnell, Andrew B., McEnaney, Patrick J., Cavett, Valerie J., Paegel, Brian M.
Format: Online Article Text
Language: English
Published: American Chemical Society 2015
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4571006/
https://www.ncbi.nlm.nih.gov/pubmed/26290177
http://dx.doi.org/10.1021/acscombsci.5b00106
author MacConnell, Andrew B.
McEnaney, Patrick J.
Cavett, Valerie J.
Paegel, Brian M.
collection PubMed
description The promise of exploiting combinatorial synthesis for small molecule discovery remains unfulfilled due primarily to the “structure elucidation problem”: the back-end mass spectrometric analysis that significantly restricts one-bead-one-compound (OBOC) library complexity. The very molecular features that confer binding potency and specificity, such as stereochemistry, regiochemistry, and scaffold rigidity, are conspicuously absent from most libraries because isomerism introduces mass redundancy and diverse scaffolds yield uninterpretable MS fragmentation. Here we present DNA-encoded solid-phase synthesis (DESPS), comprising parallel compound synthesis in organic solvent and aqueous enzymatic ligation of unprotected encoding dsDNA oligonucleotides. Computational encoding language design yielded 148 thermodynamically optimized sequences with Hamming string distance ≥ 3 and total read length <100 bases for facile sequencing. Ligation is efficient (70% yield), specific, and directional over 6 encoding positions. A series of isomers served as a testbed for DESPS’s utility in split-and-pool diversification. Single-bead quantitative PCR detected 9 × 10^4 molecules/bead, and sequencing allowed for elucidation of each compound’s synthetic history. We applied DESPS to the combinatorial synthesis of a 75 645-member OBOC library containing scaffold, stereochemical, and regiochemical diversity using mixed-scale resin (160-μm quality control beads and 10-μm screening beads). Tandem DNA sequencing/MALDI-TOF MS analysis of 19 quality control beads showed excellent agreement (<1 ppt) between DNA sequence-predicted mass and the observed mass. DESPS synergistically unites the advantages of solid-phase synthesis and DNA encoding, enabling single-bead structural elucidation of complex compounds and synthesis using reactions normally considered incompatible with unprotected DNA. The widespread availability of inexpensive oligonucleotide synthesis, enzymes, DNA sequencing, and PCR makes implementation of DESPS straightforward, and may prompt the chemistry community to revisit the synthesis of more complex and diverse libraries.
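The description states that the encoding sequences have a pairwise Hamming string distance ≥ 3. The following is a minimal sketch of what that constraint means in practice, not the authors' design procedure: it uses a simple lexicographic greedy search (an assumption made here for illustration) and ignores the thermodynamic optimization mentioned in the abstract. The function names `hamming`, `greedy_codebook`, and `decode` are hypothetical.

```python
from itertools import product


def hamming(a, b):
    """Count positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def greedy_codebook(length, min_dist=3, alphabet="ACGT"):
    """Greedily collect sequences whose pairwise Hamming distance is >= min_dist.

    Illustrative only: candidates are scanned in lexicographic order and kept
    if they are far enough from every sequence already chosen.
    """
    chosen = []
    for cand in product(alphabet, repeat=length):
        seq = "".join(cand)
        if all(hamming(seq, c) >= min_dist for c in chosen):
            chosen.append(seq)
    return chosen


def decode(read, codebook):
    """Return the codeword within one mismatch of `read`, or None.

    With a minimum distance of 3, at most one codeword can lie within
    one substitution of any read, so a single error is correctable.
    """
    hits = [c for c in codebook if hamming(read, c) <= 1]
    return hits[0] if len(hits) == 1 else None


if __name__ == "__main__":
    book = greedy_codebook(4)
    print(len(book), "codewords of length 4, e.g.", book[:3])
    print(decode("AAAC", book))
```

A minimum distance of 3 is the smallest value that lets a single substitution in a sequencing read be assigned unambiguously to one codeword, which is why the decoder above accepts reads with at most one mismatch.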
format Online
Article
Text
id pubmed-4571006
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-4571006 2015-09-22 DNA-Encoded Solid-Phase Synthesis: Encoding Language Design and Complex Oligomer Library Synthesis MacConnell, Andrew B. McEnaney, Patrick J. Cavett, Valerie J. Paegel, Brian M. ACS Comb Sci American Chemical Society 2015-08-20 2015-09-14 /pmc/articles/PMC4571006/ /pubmed/26290177 http://dx.doi.org/10.1021/acscombsci.5b00106 Text en Copyright © 2015 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html), which permits copying and redistribution of the article or any adaptations for non-commercial purposes.
title DNA-Encoded Solid-Phase Synthesis: Encoding Language Design and Complex Oligomer Library Synthesis