Cargando…

Poisson Statistics of Combinatorial Library Sampling Predict False Discovery Rates of Screening

[Image: see text] Microfluidic droplet-based screening of DNA-encoded one-bead-one-compound combinatorial libraries is a miniaturized, potentially widely distributable approach to small molecule discovery. In these screens, a microfluidic circuit distributes library beads into droplets of activity a...

Descripción completa

Detalles Bibliográficos
Autores principales: MacConnell, Andrew B., Paegel, Brian M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2017
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5558193/
https://www.ncbi.nlm.nih.gov/pubmed/28682059
http://dx.doi.org/10.1021/acscombsci.7b00061
_version_ 1783257353391439872
author MacConnell, Andrew B.
Paegel, Brian M.
author_facet MacConnell, Andrew B.
Paegel, Brian M.
author_sort MacConnell, Andrew B.
collection PubMed
description [Image: see text] Microfluidic droplet-based screening of DNA-encoded one-bead-one-compound combinatorial libraries is a miniaturized, potentially widely distributable approach to small molecule discovery. In these screens, a microfluidic circuit distributes library beads into droplets of activity assay reagent, photochemically cleaves the compound from the bead, then incubates and sorts the droplets based on assay result for subsequent DNA sequencing-based hit compound structure elucidation. Pilot experimental studies revealed that Poisson statistics describe nearly all aspects of such screens, prompting the development of simulations to understand system behavior. Monte Carlo screening simulation data showed that increasing mean library sampling (ε), mean droplet occupancy, or library hit rate all increase the false discovery rate (FDR). Compounds identified as hits on k > 1 beads (the replicate k class) were much more likely to be authentic hits than singletons (k = 1), in agreement with previous findings. Here, we explain this observation by deriving an equation for authenticity, which reduces to the product of a library sampling bias term (exponential in k) and a sampling saturation term (exponential in ε) setting a threshold that the k-dependent bias must overcome. The equation thus quantitatively describes why each hit structure’s FDR is based on its k class, and further predicts the feasibility of intentionally populating droplets with multiple library beads, assaying the micromixtures for function, and identifying the active members by statistical deconvolution.
format Online
Article
Text
id pubmed-5558193
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-55581932017-08-17 Poisson Statistics of Combinatorial Library Sampling Predict False Discovery Rates of Screening MacConnell, Andrew B. Paegel, Brian M. ACS Comb Sci [Image: see text] Microfluidic droplet-based screening of DNA-encoded one-bead-one-compound combinatorial libraries is a miniaturized, potentially widely distributable approach to small molecule discovery. In these screens, a microfluidic circuit distributes library beads into droplets of activity assay reagent, photochemically cleaves the compound from the bead, then incubates and sorts the droplets based on assay result for subsequent DNA sequencing-based hit compound structure elucidation. Pilot experimental studies revealed that Poisson statistics describe nearly all aspects of such screens, prompting the development of simulations to understand system behavior. Monte Carlo screening simulation data showed that increasing mean library sampling (ε), mean droplet occupancy, or library hit rate all increase the false discovery rate (FDR). Compounds identified as hits on k > 1 beads (the replicate k class) were much more likely to be authentic hits than singletons (k = 1), in agreement with previous findings. Here, we explain this observation by deriving an equation for authenticity, which reduces to the product of a library sampling bias term (exponential in k) and a sampling saturation term (exponential in ε) setting a threshold that the k-dependent bias must overcome. The equation thus quantitatively describes why each hit structure’s FDR is based on its k class, and further predicts the feasibility of intentionally populating droplets with multiple library beads, assaying the micromixtures for function, and identifying the active members by statistical deconvolution. American Chemical Society 2017-07-06 2017-08-14 /pmc/articles/PMC5558193/ /pubmed/28682059 http://dx.doi.org/10.1021/acscombsci.7b00061 Text en Copyright © 2017 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html) , which permits copying and redistribution of the article or any adaptations for non-commercial purposes.
spellingShingle MacConnell, Andrew B.
Paegel, Brian M.
Poisson Statistics of Combinatorial Library Sampling Predict False Discovery Rates of Screening
title Poisson Statistics of Combinatorial Library Sampling Predict False Discovery Rates of Screening
title_full Poisson Statistics of Combinatorial Library Sampling Predict False Discovery Rates of Screening
title_fullStr Poisson Statistics of Combinatorial Library Sampling Predict False Discovery Rates of Screening
title_full_unstemmed Poisson Statistics of Combinatorial Library Sampling Predict False Discovery Rates of Screening
title_short Poisson Statistics of Combinatorial Library Sampling Predict False Discovery Rates of Screening
title_sort poisson statistics of combinatorial library sampling predict false discovery rates of screening
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5558193/
https://www.ncbi.nlm.nih.gov/pubmed/28682059
http://dx.doi.org/10.1021/acscombsci.7b00061
work_keys_str_mv AT macconnellandrewb poissonstatisticsofcombinatoriallibrarysamplingpredictfalsediscoveryratesofscreening
AT paegelbrianm poissonstatisticsofcombinatoriallibrarysamplingpredictfalsediscoveryratesofscreening