Cargando…

Determination of a Screening Metric for High Diversity DNA Libraries

The fields of antibody engineering, enzyme optimization and pathway construction rely increasingly on screening complex variant DNA libraries. These highly diverse libraries allow researchers to sample a maximized sequence space; and therefore, more rapidly identify proteins with significantly impro...

Descripción completa

Detalles Bibliográficos
Autores principales: Guido, Nicholas J., Handerson, Steven, Joseph, Elaine M., Leake, Devin, Kung, Li A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5145166/
https://www.ncbi.nlm.nih.gov/pubmed/27930689
http://dx.doi.org/10.1371/journal.pone.0167088
_version_ 1782473247579701248
author Guido, Nicholas J.
Handerson, Steven
Joseph, Elaine M.
Leake, Devin
Kung, Li A.
author_facet Guido, Nicholas J.
Handerson, Steven
Joseph, Elaine M.
Leake, Devin
Kung, Li A.
author_sort Guido, Nicholas J.
collection PubMed
description The fields of antibody engineering, enzyme optimization and pathway construction rely increasingly on screening complex variant DNA libraries. These highly diverse libraries allow researchers to sample a maximized sequence space; and therefore, more rapidly identify proteins with significantly improved activity. The current state of the art in synthetic biology allows for libraries with billions of variants, pushing the limits of researchers’ ability to qualify libraries for screening by measuring the traditional quality metrics of fidelity and diversity of variants. Instead, when screening variant libraries, researchers typically use a generic, and often insufficient, oversampling rate based on a common rule-of-thumb. We have developed methods to calculate a library-specific oversampling metric, based on fidelity, diversity, and representation of variants, which informs researchers, prior to screening the library, of the amount of oversampling required to ensure that the desired fraction of variant molecules will be sampled. To derive this oversampling metric, we developed a novel alignment tool to efficiently measure frequency counts of individual nucleotide variant positions using next-generation sequencing data. Next, we apply a method based on the “coupon collector” probability theory to construct a curve of upper bound estimates of the sampling size required for any desired variant coverage. The calculated oversampling metric will guide researchers to maximize their efficiency in using highly variant libraries.
format Online
Article
Text
id pubmed-5145166
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-51451662016-12-22 Determination of a Screening Metric for High Diversity DNA Libraries Guido, Nicholas J. Handerson, Steven Joseph, Elaine M. Leake, Devin Kung, Li A. PLoS One Research Article The fields of antibody engineering, enzyme optimization and pathway construction rely increasingly on screening complex variant DNA libraries. These highly diverse libraries allow researchers to sample a maximized sequence space; and therefore, more rapidly identify proteins with significantly improved activity. The current state of the art in synthetic biology allows for libraries with billions of variants, pushing the limits of researchers’ ability to qualify libraries for screening by measuring the traditional quality metrics of fidelity and diversity of variants. Instead, when screening variant libraries, researchers typically use a generic, and often insufficient, oversampling rate based on a common rule-of-thumb. We have developed methods to calculate a library-specific oversampling metric, based on fidelity, diversity, and representation of variants, which informs researchers, prior to screening the library, of the amount of oversampling required to ensure that the desired fraction of variant molecules will be sampled. To derive this oversampling metric, we developed a novel alignment tool to efficiently measure frequency counts of individual nucleotide variant positions using next-generation sequencing data. Next, we apply a method based on the “coupon collector” probability theory to construct a curve of upper bound estimates of the sampling size required for any desired variant coverage. The calculated oversampling metric will guide researchers to maximize their efficiency in using highly variant libraries. Public Library of Science 2016-12-08 /pmc/articles/PMC5145166/ /pubmed/27930689 http://dx.doi.org/10.1371/journal.pone.0167088 Text en © 2016 Guido et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Guido, Nicholas J.
Handerson, Steven
Joseph, Elaine M.
Leake, Devin
Kung, Li A.
Determination of a Screening Metric for High Diversity DNA Libraries
title Determination of a Screening Metric for High Diversity DNA Libraries
title_full Determination of a Screening Metric for High Diversity DNA Libraries
title_fullStr Determination of a Screening Metric for High Diversity DNA Libraries
title_full_unstemmed Determination of a Screening Metric for High Diversity DNA Libraries
title_short Determination of a Screening Metric for High Diversity DNA Libraries
title_sort determination of a screening metric for high diversity dna libraries
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5145166/
https://www.ncbi.nlm.nih.gov/pubmed/27930689
http://dx.doi.org/10.1371/journal.pone.0167088
work_keys_str_mv AT guidonicholasj determinationofascreeningmetricforhighdiversitydnalibraries
AT handersonsteven determinationofascreeningmetricforhighdiversitydnalibraries
AT josephelainem determinationofascreeningmetricforhighdiversitydnalibraries
AT leakedevin determinationofascreeningmetricforhighdiversitydnalibraries
AT kunglia determinationofascreeningmetricforhighdiversitydnalibraries