Cargando…

Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations

BACKGROUND: Sort-seq is an effective approach for simultaneous activity measurements in a large-scale library, combining flow cytometry, deep sequencing, and statistical inference. Such assays enable the characterization of functional landscapes at unprecedented scale for a wide-reaching array of bi...

Descripción completa

Detalles Bibliográficos
Autores principales: Peterman, Neil, Levine, Erel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4784318/
https://www.ncbi.nlm.nih.gov/pubmed/26956374
http://dx.doi.org/10.1186/s12864-016-2533-5
_version_ 1782420243119865856
author Peterman, Neil
Levine, Erel
author_facet Peterman, Neil
Levine, Erel
author_sort Peterman, Neil
collection PubMed
description BACKGROUND: Sort-seq is an effective approach for simultaneous activity measurements in a large-scale library, combining flow cytometry, deep sequencing, and statistical inference. Such assays enable the characterization of functional landscapes at unprecedented scale for a wide-reaching array of biological molecules and functionalities in vivo. Applications of sort-seq range from footprinting to establishing quantitative models of biological systems and rational design of synthetic genetic elements. Nearly as diverse are implementations of this technique, reflecting key design choices with extensive impact on the scope and accuracy the results. Yet how to make these choices remains unclear. Here we investigate the effects of alternative sort-seq designs and inference methods on the information output using mathematical formulation and simulations. RESULTS: We identify key intrinsic properties of any system of interest with practical implications for sort-seq assays, depending on the experimental goals. The fluorescence range and cell-to-cell variability specify the number of sorted populations needed for quantitative measurements that are precise and unbiased. These factors also indicate cases where an enrichment-based approach that uses a single sorted population can offer satisfactory results. These predications of our model are corroborated using re-analysis of published data. We explore implications of these results for quantitative modeling and library design. CONCLUSIONS: Sort-seq assays can be streamlined by reducing the number of sorted populations, saving considerable resources. Simple preliminary experiments can guide optimal experiment design, minimizing cost while maintaining the maximal information output and avoiding latent biases. These insights can facilitate future applications of this highly adaptable technique. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2533-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4784318
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47843182016-03-10 Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations Peterman, Neil Levine, Erel BMC Genomics Methodology Article BACKGROUND: Sort-seq is an effective approach for simultaneous activity measurements in a large-scale library, combining flow cytometry, deep sequencing, and statistical inference. Such assays enable the characterization of functional landscapes at unprecedented scale for a wide-reaching array of biological molecules and functionalities in vivo. Applications of sort-seq range from footprinting to establishing quantitative models of biological systems and rational design of synthetic genetic elements. Nearly as diverse are implementations of this technique, reflecting key design choices with extensive impact on the scope and accuracy the results. Yet how to make these choices remains unclear. Here we investigate the effects of alternative sort-seq designs and inference methods on the information output using mathematical formulation and simulations. RESULTS: We identify key intrinsic properties of any system of interest with practical implications for sort-seq assays, depending on the experimental goals. The fluorescence range and cell-to-cell variability specify the number of sorted populations needed for quantitative measurements that are precise and unbiased. These factors also indicate cases where an enrichment-based approach that uses a single sorted population can offer satisfactory results. These predications of our model are corroborated using re-analysis of published data. We explore implications of these results for quantitative modeling and library design. CONCLUSIONS: Sort-seq assays can be streamlined by reducing the number of sorted populations, saving considerable resources. Simple preliminary experiments can guide optimal experiment design, minimizing cost while maintaining the maximal information output and avoiding latent biases. These insights can facilitate future applications of this highly adaptable technique. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2533-5) contains supplementary material, which is available to authorized users. BioMed Central 2016-03-09 /pmc/articles/PMC4784318/ /pubmed/26956374 http://dx.doi.org/10.1186/s12864-016-2533-5 Text en © Peterman and Levine. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Peterman, Neil
Levine, Erel
Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations
title Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations
title_full Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations
title_fullStr Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations
title_full_unstemmed Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations
title_short Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations
title_sort sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4784318/
https://www.ncbi.nlm.nih.gov/pubmed/26956374
http://dx.doi.org/10.1186/s12864-016-2533-5
work_keys_str_mv AT petermanneil sortsequnderthehoodimplicationsofdesignchoicesonlargescalecharacterizationofsequencefunctionrelations
AT levineerel sortsequnderthehoodimplicationsofdesignchoicesonlargescalecharacterizationofsequencefunctionrelations