Cargando…
Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations
BACKGROUND: Sort-seq is an effective approach for simultaneous activity measurements in a large-scale library, combining flow cytometry, deep sequencing, and statistical inference. Such assays enable the characterization of functional landscapes at unprecedented scale for a wide-reaching array of bi...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4784318/ https://www.ncbi.nlm.nih.gov/pubmed/26956374 http://dx.doi.org/10.1186/s12864-016-2533-5 |
_version_ | 1782420243119865856 |
---|---|
author | Peterman, Neil Levine, Erel |
author_facet | Peterman, Neil Levine, Erel |
author_sort | Peterman, Neil |
collection | PubMed |
description | BACKGROUND: Sort-seq is an effective approach for simultaneous activity measurements in a large-scale library, combining flow cytometry, deep sequencing, and statistical inference. Such assays enable the characterization of functional landscapes at unprecedented scale for a wide-reaching array of biological molecules and functionalities in vivo. Applications of sort-seq range from footprinting to establishing quantitative models of biological systems and rational design of synthetic genetic elements. Nearly as diverse are implementations of this technique, reflecting key design choices with extensive impact on the scope and accuracy the results. Yet how to make these choices remains unclear. Here we investigate the effects of alternative sort-seq designs and inference methods on the information output using mathematical formulation and simulations. RESULTS: We identify key intrinsic properties of any system of interest with practical implications for sort-seq assays, depending on the experimental goals. The fluorescence range and cell-to-cell variability specify the number of sorted populations needed for quantitative measurements that are precise and unbiased. These factors also indicate cases where an enrichment-based approach that uses a single sorted population can offer satisfactory results. These predications of our model are corroborated using re-analysis of published data. We explore implications of these results for quantitative modeling and library design. CONCLUSIONS: Sort-seq assays can be streamlined by reducing the number of sorted populations, saving considerable resources. Simple preliminary experiments can guide optimal experiment design, minimizing cost while maintaining the maximal information output and avoiding latent biases. These insights can facilitate future applications of this highly adaptable technique. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2533-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4784318 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-47843182016-03-10 Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations Peterman, Neil Levine, Erel BMC Genomics Methodology Article BACKGROUND: Sort-seq is an effective approach for simultaneous activity measurements in a large-scale library, combining flow cytometry, deep sequencing, and statistical inference. Such assays enable the characterization of functional landscapes at unprecedented scale for a wide-reaching array of biological molecules and functionalities in vivo. Applications of sort-seq range from footprinting to establishing quantitative models of biological systems and rational design of synthetic genetic elements. Nearly as diverse are implementations of this technique, reflecting key design choices with extensive impact on the scope and accuracy the results. Yet how to make these choices remains unclear. Here we investigate the effects of alternative sort-seq designs and inference methods on the information output using mathematical formulation and simulations. RESULTS: We identify key intrinsic properties of any system of interest with practical implications for sort-seq assays, depending on the experimental goals. The fluorescence range and cell-to-cell variability specify the number of sorted populations needed for quantitative measurements that are precise and unbiased. These factors also indicate cases where an enrichment-based approach that uses a single sorted population can offer satisfactory results. These predications of our model are corroborated using re-analysis of published data. We explore implications of these results for quantitative modeling and library design. CONCLUSIONS: Sort-seq assays can be streamlined by reducing the number of sorted populations, saving considerable resources. Simple preliminary experiments can guide optimal experiment design, minimizing cost while maintaining the maximal information output and avoiding latent biases. These insights can facilitate future applications of this highly adaptable technique. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2533-5) contains supplementary material, which is available to authorized users. BioMed Central 2016-03-09 /pmc/articles/PMC4784318/ /pubmed/26956374 http://dx.doi.org/10.1186/s12864-016-2533-5 Text en © Peterman and Levine. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Peterman, Neil Levine, Erel Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations |
title | Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations |
title_full | Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations |
title_fullStr | Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations |
title_full_unstemmed | Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations |
title_short | Sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations |
title_sort | sort-seq under the hood: implications of design choices on large-scale characterization of sequence-function relations |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4784318/ https://www.ncbi.nlm.nih.gov/pubmed/26956374 http://dx.doi.org/10.1186/s12864-016-2533-5 |
work_keys_str_mv | AT petermanneil sortsequnderthehoodimplicationsofdesignchoicesonlargescalecharacterizationofsequencefunctionrelations AT levineerel sortsequnderthehoodimplicationsofdesignchoicesonlargescalecharacterizationofsequencefunctionrelations |