Cargando…

Weighted pooling—practical and cost-effective techniques for pooled high-throughput sequencing

Motivation: Despite the rapid decline in sequencing costs, sequencing large cohorts of individuals is still prohibitively expensive. Recently, several sophisticated pooling designs were suggested that can identify carriers of rare alleles in large cohorts with a significantly smaller number of pools...

Descripción completa

Detalles Bibliográficos
Autores principales:	Golan, David, Erlich, Yaniv, Rosset, Saharon
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2012
Materias:	Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3371840/ https://www.ncbi.nlm.nih.gov/pubmed/22689761 http://dx.doi.org/10.1093/bioinformatics/bts208

_version_	1782235268176150528
author	Golan, David Erlich, Yaniv Rosset, Saharon
author_facet	Golan, David Erlich, Yaniv Rosset, Saharon
author_sort	Golan, David
collection	PubMed
description	Motivation: Despite the rapid decline in sequencing costs, sequencing large cohorts of individuals is still prohibitively expensive. Recently, several sophisticated pooling designs were suggested that can identify carriers of rare alleles in large cohorts with a significantly smaller number of pools, thus dramatically reducing the cost of such large-scale sequencing projects. These approaches use combinatorial pooling designs where each individual is either present or absent from a pool. One can then infer the number of carriers in a pool, and by combining information across pools, reconstruct the identity of the carriers. Results: We show that one can gain further efficiency and cost reduction by using ‘weighted’ designs, in which different individuals donate different amounts of DNA to the pools. Intuitively, in this situation, the number of mutant reads in a pool does not only indicate the number of carriers, but also their identity. We describe and study a powerful example of such weighted designs, using non-overlapping pools. We demonstrate that this approach is not only easier to implement and analyze but is also competitive in terms of accuracy with combinatorial designs when identifying rare variants, and is superior when sequencing common variants. We then discuss how weighting can be incorporated into existing combinatorial designs to increase their accuracy and demonstrate the resulting improvement using simulations. Finally, we argue that weighted designs have enough power to facilitate detection of common alleles, so they can be used as a cornerstone of whole-exome sequencing projects. Contact: saharon@post.tau.ac.il
format	Online Article Text
id	pubmed-3371840
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-33718402012-06-11 Weighted pooling—practical and cost-effective techniques for pooled high-throughput sequencing Golan, David Erlich, Yaniv Rosset, Saharon Bioinformatics Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa Motivation: Despite the rapid decline in sequencing costs, sequencing large cohorts of individuals is still prohibitively expensive. Recently, several sophisticated pooling designs were suggested that can identify carriers of rare alleles in large cohorts with a significantly smaller number of pools, thus dramatically reducing the cost of such large-scale sequencing projects. These approaches use combinatorial pooling designs where each individual is either present or absent from a pool. One can then infer the number of carriers in a pool, and by combining information across pools, reconstruct the identity of the carriers. Results: We show that one can gain further efficiency and cost reduction by using ‘weighted’ designs, in which different individuals donate different amounts of DNA to the pools. Intuitively, in this situation, the number of mutant reads in a pool does not only indicate the number of carriers, but also their identity. We describe and study a powerful example of such weighted designs, using non-overlapping pools. We demonstrate that this approach is not only easier to implement and analyze but is also competitive in terms of accuracy with combinatorial designs when identifying rare variants, and is superior when sequencing common variants. We then discuss how weighting can be incorporated into existing combinatorial designs to increase their accuracy and demonstrate the resulting improvement using simulations. Finally, we argue that weighted designs have enough power to facilitate detection of common alleles, so they can be used as a cornerstone of whole-exome sequencing projects. Contact: saharon@post.tau.ac.il Oxford University Press 2012-06-15 2012-06-09 /pmc/articles/PMC3371840/ /pubmed/22689761 http://dx.doi.org/10.1093/bioinformatics/bts208 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa Golan, David Erlich, Yaniv Rosset, Saharon Weighted pooling—practical and cost-effective techniques for pooled high-throughput sequencing
title	Weighted pooling—practical and cost-effective techniques for pooled high-throughput sequencing
title_full	Weighted pooling—practical and cost-effective techniques for pooled high-throughput sequencing
title_fullStr	Weighted pooling—practical and cost-effective techniques for pooled high-throughput sequencing
title_full_unstemmed	Weighted pooling—practical and cost-effective techniques for pooled high-throughput sequencing
title_short	Weighted pooling—practical and cost-effective techniques for pooled high-throughput sequencing
title_sort	weighted pooling—practical and cost-effective techniques for pooled high-throughput sequencing
topic	Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3371840/ https://www.ncbi.nlm.nih.gov/pubmed/22689761 http://dx.doi.org/10.1093/bioinformatics/bts208
work_keys_str_mv	AT golandavid weightedpoolingpracticalandcosteffectivetechniquesforpooledhighthroughputsequencing AT erlichyaniv weightedpoolingpracticalandcosteffectivetechniquesforpooledhighthroughputsequencing AT rossetsaharon weightedpoolingpracticalandcosteffectivetechniquesforpooledhighthroughputsequencing

Weighted pooling—practical and cost-effective techniques for pooled high-throughput sequencing

Ejemplares similares