Cargando…

Genotyping common and rare variation using overlapping pool sequencing

BACKGROUND: Recent advances in sequencing technologies set the stage for large, population based studies, in which the ANA or RNA of thousands of individuals will be sequenced. Currently, however, such studies are still infeasible using a straightforward sequencing approach; as a result, recently a...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Dan, Zaitlen, Noah, Pasaniuc, Bogdan, Eskin, Eleazar, Halperin, Eran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3194190/
https://www.ncbi.nlm.nih.gov/pubmed/21989232
http://dx.doi.org/10.1186/1471-2105-12-S6-S2
_version_ 1782213926790889472
author He, Dan
Zaitlen, Noah
Pasaniuc, Bogdan
Eskin, Eleazar
Halperin, Eran
author_facet He, Dan
Zaitlen, Noah
Pasaniuc, Bogdan
Eskin, Eleazar
Halperin, Eran
author_sort He, Dan
collection PubMed
description BACKGROUND: Recent advances in sequencing technologies set the stage for large, population based studies, in which the ANA or RNA of thousands of individuals will be sequenced. Currently, however, such studies are still infeasible using a straightforward sequencing approach; as a result, recently a few multiplexing schemes have been suggested, in which a small number of ANA pools are sequenced, and the results are then deconvoluted using compressed sensing or similar approaches. These methods, however, are limited to the detection of rare variants. RESULTS: In this paper we provide a new algorithm for the deconvolution of DNA pools multiplexing schemes. The presented algorithm utilizes a likelihood model and linear programming. The approach allows for the addition of external data, particularly imputation data, resulting in a flexible environment that is suitable for different applications. CONCLUSIONS: Particularly, we demonstrate that both low and high allele frequency SNPs can be accurately genotyped when the DNA pooling scheme is performed in conjunction with microarray genotyping and imputation. Additionally, we demonstrate the use of our framework for the detection of cancer fusion genes from RNA sequences.
format Online
Article
Text
id pubmed-3194190
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31941902011-10-17 Genotyping common and rare variation using overlapping pool sequencing He, Dan Zaitlen, Noah Pasaniuc, Bogdan Eskin, Eleazar Halperin, Eran BMC Bioinformatics Proceedings BACKGROUND: Recent advances in sequencing technologies set the stage for large, population based studies, in which the ANA or RNA of thousands of individuals will be sequenced. Currently, however, such studies are still infeasible using a straightforward sequencing approach; as a result, recently a few multiplexing schemes have been suggested, in which a small number of ANA pools are sequenced, and the results are then deconvoluted using compressed sensing or similar approaches. These methods, however, are limited to the detection of rare variants. RESULTS: In this paper we provide a new algorithm for the deconvolution of DNA pools multiplexing schemes. The presented algorithm utilizes a likelihood model and linear programming. The approach allows for the addition of external data, particularly imputation data, resulting in a flexible environment that is suitable for different applications. CONCLUSIONS: Particularly, we demonstrate that both low and high allele frequency SNPs can be accurately genotyped when the DNA pooling scheme is performed in conjunction with microarray genotyping and imputation. Additionally, we demonstrate the use of our framework for the detection of cancer fusion genes from RNA sequences. BioMed Central 2011-07-28 /pmc/articles/PMC3194190/ /pubmed/21989232 http://dx.doi.org/10.1186/1471-2105-12-S6-S2 Text en Copyright ©2011 He et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
He, Dan
Zaitlen, Noah
Pasaniuc, Bogdan
Eskin, Eleazar
Halperin, Eran
Genotyping common and rare variation using overlapping pool sequencing
title Genotyping common and rare variation using overlapping pool sequencing
title_full Genotyping common and rare variation using overlapping pool sequencing
title_fullStr Genotyping common and rare variation using overlapping pool sequencing
title_full_unstemmed Genotyping common and rare variation using overlapping pool sequencing
title_short Genotyping common and rare variation using overlapping pool sequencing
title_sort genotyping common and rare variation using overlapping pool sequencing
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3194190/
https://www.ncbi.nlm.nih.gov/pubmed/21989232
http://dx.doi.org/10.1186/1471-2105-12-S6-S2
work_keys_str_mv AT hedan genotypingcommonandrarevariationusingoverlappingpoolsequencing
AT zaitlennoah genotypingcommonandrarevariationusingoverlappingpoolsequencing
AT pasaniucbogdan genotypingcommonandrarevariationusingoverlappingpoolsequencing
AT eskineleazar genotypingcommonandrarevariationusingoverlappingpoolsequencing
AT halperineran genotypingcommonandrarevariationusingoverlappingpoolsequencing