Cargando…

s-dePooler: determination of polymorphism carriers from overlapping DNA pools

BACKGROUND: Samples pooling is a method widely used in studies to reduce costs and labour. DNA sample pooling combined with massive parallel sequencing is a powerful tool for discovering DNA variants (polymorphisms) in large analysing populations, which is the base of such research fields as Genome-...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhernakov, Aleksandr Igorevich, Afonin, Alexey Mikhailovich, Gavriliuk, Natalia Dmitrievna, Moiseeva, Olga Mikhailovna, Zhukov, Vladimir Aleksandrovich
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6343301/
https://www.ncbi.nlm.nih.gov/pubmed/30669964
http://dx.doi.org/10.1186/s12859-019-2616-9
_version_ 1783389258299473920
author Zhernakov, Aleksandr Igorevich
Afonin, Alexey Mikhailovich
Gavriliuk, Natalia Dmitrievna
Moiseeva, Olga Mikhailovna
Zhukov, Vladimir Aleksandrovich
author_facet Zhernakov, Aleksandr Igorevich
Afonin, Alexey Mikhailovich
Gavriliuk, Natalia Dmitrievna
Moiseeva, Olga Mikhailovna
Zhukov, Vladimir Aleksandrovich
author_sort Zhernakov, Aleksandr Igorevich
collection PubMed
description BACKGROUND: Samples pooling is a method widely used in studies to reduce costs and labour. DNA sample pooling combined with massive parallel sequencing is a powerful tool for discovering DNA variants (polymorphisms) in large analysing populations, which is the base of such research fields as Genome-Wide Association Studies, evolutionary and population studies, etc. Usage of overlapping pools where each sample is present in multiple pools can enhance the accuracy of polymorphism detection and allow identifying carriers of rare-variants. Surprisingly there is a lack of tools for result interpretation and carrier identification, i.e. for “depooling”. RESULTS: Here we present s-dePooler, the application for analysis of pooling experiments data. s-dePooler uses the variants information (VCF-file) and the pooling scheme to produce a list of candidate carriers for each polymorphism. We incorporated s-dePooler into a pipeline (dePoP) for automation of pooling analysis. The performance of the pipeline was tested on a synthetic dataset built using the 1000 Genomes Project data, resulting in the successful identification 97% of carriers of polymorphisms present in fewer than ~ 10% of carriers. CONCLUSIONS: s-dePooler along with dePoP can be used to identify carriers of polymorphisms in overlapping pools, and is compatible with any pooling scheme with equivalent molar ratios of pooled samples. s-dePooler and dePoP with usage instructions and test data are freely available at https://github.com/lab9arriam/depop. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2616-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6343301
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63433012019-01-24 s-dePooler: determination of polymorphism carriers from overlapping DNA pools Zhernakov, Aleksandr Igorevich Afonin, Alexey Mikhailovich Gavriliuk, Natalia Dmitrievna Moiseeva, Olga Mikhailovna Zhukov, Vladimir Aleksandrovich BMC Bioinformatics Software BACKGROUND: Samples pooling is a method widely used in studies to reduce costs and labour. DNA sample pooling combined with massive parallel sequencing is a powerful tool for discovering DNA variants (polymorphisms) in large analysing populations, which is the base of such research fields as Genome-Wide Association Studies, evolutionary and population studies, etc. Usage of overlapping pools where each sample is present in multiple pools can enhance the accuracy of polymorphism detection and allow identifying carriers of rare-variants. Surprisingly there is a lack of tools for result interpretation and carrier identification, i.e. for “depooling”. RESULTS: Here we present s-dePooler, the application for analysis of pooling experiments data. s-dePooler uses the variants information (VCF-file) and the pooling scheme to produce a list of candidate carriers for each polymorphism. We incorporated s-dePooler into a pipeline (dePoP) for automation of pooling analysis. The performance of the pipeline was tested on a synthetic dataset built using the 1000 Genomes Project data, resulting in the successful identification 97% of carriers of polymorphisms present in fewer than ~ 10% of carriers. CONCLUSIONS: s-dePooler along with dePoP can be used to identify carriers of polymorphisms in overlapping pools, and is compatible with any pooling scheme with equivalent molar ratios of pooled samples. s-dePooler and dePoP with usage instructions and test data are freely available at https://github.com/lab9arriam/depop. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2616-9) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-22 /pmc/articles/PMC6343301/ /pubmed/30669964 http://dx.doi.org/10.1186/s12859-019-2616-9 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Zhernakov, Aleksandr Igorevich
Afonin, Alexey Mikhailovich
Gavriliuk, Natalia Dmitrievna
Moiseeva, Olga Mikhailovna
Zhukov, Vladimir Aleksandrovich
s-dePooler: determination of polymorphism carriers from overlapping DNA pools
title s-dePooler: determination of polymorphism carriers from overlapping DNA pools
title_full s-dePooler: determination of polymorphism carriers from overlapping DNA pools
title_fullStr s-dePooler: determination of polymorphism carriers from overlapping DNA pools
title_full_unstemmed s-dePooler: determination of polymorphism carriers from overlapping DNA pools
title_short s-dePooler: determination of polymorphism carriers from overlapping DNA pools
title_sort s-depooler: determination of polymorphism carriers from overlapping dna pools
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6343301/
https://www.ncbi.nlm.nih.gov/pubmed/30669964
http://dx.doi.org/10.1186/s12859-019-2616-9
work_keys_str_mv AT zhernakovaleksandrigorevich sdepoolerdeterminationofpolymorphismcarriersfromoverlappingdnapools
AT afoninalexeymikhailovich sdepoolerdeterminationofpolymorphismcarriersfromoverlappingdnapools
AT gavriliuknataliadmitrievna sdepoolerdeterminationofpolymorphismcarriersfromoverlappingdnapools
AT moiseevaolgamikhailovna sdepoolerdeterminationofpolymorphismcarriersfromoverlappingdnapools
AT zhukovvladimiraleksandrovich sdepoolerdeterminationofpolymorphismcarriersfromoverlappingdnapools