Cargando…

pyAmpli: an amplicon-based variant filter pipeline for targeted resequencing data

BACKGROUND: Haloplex targeted resequencing is a popular method to analyze both germline and somatic variants in gene panels. However, involved wet-lab procedures may introduce false positives that need to be considered in subsequent data-analysis. No variant filtering rationale addressing amplicon e...

Descripción completa

Detalles Bibliográficos
Autores principales: Beyens, Matthias, Boeckx, Nele, Van Camp, Guy, Op de Beeck, Ken, Vandeweyer, Geert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5729461/
https://www.ncbi.nlm.nih.gov/pubmed/29237398
http://dx.doi.org/10.1186/s12859-017-1985-1
_version_ 1783286201383387136
author Beyens, Matthias
Boeckx, Nele
Van Camp, Guy
Op de Beeck, Ken
Vandeweyer, Geert
author_facet Beyens, Matthias
Boeckx, Nele
Van Camp, Guy
Op de Beeck, Ken
Vandeweyer, Geert
author_sort Beyens, Matthias
collection PubMed
description BACKGROUND: Haloplex targeted resequencing is a popular method to analyze both germline and somatic variants in gene panels. However, involved wet-lab procedures may introduce false positives that need to be considered in subsequent data-analysis. No variant filtering rationale addressing amplicon enrichment related systematic errors, in the form of an all-in-one package, exists to our knowledge. RESULTS: We present pyAmpli, a platform independent parallelized Python package that implements an amplicon-based germline and somatic variant filtering strategy for Haloplex data. pyAmpli can filter variants for systematic errors by user pre-defined criteria. We show that pyAmpli significantly increases specificity, without reducing sensitivity, essential for reporting true positive clinical relevant mutations in gene panel data. CONCLUSIONS: pyAmpli is an easy-to-use software tool which increases the true positive variant call rate in targeted resequencing data. It specifically reduces errors related to PCR-based enrichment of targeted regions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1985-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5729461
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57294612017-12-18 pyAmpli: an amplicon-based variant filter pipeline for targeted resequencing data Beyens, Matthias Boeckx, Nele Van Camp, Guy Op de Beeck, Ken Vandeweyer, Geert BMC Bioinformatics Software BACKGROUND: Haloplex targeted resequencing is a popular method to analyze both germline and somatic variants in gene panels. However, involved wet-lab procedures may introduce false positives that need to be considered in subsequent data-analysis. No variant filtering rationale addressing amplicon enrichment related systematic errors, in the form of an all-in-one package, exists to our knowledge. RESULTS: We present pyAmpli, a platform independent parallelized Python package that implements an amplicon-based germline and somatic variant filtering strategy for Haloplex data. pyAmpli can filter variants for systematic errors by user pre-defined criteria. We show that pyAmpli significantly increases specificity, without reducing sensitivity, essential for reporting true positive clinical relevant mutations in gene panel data. CONCLUSIONS: pyAmpli is an easy-to-use software tool which increases the true positive variant call rate in targeted resequencing data. It specifically reduces errors related to PCR-based enrichment of targeted regions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1985-1) contains supplementary material, which is available to authorized users. BioMed Central 2017-12-14 /pmc/articles/PMC5729461/ /pubmed/29237398 http://dx.doi.org/10.1186/s12859-017-1985-1 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Beyens, Matthias
Boeckx, Nele
Van Camp, Guy
Op de Beeck, Ken
Vandeweyer, Geert
pyAmpli: an amplicon-based variant filter pipeline for targeted resequencing data
title pyAmpli: an amplicon-based variant filter pipeline for targeted resequencing data
title_full pyAmpli: an amplicon-based variant filter pipeline for targeted resequencing data
title_fullStr pyAmpli: an amplicon-based variant filter pipeline for targeted resequencing data
title_full_unstemmed pyAmpli: an amplicon-based variant filter pipeline for targeted resequencing data
title_short pyAmpli: an amplicon-based variant filter pipeline for targeted resequencing data
title_sort pyampli: an amplicon-based variant filter pipeline for targeted resequencing data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5729461/
https://www.ncbi.nlm.nih.gov/pubmed/29237398
http://dx.doi.org/10.1186/s12859-017-1985-1
work_keys_str_mv AT beyensmatthias pyamplianampliconbasedvariantfilterpipelinefortargetedresequencingdata
AT boeckxnele pyamplianampliconbasedvariantfilterpipelinefortargetedresequencingdata
AT vancampguy pyamplianampliconbasedvariantfilterpipelinefortargetedresequencingdata
AT opdebeeckken pyamplianampliconbasedvariantfilterpipelinefortargetedresequencingdata
AT vandeweyergeert pyamplianampliconbasedvariantfilterpipelinefortargetedresequencingdata