Cargando…

Detecting very low allele fraction variants using targeted DNA sequencing and a novel molecular barcode-aware variant caller

BACKGROUND: Detection of DNA mutations at very low allele fractions with high accuracy will significantly improve the effectiveness of precision medicine for cancer patients. To achieve this goal through next generation sequencing, researchers need a detection method that 1) captures rare mutation-c...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Chang, Nezami Ranjbar, Mohammad R., Wu, Zhong, DiCarlo, John, Wang, Yexun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209917/
https://www.ncbi.nlm.nih.gov/pubmed/28049435
http://dx.doi.org/10.1186/s12864-016-3425-4
_version_ 1782490820483481600
author Xu, Chang
Nezami Ranjbar, Mohammad R.
Wu, Zhong
DiCarlo, John
Wang, Yexun
author_facet Xu, Chang
Nezami Ranjbar, Mohammad R.
Wu, Zhong
DiCarlo, John
Wang, Yexun
author_sort Xu, Chang
collection PubMed
description BACKGROUND: Detection of DNA mutations at very low allele fractions with high accuracy will significantly improve the effectiveness of precision medicine for cancer patients. To achieve this goal through next generation sequencing, researchers need a detection method that 1) captures rare mutation-containing DNA fragments efficiently in the mix of abundant wild-type DNA; 2) sequences the DNA library extensively to deep coverage; and 3) distinguishes low level true variants from amplification and sequencing errors with high accuracy. Targeted enrichment using PCR primers provides researchers with a convenient way to achieve deep sequencing for a small, yet most relevant region using benchtop sequencers. Molecular barcoding (or indexing) provides a unique solution for reducing sequencing artifacts analytically. Although different molecular barcoding schemes have been reported in recent literature, most variant calling has been done on limited targets, using simple custom scripts. The analytical performance of barcode-aware variant calling can be significantly improved by incorporating advanced statistical models. RESULTS: We present here a highly efficient, simple and scalable enrichment protocol that integrates molecular barcodes in multiplex PCR amplification. In addition, we developed smCounter, an open source, generic, barcode-aware variant caller based on a Bayesian probabilistic model. smCounter was optimized and benchmarked on two independent read sets with SNVs and indels at 5 and 1% allele fractions. Variants were called with very good sensitivity and specificity within coding regions. CONCLUSIONS: We demonstrated that we can accurately detect somatic mutations with allele fractions as low as 1% in coding regions using our enrichment protocol and variant caller. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3425-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5209917
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52099172017-01-04 Detecting very low allele fraction variants using targeted DNA sequencing and a novel molecular barcode-aware variant caller Xu, Chang Nezami Ranjbar, Mohammad R. Wu, Zhong DiCarlo, John Wang, Yexun BMC Genomics Methodology Article BACKGROUND: Detection of DNA mutations at very low allele fractions with high accuracy will significantly improve the effectiveness of precision medicine for cancer patients. To achieve this goal through next generation sequencing, researchers need a detection method that 1) captures rare mutation-containing DNA fragments efficiently in the mix of abundant wild-type DNA; 2) sequences the DNA library extensively to deep coverage; and 3) distinguishes low level true variants from amplification and sequencing errors with high accuracy. Targeted enrichment using PCR primers provides researchers with a convenient way to achieve deep sequencing for a small, yet most relevant region using benchtop sequencers. Molecular barcoding (or indexing) provides a unique solution for reducing sequencing artifacts analytically. Although different molecular barcoding schemes have been reported in recent literature, most variant calling has been done on limited targets, using simple custom scripts. The analytical performance of barcode-aware variant calling can be significantly improved by incorporating advanced statistical models. RESULTS: We present here a highly efficient, simple and scalable enrichment protocol that integrates molecular barcodes in multiplex PCR amplification. In addition, we developed smCounter, an open source, generic, barcode-aware variant caller based on a Bayesian probabilistic model. smCounter was optimized and benchmarked on two independent read sets with SNVs and indels at 5 and 1% allele fractions. Variants were called with very good sensitivity and specificity within coding regions. CONCLUSIONS: We demonstrated that we can accurately detect somatic mutations with allele fractions as low as 1% in coding regions using our enrichment protocol and variant caller. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3425-4) contains supplementary material, which is available to authorized users. BioMed Central 2017-01-03 /pmc/articles/PMC5209917/ /pubmed/28049435 http://dx.doi.org/10.1186/s12864-016-3425-4 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Xu, Chang
Nezami Ranjbar, Mohammad R.
Wu, Zhong
DiCarlo, John
Wang, Yexun
Detecting very low allele fraction variants using targeted DNA sequencing and a novel molecular barcode-aware variant caller
title Detecting very low allele fraction variants using targeted DNA sequencing and a novel molecular barcode-aware variant caller
title_full Detecting very low allele fraction variants using targeted DNA sequencing and a novel molecular barcode-aware variant caller
title_fullStr Detecting very low allele fraction variants using targeted DNA sequencing and a novel molecular barcode-aware variant caller
title_full_unstemmed Detecting very low allele fraction variants using targeted DNA sequencing and a novel molecular barcode-aware variant caller
title_short Detecting very low allele fraction variants using targeted DNA sequencing and a novel molecular barcode-aware variant caller
title_sort detecting very low allele fraction variants using targeted dna sequencing and a novel molecular barcode-aware variant caller
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209917/
https://www.ncbi.nlm.nih.gov/pubmed/28049435
http://dx.doi.org/10.1186/s12864-016-3425-4
work_keys_str_mv AT xuchang detectingverylowallelefractionvariantsusingtargeteddnasequencingandanovelmolecularbarcodeawarevariantcaller
AT nezamiranjbarmohammadr detectingverylowallelefractionvariantsusingtargeteddnasequencingandanovelmolecularbarcodeawarevariantcaller
AT wuzhong detectingverylowallelefractionvariantsusingtargeteddnasequencingandanovelmolecularbarcodeawarevariantcaller
AT dicarlojohn detectingverylowallelefractionvariantsusingtargeteddnasequencingandanovelmolecularbarcodeawarevariantcaller
AT wangyexun detectingverylowallelefractionvariantsusingtargeteddnasequencingandanovelmolecularbarcodeawarevariantcaller