Cargando…

A novel virtual barcode strategy for accurate panel-wide variant calling in circulating tumor DNA

BACKGROUND: Hybrid capture-based next-generation sequencing of DNA has been widely applied in the detection of circulating tumor DNA (ctDNA). Various methods have been proposed for ctDNA detection, but low-allelic-fraction (AF) variants are still a great challenge. In addition, no panel-wide calling...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Leilei, Deng, Qinfang, Xu, Ze, Zhou, Songwen, Li, Chao, Li, Yi-Xue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7118954/
https://www.ncbi.nlm.nih.gov/pubmed/32245364
http://dx.doi.org/10.1186/s12859-020-3412-2
_version_ 1783514672979247104
author Wu, Leilei
Deng, Qinfang
Xu, Ze
Zhou, Songwen
Li, Chao
Li, Yi-Xue
author_facet Wu, Leilei
Deng, Qinfang
Xu, Ze
Zhou, Songwen
Li, Chao
Li, Yi-Xue
author_sort Wu, Leilei
collection PubMed
description BACKGROUND: Hybrid capture-based next-generation sequencing of DNA has been widely applied in the detection of circulating tumor DNA (ctDNA). Various methods have been proposed for ctDNA detection, but low-allelic-fraction (AF) variants are still a great challenge. In addition, no panel-wide calling algorithm is available, which hiders the full usage of ctDNA based ‘liquid biopsy’. Thus, we developed the VBCALAVD (Virtual Barcode-based Calling Algorithm for Low Allelic Variant Detection) in silico to overcome these limitations. RESULTS: Based on the understanding of the nature of ctDNA fragmentation, a novel platform-independent virtual barcode strategy was established to eliminate random sequencing errors by clustering sequencing reads into virtual families. Stereotypical mutant-family-level background artifacts were polished by constructing AF distributions. Three additional robust fine-tuning filters were obtained to eliminate stochastic mutant-family-level noises. The performance of our algorithm was validated using cell-free DNA reference standard samples (cfDNA RSDs) and normal healthy cfDNA samples (cfDNA controls). For the RSDs with AFs of 0.1, 0.2, 0.5, 1 and 5%, the mean F1 scores were 0.43 (0.25~0.56), 0.77, 0.92, 0.926 (0.86~1.0) and 0.89 (0.75~1.0), respectively, which indicates that the proposed approach significantly outperforms the published algorithms. Among controls, no false positives were detected. Meanwhile, characteristics of mutant-family-level noise and quantitative determinants of divergence between mutant-family-level noises from controls and RSDs were clearly depicted. CONCLUSIONS: Due to its good performance in the detection of low-AF variants, our algorithm will greatly facilitate the noninvasive panel-wide detection of ctDNA in research and clinical settings. The whole pipeline is available at https://github.com/zhaodalv/VBCALAVD.
format Online
Article
Text
id pubmed-7118954
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-71189542020-04-07 A novel virtual barcode strategy for accurate panel-wide variant calling in circulating tumor DNA Wu, Leilei Deng, Qinfang Xu, Ze Zhou, Songwen Li, Chao Li, Yi-Xue BMC Bioinformatics Research Article BACKGROUND: Hybrid capture-based next-generation sequencing of DNA has been widely applied in the detection of circulating tumor DNA (ctDNA). Various methods have been proposed for ctDNA detection, but low-allelic-fraction (AF) variants are still a great challenge. In addition, no panel-wide calling algorithm is available, which hiders the full usage of ctDNA based ‘liquid biopsy’. Thus, we developed the VBCALAVD (Virtual Barcode-based Calling Algorithm for Low Allelic Variant Detection) in silico to overcome these limitations. RESULTS: Based on the understanding of the nature of ctDNA fragmentation, a novel platform-independent virtual barcode strategy was established to eliminate random sequencing errors by clustering sequencing reads into virtual families. Stereotypical mutant-family-level background artifacts were polished by constructing AF distributions. Three additional robust fine-tuning filters were obtained to eliminate stochastic mutant-family-level noises. The performance of our algorithm was validated using cell-free DNA reference standard samples (cfDNA RSDs) and normal healthy cfDNA samples (cfDNA controls). For the RSDs with AFs of 0.1, 0.2, 0.5, 1 and 5%, the mean F1 scores were 0.43 (0.25~0.56), 0.77, 0.92, 0.926 (0.86~1.0) and 0.89 (0.75~1.0), respectively, which indicates that the proposed approach significantly outperforms the published algorithms. Among controls, no false positives were detected. Meanwhile, characteristics of mutant-family-level noise and quantitative determinants of divergence between mutant-family-level noises from controls and RSDs were clearly depicted. CONCLUSIONS: Due to its good performance in the detection of low-AF variants, our algorithm will greatly facilitate the noninvasive panel-wide detection of ctDNA in research and clinical settings. The whole pipeline is available at https://github.com/zhaodalv/VBCALAVD. BioMed Central 2020-04-03 /pmc/articles/PMC7118954/ /pubmed/32245364 http://dx.doi.org/10.1186/s12859-020-3412-2 Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Wu, Leilei
Deng, Qinfang
Xu, Ze
Zhou, Songwen
Li, Chao
Li, Yi-Xue
A novel virtual barcode strategy for accurate panel-wide variant calling in circulating tumor DNA
title A novel virtual barcode strategy for accurate panel-wide variant calling in circulating tumor DNA
title_full A novel virtual barcode strategy for accurate panel-wide variant calling in circulating tumor DNA
title_fullStr A novel virtual barcode strategy for accurate panel-wide variant calling in circulating tumor DNA
title_full_unstemmed A novel virtual barcode strategy for accurate panel-wide variant calling in circulating tumor DNA
title_short A novel virtual barcode strategy for accurate panel-wide variant calling in circulating tumor DNA
title_sort novel virtual barcode strategy for accurate panel-wide variant calling in circulating tumor dna
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7118954/
https://www.ncbi.nlm.nih.gov/pubmed/32245364
http://dx.doi.org/10.1186/s12859-020-3412-2
work_keys_str_mv AT wuleilei anovelvirtualbarcodestrategyforaccuratepanelwidevariantcallingincirculatingtumordna
AT dengqinfang anovelvirtualbarcodestrategyforaccuratepanelwidevariantcallingincirculatingtumordna
AT xuze anovelvirtualbarcodestrategyforaccuratepanelwidevariantcallingincirculatingtumordna
AT zhousongwen anovelvirtualbarcodestrategyforaccuratepanelwidevariantcallingincirculatingtumordna
AT lichao anovelvirtualbarcodestrategyforaccuratepanelwidevariantcallingincirculatingtumordna
AT liyixue anovelvirtualbarcodestrategyforaccuratepanelwidevariantcallingincirculatingtumordna
AT wuleilei novelvirtualbarcodestrategyforaccuratepanelwidevariantcallingincirculatingtumordna
AT dengqinfang novelvirtualbarcodestrategyforaccuratepanelwidevariantcallingincirculatingtumordna
AT xuze novelvirtualbarcodestrategyforaccuratepanelwidevariantcallingincirculatingtumordna
AT zhousongwen novelvirtualbarcodestrategyforaccuratepanelwidevariantcallingincirculatingtumordna
AT lichao novelvirtualbarcodestrategyforaccuratepanelwidevariantcallingincirculatingtumordna
AT liyixue novelvirtualbarcodestrategyforaccuratepanelwidevariantcallingincirculatingtumordna