Cargando…
Evaluation of an optimized germline exomes pipeline using BWA-MEM2 and Dragen-GATK tools
The next-generation sequencing (NGS) technology represents a significant advance in genomics and medical diagnosis. Nevertheless, the time it takes to perform sequencing, data analysis, and variant interpretation is a bottleneck in using next-generation sequencing in precision medicine. For accurate...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10399881/ https://www.ncbi.nlm.nih.gov/pubmed/37535628 http://dx.doi.org/10.1371/journal.pone.0288371 |
_version_ | 1785084344895799296 |
---|---|
author | Alganmi, Nofe Abusamra, Heba |
author_facet | Alganmi, Nofe Abusamra, Heba |
author_sort | Alganmi, Nofe |
collection | PubMed |
description | The next-generation sequencing (NGS) technology represents a significant advance in genomics and medical diagnosis. Nevertheless, the time it takes to perform sequencing, data analysis, and variant interpretation is a bottleneck in using next-generation sequencing in precision medicine. For accurate and efficient performance in clinical diagnostic lab practice, a consistent data analysis pipeline is necessary to avoid false variant calls and achieve optimum accuracy. This study aims to compare the performance of two NGS data analysis pipeline compartments, including short-read mapping (BWA-MEM and BWA-MEM2) and variant calling (GATK-HaplotypeCaller and DRAGEN-GATK). On Whole Exome Sequencing (WES) data, computational performance was assessed using several criteria, including mapping efficiency, variant calling performance, false positive calls rate, and time. We examined four gold-standard WES data sets: Ashkenazim father (NA24149), Ashkenazim mother (NA24143), Ashkenazim son (NA24385), and Asian son (NA25631). In addition, eighteen exome samples were analyzed based on different read counts, and coverage was used precisely in the run-time assessment. By using BWA-MEM 2 and Dragen-GATK, this study achieved faster and more accurate detection for SNVs and indels than the standard GATK Best Practices workflow. This systematic comparison will enable the bioinformatics community to develop a more efficient and faster solution for analyzing NGS data. |
format | Online Article Text |
id | pubmed-10399881 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-103998812023-08-04 Evaluation of an optimized germline exomes pipeline using BWA-MEM2 and Dragen-GATK tools Alganmi, Nofe Abusamra, Heba PLoS One Research Article The next-generation sequencing (NGS) technology represents a significant advance in genomics and medical diagnosis. Nevertheless, the time it takes to perform sequencing, data analysis, and variant interpretation is a bottleneck in using next-generation sequencing in precision medicine. For accurate and efficient performance in clinical diagnostic lab practice, a consistent data analysis pipeline is necessary to avoid false variant calls and achieve optimum accuracy. This study aims to compare the performance of two NGS data analysis pipeline compartments, including short-read mapping (BWA-MEM and BWA-MEM2) and variant calling (GATK-HaplotypeCaller and DRAGEN-GATK). On Whole Exome Sequencing (WES) data, computational performance was assessed using several criteria, including mapping efficiency, variant calling performance, false positive calls rate, and time. We examined four gold-standard WES data sets: Ashkenazim father (NA24149), Ashkenazim mother (NA24143), Ashkenazim son (NA24385), and Asian son (NA25631). In addition, eighteen exome samples were analyzed based on different read counts, and coverage was used precisely in the run-time assessment. By using BWA-MEM 2 and Dragen-GATK, this study achieved faster and more accurate detection for SNVs and indels than the standard GATK Best Practices workflow. This systematic comparison will enable the bioinformatics community to develop a more efficient and faster solution for analyzing NGS data. Public Library of Science 2023-08-03 /pmc/articles/PMC10399881/ /pubmed/37535628 http://dx.doi.org/10.1371/journal.pone.0288371 Text en © 2023 Alganmi, Abusamra https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Alganmi, Nofe Abusamra, Heba Evaluation of an optimized germline exomes pipeline using BWA-MEM2 and Dragen-GATK tools |
title | Evaluation of an optimized germline exomes pipeline using BWA-MEM2 and Dragen-GATK tools |
title_full | Evaluation of an optimized germline exomes pipeline using BWA-MEM2 and Dragen-GATK tools |
title_fullStr | Evaluation of an optimized germline exomes pipeline using BWA-MEM2 and Dragen-GATK tools |
title_full_unstemmed | Evaluation of an optimized germline exomes pipeline using BWA-MEM2 and Dragen-GATK tools |
title_short | Evaluation of an optimized germline exomes pipeline using BWA-MEM2 and Dragen-GATK tools |
title_sort | evaluation of an optimized germline exomes pipeline using bwa-mem2 and dragen-gatk tools |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10399881/ https://www.ncbi.nlm.nih.gov/pubmed/37535628 http://dx.doi.org/10.1371/journal.pone.0288371 |
work_keys_str_mv | AT alganminofe evaluationofanoptimizedgermlineexomespipelineusingbwamem2anddragengatktools AT abusamraheba evaluationofanoptimizedgermlineexomespipelineusingbwamem2anddragengatktools |