Cargando…
Systematic comparison of variant calling pipelines using gold standard personal exome variants
The success of clinical genomics using next generation sequencing (NGS) requires the accurate and consistent identification of personal genome variants. Assorted variant calling methods have been developed, which show low concordance between their calls. Hence, a systematic comparison of the variant...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4671096/ https://www.ncbi.nlm.nih.gov/pubmed/26639839 http://dx.doi.org/10.1038/srep17875 |
_version_ | 1782404354170421248 |
---|---|
author | Hwang, Sohyun Kim, Eiru Lee, Insuk Marcotte, Edward M. |
author_facet | Hwang, Sohyun Kim, Eiru Lee, Insuk Marcotte, Edward M. |
author_sort | Hwang, Sohyun |
collection | PubMed |
description | The success of clinical genomics using next generation sequencing (NGS) requires the accurate and consistent identification of personal genome variants. Assorted variant calling methods have been developed, which show low concordance between their calls. Hence, a systematic comparison of the variant callers could give important guidance to NGS-based clinical genomics. Recently, a set of high-confident variant calls for one individual (NA12878) has been published by the Genome in a Bottle (GIAB) consortium, enabling performance benchmarking of different variant calling pipelines. Based on the gold standard reference variant calls from GIAB, we compared the performance of thirteen variant calling pipelines, testing combinations of three read aligners—BWA-MEM, Bowtie2, and Novoalign—and four variant callers—Genome Analysis Tool Kit HaplotypeCaller (GATK-HC), Samtools mpileup, Freebayes and Ion Proton Variant Caller (TVC), for twelve data sets for the NA12878 genome sequenced by different platforms including Illumina2000, Illumina2500, and Ion Proton, with various exome capture systems and exome coverage. We observed different biases toward specific types of SNP genotyping errors by the different variant callers. The results of our study provide useful guidelines for reliable variant identification from deep sequencing of personal genomes. |
format | Online Article Text |
id | pubmed-4671096 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-46710962015-12-11 Systematic comparison of variant calling pipelines using gold standard personal exome variants Hwang, Sohyun Kim, Eiru Lee, Insuk Marcotte, Edward M. Sci Rep Article The success of clinical genomics using next generation sequencing (NGS) requires the accurate and consistent identification of personal genome variants. Assorted variant calling methods have been developed, which show low concordance between their calls. Hence, a systematic comparison of the variant callers could give important guidance to NGS-based clinical genomics. Recently, a set of high-confident variant calls for one individual (NA12878) has been published by the Genome in a Bottle (GIAB) consortium, enabling performance benchmarking of different variant calling pipelines. Based on the gold standard reference variant calls from GIAB, we compared the performance of thirteen variant calling pipelines, testing combinations of three read aligners—BWA-MEM, Bowtie2, and Novoalign—and four variant callers—Genome Analysis Tool Kit HaplotypeCaller (GATK-HC), Samtools mpileup, Freebayes and Ion Proton Variant Caller (TVC), for twelve data sets for the NA12878 genome sequenced by different platforms including Illumina2000, Illumina2500, and Ion Proton, with various exome capture systems and exome coverage. We observed different biases toward specific types of SNP genotyping errors by the different variant callers. The results of our study provide useful guidelines for reliable variant identification from deep sequencing of personal genomes. Nature Publishing Group 2015-12-07 /pmc/articles/PMC4671096/ /pubmed/26639839 http://dx.doi.org/10.1038/srep17875 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Hwang, Sohyun Kim, Eiru Lee, Insuk Marcotte, Edward M. Systematic comparison of variant calling pipelines using gold standard personal exome variants |
title | Systematic comparison of variant calling pipelines using gold standard personal exome variants |
title_full | Systematic comparison of variant calling pipelines using gold standard personal exome variants |
title_fullStr | Systematic comparison of variant calling pipelines using gold standard personal exome variants |
title_full_unstemmed | Systematic comparison of variant calling pipelines using gold standard personal exome variants |
title_short | Systematic comparison of variant calling pipelines using gold standard personal exome variants |
title_sort | systematic comparison of variant calling pipelines using gold standard personal exome variants |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4671096/ https://www.ncbi.nlm.nih.gov/pubmed/26639839 http://dx.doi.org/10.1038/srep17875 |
work_keys_str_mv | AT hwangsohyun systematiccomparisonofvariantcallingpipelinesusinggoldstandardpersonalexomevariants AT kimeiru systematiccomparisonofvariantcallingpipelinesusinggoldstandardpersonalexomevariants AT leeinsuk systematiccomparisonofvariantcallingpipelinesusinggoldstandardpersonalexomevariants AT marcotteedwardm systematiccomparisonofvariantcallingpipelinesusinggoldstandardpersonalexomevariants |