Cargando…

CoVaCS: a consensus variant calling system

BACKGROUND: The advent and ongoing development of next generation sequencing technologies (NGS) has led to a rapid increase in the rate of human genome re-sequencing data, paving the way for personalized genomics and precision medicine. The body of genome resequencing data is progressively increasin...

Descripción completa

Detalles Bibliográficos
Autores principales: Chiara, Matteo, Gioiosa, Silvia, Chillemi, Giovanni, D’Antonio, Mattia, Flati, Tiziano, Picardi, Ernesto, Zambelli, Federico, Horner, David Stephen, Pesole, Graziano, Castrignanò, Tiziana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5800023/
https://www.ncbi.nlm.nih.gov/pubmed/29402227
http://dx.doi.org/10.1186/s12864-018-4508-1
_version_ 1783298121925656576
author Chiara, Matteo
Gioiosa, Silvia
Chillemi, Giovanni
D’Antonio, Mattia
Flati, Tiziano
Picardi, Ernesto
Zambelli, Federico
Horner, David Stephen
Pesole, Graziano
Castrignanò, Tiziana
author_facet Chiara, Matteo
Gioiosa, Silvia
Chillemi, Giovanni
D’Antonio, Mattia
Flati, Tiziano
Picardi, Ernesto
Zambelli, Federico
Horner, David Stephen
Pesole, Graziano
Castrignanò, Tiziana
author_sort Chiara, Matteo
collection PubMed
description BACKGROUND: The advent and ongoing development of next generation sequencing technologies (NGS) has led to a rapid increase in the rate of human genome re-sequencing data, paving the way for personalized genomics and precision medicine. The body of genome resequencing data is progressively increasing underlining the need for accurate and time-effective bioinformatics systems for genotyping - a crucial prerequisite for identification of candidate causal mutations in diagnostic screens. RESULTS: Here we present CoVaCS, a fully automated, highly accurate system with a web based graphical interface for genotyping and variant annotation. Extensive tests on a gold standard benchmark data-set -the NA12878 Illumina platinum genome- confirm that call-sets based on our consensus strategy are completely in line with those attained by similar command line based approaches, and far more accurate than call-sets from any individual tool. Importantly our system exhibits better sensitivity and higher specificity than equivalent commercial software. CONCLUSIONS: CoVaCS offers optimized pipelines integrating state of the art tools for variant calling and annotation for whole genome sequencing (WGS), whole-exome sequencing (WES) and target-gene sequencing (TGS) data. The system is currently hosted at Cineca, and offers the speed of a HPC computing facility, a crucial consideration when large numbers of samples must be analysed. Importantly, all the analyses are performed automatically allowing high reproducibility of the results. As such, we believe that CoVaCS can be a valuable tool for the analysis of human genome resequencing studies. CoVaCS is available at: https://bioinformatics.cineca.it/covacs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4508-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5800023
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58000232018-02-13 CoVaCS: a consensus variant calling system Chiara, Matteo Gioiosa, Silvia Chillemi, Giovanni D’Antonio, Mattia Flati, Tiziano Picardi, Ernesto Zambelli, Federico Horner, David Stephen Pesole, Graziano Castrignanò, Tiziana BMC Genomics Software BACKGROUND: The advent and ongoing development of next generation sequencing technologies (NGS) has led to a rapid increase in the rate of human genome re-sequencing data, paving the way for personalized genomics and precision medicine. The body of genome resequencing data is progressively increasing underlining the need for accurate and time-effective bioinformatics systems for genotyping - a crucial prerequisite for identification of candidate causal mutations in diagnostic screens. RESULTS: Here we present CoVaCS, a fully automated, highly accurate system with a web based graphical interface for genotyping and variant annotation. Extensive tests on a gold standard benchmark data-set -the NA12878 Illumina platinum genome- confirm that call-sets based on our consensus strategy are completely in line with those attained by similar command line based approaches, and far more accurate than call-sets from any individual tool. Importantly our system exhibits better sensitivity and higher specificity than equivalent commercial software. CONCLUSIONS: CoVaCS offers optimized pipelines integrating state of the art tools for variant calling and annotation for whole genome sequencing (WGS), whole-exome sequencing (WES) and target-gene sequencing (TGS) data. The system is currently hosted at Cineca, and offers the speed of a HPC computing facility, a crucial consideration when large numbers of samples must be analysed. Importantly, all the analyses are performed automatically allowing high reproducibility of the results. As such, we believe that CoVaCS can be a valuable tool for the analysis of human genome resequencing studies. CoVaCS is available at: https://bioinformatics.cineca.it/covacs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4508-1) contains supplementary material, which is available to authorized users. BioMed Central 2018-02-05 /pmc/articles/PMC5800023/ /pubmed/29402227 http://dx.doi.org/10.1186/s12864-018-4508-1 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Chiara, Matteo
Gioiosa, Silvia
Chillemi, Giovanni
D’Antonio, Mattia
Flati, Tiziano
Picardi, Ernesto
Zambelli, Federico
Horner, David Stephen
Pesole, Graziano
Castrignanò, Tiziana
CoVaCS: a consensus variant calling system
title CoVaCS: a consensus variant calling system
title_full CoVaCS: a consensus variant calling system
title_fullStr CoVaCS: a consensus variant calling system
title_full_unstemmed CoVaCS: a consensus variant calling system
title_short CoVaCS: a consensus variant calling system
title_sort covacs: a consensus variant calling system
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5800023/
https://www.ncbi.nlm.nih.gov/pubmed/29402227
http://dx.doi.org/10.1186/s12864-018-4508-1
work_keys_str_mv AT chiaramatteo covacsaconsensusvariantcallingsystem
AT gioiosasilvia covacsaconsensusvariantcallingsystem
AT chillemigiovanni covacsaconsensusvariantcallingsystem
AT dantoniomattia covacsaconsensusvariantcallingsystem
AT flatitiziano covacsaconsensusvariantcallingsystem
AT picardiernesto covacsaconsensusvariantcallingsystem
AT zambellifederico covacsaconsensusvariantcallingsystem
AT hornerdavidstephen covacsaconsensusvariantcallingsystem
AT pesolegraziano covacsaconsensusvariantcallingsystem
AT castrignanotiziana covacsaconsensusvariantcallingsystem