Cargando…

The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data

Genetic testing, which is now a routine part of clinical practice and disease management protocols, is often based on the assessment of small panels of variants or genes. On the other hand, continuous improvements in the speed and per-base costs of sequencing have now made whole exome sequencing (WE...

Descripción completa

Detalles Bibliográficos
Autores principales: Ferrarini, Alberto, Xumerle, Luciano, Griggio, Francesca, Garonzi, Marianna, Cantaloni, Chiara, Centomo, Cesare, Vargas, Sergio Marin, Descombes, Patrick, Marquis, Julien, Collino, Sebastiano, Franceschi, Claudio, Garagnani, Paolo, Salisbury, Benjamin A., Harvey, John Max, Delledonne, Massimo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4492948/
https://www.ncbi.nlm.nih.gov/pubmed/26147798
http://dx.doi.org/10.1371/journal.pone.0132180
_version_ 1782379830679961600
author Ferrarini, Alberto
Xumerle, Luciano
Griggio, Francesca
Garonzi, Marianna
Cantaloni, Chiara
Centomo, Cesare
Vargas, Sergio Marin
Descombes, Patrick
Marquis, Julien
Collino, Sebastiano
Franceschi, Claudio
Garagnani, Paolo
Salisbury, Benjamin A.
Harvey, John Max
Delledonne, Massimo
author_facet Ferrarini, Alberto
Xumerle, Luciano
Griggio, Francesca
Garonzi, Marianna
Cantaloni, Chiara
Centomo, Cesare
Vargas, Sergio Marin
Descombes, Patrick
Marquis, Julien
Collino, Sebastiano
Franceschi, Claudio
Garagnani, Paolo
Salisbury, Benjamin A.
Harvey, John Max
Delledonne, Massimo
author_sort Ferrarini, Alberto
collection PubMed
description Genetic testing, which is now a routine part of clinical practice and disease management protocols, is often based on the assessment of small panels of variants or genes. On the other hand, continuous improvements in the speed and per-base costs of sequencing have now made whole exome sequencing (WES) and whole genome sequencing (WGS) viable strategies for targeted or complete genetic analysis, respectively. Standard WGS/WES data analytical workflows generally rely on calling of sequence variants respect to the reference genome sequence. However, the reference genome sequence contains a large number of sites represented by rare alleles, by known pathogenic alleles and by alleles strongly associated to disease by GWAS. It’s thus critical, for clinical applications of WGS and WES, to interpret whether non-variant sites are homozygous for the reference allele or if the corresponding genotype cannot be reliably called. Here we show that an alternative analytical approach based on the analysis of both variant and non-variant sites from WGS data allows to genotype more than 92% of sites corresponding to known SNPs compared to 6% genotyped by standard variant analysis. These include homozygous reference sites of clinical interest, thus leading to a broad and comprehensive characterization of variation necessary to an accurate evaluation of disease risk. Altogether, our findings indicate that characterization of both variant and non-variant clinically informative sites in the genome is necessary to allow an accurate clinical assessment of a personal genome. Finally, we propose a highly efficient extended VCF (eVCF) file format which allows to store genotype calls for sites of clinical interest while remaining compatible with current variant interpretation software.
format Online
Article
Text
id pubmed-4492948
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44929482015-07-15 The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data Ferrarini, Alberto Xumerle, Luciano Griggio, Francesca Garonzi, Marianna Cantaloni, Chiara Centomo, Cesare Vargas, Sergio Marin Descombes, Patrick Marquis, Julien Collino, Sebastiano Franceschi, Claudio Garagnani, Paolo Salisbury, Benjamin A. Harvey, John Max Delledonne, Massimo PLoS One Research Article Genetic testing, which is now a routine part of clinical practice and disease management protocols, is often based on the assessment of small panels of variants or genes. On the other hand, continuous improvements in the speed and per-base costs of sequencing have now made whole exome sequencing (WES) and whole genome sequencing (WGS) viable strategies for targeted or complete genetic analysis, respectively. Standard WGS/WES data analytical workflows generally rely on calling of sequence variants respect to the reference genome sequence. However, the reference genome sequence contains a large number of sites represented by rare alleles, by known pathogenic alleles and by alleles strongly associated to disease by GWAS. It’s thus critical, for clinical applications of WGS and WES, to interpret whether non-variant sites are homozygous for the reference allele or if the corresponding genotype cannot be reliably called. Here we show that an alternative analytical approach based on the analysis of both variant and non-variant sites from WGS data allows to genotype more than 92% of sites corresponding to known SNPs compared to 6% genotyped by standard variant analysis. These include homozygous reference sites of clinical interest, thus leading to a broad and comprehensive characterization of variation necessary to an accurate evaluation of disease risk. Altogether, our findings indicate that characterization of both variant and non-variant clinically informative sites in the genome is necessary to allow an accurate clinical assessment of a personal genome. Finally, we propose a highly efficient extended VCF (eVCF) file format which allows to store genotype calls for sites of clinical interest while remaining compatible with current variant interpretation software. Public Library of Science 2015-07-06 /pmc/articles/PMC4492948/ /pubmed/26147798 http://dx.doi.org/10.1371/journal.pone.0132180 Text en © 2015 Ferrarini et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ferrarini, Alberto
Xumerle, Luciano
Griggio, Francesca
Garonzi, Marianna
Cantaloni, Chiara
Centomo, Cesare
Vargas, Sergio Marin
Descombes, Patrick
Marquis, Julien
Collino, Sebastiano
Franceschi, Claudio
Garagnani, Paolo
Salisbury, Benjamin A.
Harvey, John Max
Delledonne, Massimo
The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data
title The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data
title_full The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data
title_fullStr The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data
title_full_unstemmed The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data
title_short The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data
title_sort use of non-variant sites to improve the clinical assessment of whole-genome sequence data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4492948/
https://www.ncbi.nlm.nih.gov/pubmed/26147798
http://dx.doi.org/10.1371/journal.pone.0132180
work_keys_str_mv AT ferrarinialberto theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT xumerleluciano theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT griggiofrancesca theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT garonzimarianna theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT cantalonichiara theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT centomocesare theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT vargassergiomarin theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT descombespatrick theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT marquisjulien theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT collinosebastiano theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT franceschiclaudio theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT garagnanipaolo theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT salisburybenjamina theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT harveyjohnmax theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT delledonnemassimo theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT ferrarinialberto useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT xumerleluciano useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT griggiofrancesca useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT garonzimarianna useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT cantalonichiara useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT centomocesare useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT vargassergiomarin useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT descombespatrick useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT marquisjulien useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT collinosebastiano useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT franceschiclaudio useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT garagnanipaolo useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT salisburybenjamina useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT harveyjohnmax useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata
AT delledonnemassimo useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata