Cargando…
The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data
Genetic testing, which is now a routine part of clinical practice and disease management protocols, is often based on the assessment of small panels of variants or genes. On the other hand, continuous improvements in the speed and per-base costs of sequencing have now made whole exome sequencing (WE...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4492948/ https://www.ncbi.nlm.nih.gov/pubmed/26147798 http://dx.doi.org/10.1371/journal.pone.0132180 |
_version_ | 1782379830679961600 |
---|---|
author | Ferrarini, Alberto Xumerle, Luciano Griggio, Francesca Garonzi, Marianna Cantaloni, Chiara Centomo, Cesare Vargas, Sergio Marin Descombes, Patrick Marquis, Julien Collino, Sebastiano Franceschi, Claudio Garagnani, Paolo Salisbury, Benjamin A. Harvey, John Max Delledonne, Massimo |
author_facet | Ferrarini, Alberto Xumerle, Luciano Griggio, Francesca Garonzi, Marianna Cantaloni, Chiara Centomo, Cesare Vargas, Sergio Marin Descombes, Patrick Marquis, Julien Collino, Sebastiano Franceschi, Claudio Garagnani, Paolo Salisbury, Benjamin A. Harvey, John Max Delledonne, Massimo |
author_sort | Ferrarini, Alberto |
collection | PubMed |
description | Genetic testing, which is now a routine part of clinical practice and disease management protocols, is often based on the assessment of small panels of variants or genes. On the other hand, continuous improvements in the speed and per-base costs of sequencing have now made whole exome sequencing (WES) and whole genome sequencing (WGS) viable strategies for targeted or complete genetic analysis, respectively. Standard WGS/WES data analytical workflows generally rely on calling of sequence variants respect to the reference genome sequence. However, the reference genome sequence contains a large number of sites represented by rare alleles, by known pathogenic alleles and by alleles strongly associated to disease by GWAS. It’s thus critical, for clinical applications of WGS and WES, to interpret whether non-variant sites are homozygous for the reference allele or if the corresponding genotype cannot be reliably called. Here we show that an alternative analytical approach based on the analysis of both variant and non-variant sites from WGS data allows to genotype more than 92% of sites corresponding to known SNPs compared to 6% genotyped by standard variant analysis. These include homozygous reference sites of clinical interest, thus leading to a broad and comprehensive characterization of variation necessary to an accurate evaluation of disease risk. Altogether, our findings indicate that characterization of both variant and non-variant clinically informative sites in the genome is necessary to allow an accurate clinical assessment of a personal genome. Finally, we propose a highly efficient extended VCF (eVCF) file format which allows to store genotype calls for sites of clinical interest while remaining compatible with current variant interpretation software. |
format | Online Article Text |
id | pubmed-4492948 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-44929482015-07-15 The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data Ferrarini, Alberto Xumerle, Luciano Griggio, Francesca Garonzi, Marianna Cantaloni, Chiara Centomo, Cesare Vargas, Sergio Marin Descombes, Patrick Marquis, Julien Collino, Sebastiano Franceschi, Claudio Garagnani, Paolo Salisbury, Benjamin A. Harvey, John Max Delledonne, Massimo PLoS One Research Article Genetic testing, which is now a routine part of clinical practice and disease management protocols, is often based on the assessment of small panels of variants or genes. On the other hand, continuous improvements in the speed and per-base costs of sequencing have now made whole exome sequencing (WES) and whole genome sequencing (WGS) viable strategies for targeted or complete genetic analysis, respectively. Standard WGS/WES data analytical workflows generally rely on calling of sequence variants respect to the reference genome sequence. However, the reference genome sequence contains a large number of sites represented by rare alleles, by known pathogenic alleles and by alleles strongly associated to disease by GWAS. It’s thus critical, for clinical applications of WGS and WES, to interpret whether non-variant sites are homozygous for the reference allele or if the corresponding genotype cannot be reliably called. Here we show that an alternative analytical approach based on the analysis of both variant and non-variant sites from WGS data allows to genotype more than 92% of sites corresponding to known SNPs compared to 6% genotyped by standard variant analysis. These include homozygous reference sites of clinical interest, thus leading to a broad and comprehensive characterization of variation necessary to an accurate evaluation of disease risk. Altogether, our findings indicate that characterization of both variant and non-variant clinically informative sites in the genome is necessary to allow an accurate clinical assessment of a personal genome. Finally, we propose a highly efficient extended VCF (eVCF) file format which allows to store genotype calls for sites of clinical interest while remaining compatible with current variant interpretation software. Public Library of Science 2015-07-06 /pmc/articles/PMC4492948/ /pubmed/26147798 http://dx.doi.org/10.1371/journal.pone.0132180 Text en © 2015 Ferrarini et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Ferrarini, Alberto Xumerle, Luciano Griggio, Francesca Garonzi, Marianna Cantaloni, Chiara Centomo, Cesare Vargas, Sergio Marin Descombes, Patrick Marquis, Julien Collino, Sebastiano Franceschi, Claudio Garagnani, Paolo Salisbury, Benjamin A. Harvey, John Max Delledonne, Massimo The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data |
title | The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data |
title_full | The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data |
title_fullStr | The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data |
title_full_unstemmed | The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data |
title_short | The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data |
title_sort | use of non-variant sites to improve the clinical assessment of whole-genome sequence data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4492948/ https://www.ncbi.nlm.nih.gov/pubmed/26147798 http://dx.doi.org/10.1371/journal.pone.0132180 |
work_keys_str_mv | AT ferrarinialberto theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT xumerleluciano theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT griggiofrancesca theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT garonzimarianna theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT cantalonichiara theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT centomocesare theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT vargassergiomarin theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT descombespatrick theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT marquisjulien theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT collinosebastiano theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT franceschiclaudio theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT garagnanipaolo theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT salisburybenjamina theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT harveyjohnmax theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT delledonnemassimo theuseofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT ferrarinialberto useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT xumerleluciano useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT griggiofrancesca useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT garonzimarianna useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT cantalonichiara useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT centomocesare useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT vargassergiomarin useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT descombespatrick useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT marquisjulien useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT collinosebastiano useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT franceschiclaudio useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT garagnanipaolo useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT salisburybenjamina useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT harveyjohnmax useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata AT delledonnemassimo useofnonvariantsitestoimprovetheclinicalassessmentofwholegenomesequencedata |