Cargando…
Copy number variation detection and genotyping from exome sequence data
While exome sequencing is readily amenable to single-nucleotide variant discovery, the sparse and nonuniform nature of the exome capture reaction has hindered exome-based detection and characterization of genic copy number variation. We developed a novel method using singular value decomposition (SV...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3409265/ https://www.ncbi.nlm.nih.gov/pubmed/22585873 http://dx.doi.org/10.1101/gr.138115.112 |
_version_ | 1782239571914784768 |
---|---|
author | Krumm, Niklas Sudmant, Peter H. Ko, Arthur O'Roak, Brian J. Malig, Maika Coe, Bradley P. Quinlan, Aaron R. Nickerson, Deborah A. Eichler, Evan E. |
author_facet | Krumm, Niklas Sudmant, Peter H. Ko, Arthur O'Roak, Brian J. Malig, Maika Coe, Bradley P. Quinlan, Aaron R. Nickerson, Deborah A. Eichler, Evan E. |
author_sort | Krumm, Niklas |
collection | PubMed |
description | While exome sequencing is readily amenable to single-nucleotide variant discovery, the sparse and nonuniform nature of the exome capture reaction has hindered exome-based detection and characterization of genic copy number variation. We developed a novel method using singular value decomposition (SVD) normalization to discover rare genic copy number variants (CNVs) as well as genotype copy number polymorphic (CNP) loci with high sensitivity and specificity from exome sequencing data. We estimate the precision of our algorithm using 122 trios (366 exomes) and show that this method can be used to reliably predict (94% overall precision) both de novo and inherited rare CNVs involving three or more consecutive exons. We demonstrate that exome-based genotyping of CNPs strongly correlates with whole-genome data (median r(2) = 0.91), especially for loci with fewer than eight copies, and can estimate the absolute copy number of multi-allelic genes with high accuracy (78% call level). The resulting user-friendly computational pipeline, CoNIFER (copy number inference from exome reads), can reliably be used to discover disruptive genic CNVs missed by standard approaches and should have broad application in human genetic studies of disease. |
format | Online Article Text |
id | pubmed-3409265 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-34092652013-02-01 Copy number variation detection and genotyping from exome sequence data Krumm, Niklas Sudmant, Peter H. Ko, Arthur O'Roak, Brian J. Malig, Maika Coe, Bradley P. Quinlan, Aaron R. Nickerson, Deborah A. Eichler, Evan E. Genome Res Method While exome sequencing is readily amenable to single-nucleotide variant discovery, the sparse and nonuniform nature of the exome capture reaction has hindered exome-based detection and characterization of genic copy number variation. We developed a novel method using singular value decomposition (SVD) normalization to discover rare genic copy number variants (CNVs) as well as genotype copy number polymorphic (CNP) loci with high sensitivity and specificity from exome sequencing data. We estimate the precision of our algorithm using 122 trios (366 exomes) and show that this method can be used to reliably predict (94% overall precision) both de novo and inherited rare CNVs involving three or more consecutive exons. We demonstrate that exome-based genotyping of CNPs strongly correlates with whole-genome data (median r(2) = 0.91), especially for loci with fewer than eight copies, and can estimate the absolute copy number of multi-allelic genes with high accuracy (78% call level). The resulting user-friendly computational pipeline, CoNIFER (copy number inference from exome reads), can reliably be used to discover disruptive genic CNVs missed by standard approaches and should have broad application in human genetic studies of disease. Cold Spring Harbor Laboratory Press 2012-08 /pmc/articles/PMC3409265/ /pubmed/22585873 http://dx.doi.org/10.1101/gr.138115.112 Text en © 2012, Published by Cold Spring Harbor Laboratory Press This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported License), as described at http://creativecommons.org/licenses/by-nc/3.0/. |
spellingShingle | Method Krumm, Niklas Sudmant, Peter H. Ko, Arthur O'Roak, Brian J. Malig, Maika Coe, Bradley P. Quinlan, Aaron R. Nickerson, Deborah A. Eichler, Evan E. Copy number variation detection and genotyping from exome sequence data |
title | Copy number variation detection and genotyping from exome sequence data |
title_full | Copy number variation detection and genotyping from exome sequence data |
title_fullStr | Copy number variation detection and genotyping from exome sequence data |
title_full_unstemmed | Copy number variation detection and genotyping from exome sequence data |
title_short | Copy number variation detection and genotyping from exome sequence data |
title_sort | copy number variation detection and genotyping from exome sequence data |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3409265/ https://www.ncbi.nlm.nih.gov/pubmed/22585873 http://dx.doi.org/10.1101/gr.138115.112 |
work_keys_str_mv | AT krummniklas copynumbervariationdetectionandgenotypingfromexomesequencedata AT sudmantpeterh copynumbervariationdetectionandgenotypingfromexomesequencedata AT koarthur copynumbervariationdetectionandgenotypingfromexomesequencedata AT oroakbrianj copynumbervariationdetectionandgenotypingfromexomesequencedata AT maligmaika copynumbervariationdetectionandgenotypingfromexomesequencedata AT coebradleyp copynumbervariationdetectionandgenotypingfromexomesequencedata AT copynumbervariationdetectionandgenotypingfromexomesequencedata AT quinlanaaronr copynumbervariationdetectionandgenotypingfromexomesequencedata AT nickersondeboraha copynumbervariationdetectionandgenotypingfromexomesequencedata AT eichlerevane copynumbervariationdetectionandgenotypingfromexomesequencedata |