Cargando…

Copy number variation detection and genotyping from exome sequence data

While exome sequencing is readily amenable to single-nucleotide variant discovery, the sparse and nonuniform nature of the exome capture reaction has hindered exome-based detection and characterization of genic copy number variation. We developed a novel method using singular value decomposition (SV...

Descripción completa

Detalles Bibliográficos
Autores principales: Krumm, Niklas, Sudmant, Peter H., Ko, Arthur, O'Roak, Brian J., Malig, Maika, Coe, Bradley P., Quinlan, Aaron R., Nickerson, Deborah A., Eichler, Evan E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3409265/
https://www.ncbi.nlm.nih.gov/pubmed/22585873
http://dx.doi.org/10.1101/gr.138115.112
_version_ 1782239571914784768
author Krumm, Niklas
Sudmant, Peter H.
Ko, Arthur
O'Roak, Brian J.
Malig, Maika
Coe, Bradley P.
Quinlan, Aaron R.
Nickerson, Deborah A.
Eichler, Evan E.
author_facet Krumm, Niklas
Sudmant, Peter H.
Ko, Arthur
O'Roak, Brian J.
Malig, Maika
Coe, Bradley P.
Quinlan, Aaron R.
Nickerson, Deborah A.
Eichler, Evan E.
author_sort Krumm, Niklas
collection PubMed
description While exome sequencing is readily amenable to single-nucleotide variant discovery, the sparse and nonuniform nature of the exome capture reaction has hindered exome-based detection and characterization of genic copy number variation. We developed a novel method using singular value decomposition (SVD) normalization to discover rare genic copy number variants (CNVs) as well as genotype copy number polymorphic (CNP) loci with high sensitivity and specificity from exome sequencing data. We estimate the precision of our algorithm using 122 trios (366 exomes) and show that this method can be used to reliably predict (94% overall precision) both de novo and inherited rare CNVs involving three or more consecutive exons. We demonstrate that exome-based genotyping of CNPs strongly correlates with whole-genome data (median r(2) = 0.91), especially for loci with fewer than eight copies, and can estimate the absolute copy number of multi-allelic genes with high accuracy (78% call level). The resulting user-friendly computational pipeline, CoNIFER (copy number inference from exome reads), can reliably be used to discover disruptive genic CNVs missed by standard approaches and should have broad application in human genetic studies of disease.
format Online
Article
Text
id pubmed-3409265
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-34092652013-02-01 Copy number variation detection and genotyping from exome sequence data Krumm, Niklas Sudmant, Peter H. Ko, Arthur O'Roak, Brian J. Malig, Maika Coe, Bradley P. Quinlan, Aaron R. Nickerson, Deborah A. Eichler, Evan E. Genome Res Method While exome sequencing is readily amenable to single-nucleotide variant discovery, the sparse and nonuniform nature of the exome capture reaction has hindered exome-based detection and characterization of genic copy number variation. We developed a novel method using singular value decomposition (SVD) normalization to discover rare genic copy number variants (CNVs) as well as genotype copy number polymorphic (CNP) loci with high sensitivity and specificity from exome sequencing data. We estimate the precision of our algorithm using 122 trios (366 exomes) and show that this method can be used to reliably predict (94% overall precision) both de novo and inherited rare CNVs involving three or more consecutive exons. We demonstrate that exome-based genotyping of CNPs strongly correlates with whole-genome data (median r(2) = 0.91), especially for loci with fewer than eight copies, and can estimate the absolute copy number of multi-allelic genes with high accuracy (78% call level). The resulting user-friendly computational pipeline, CoNIFER (copy number inference from exome reads), can reliably be used to discover disruptive genic CNVs missed by standard approaches and should have broad application in human genetic studies of disease. Cold Spring Harbor Laboratory Press 2012-08 /pmc/articles/PMC3409265/ /pubmed/22585873 http://dx.doi.org/10.1101/gr.138115.112 Text en © 2012, Published by Cold Spring Harbor Laboratory Press This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported License), as described at http://creativecommons.org/licenses/by-nc/3.0/.
spellingShingle Method
Krumm, Niklas
Sudmant, Peter H.
Ko, Arthur
O'Roak, Brian J.
Malig, Maika
Coe, Bradley P.
Quinlan, Aaron R.
Nickerson, Deborah A.
Eichler, Evan E.
Copy number variation detection and genotyping from exome sequence data
title Copy number variation detection and genotyping from exome sequence data
title_full Copy number variation detection and genotyping from exome sequence data
title_fullStr Copy number variation detection and genotyping from exome sequence data
title_full_unstemmed Copy number variation detection and genotyping from exome sequence data
title_short Copy number variation detection and genotyping from exome sequence data
title_sort copy number variation detection and genotyping from exome sequence data
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3409265/
https://www.ncbi.nlm.nih.gov/pubmed/22585873
http://dx.doi.org/10.1101/gr.138115.112
work_keys_str_mv AT krummniklas copynumbervariationdetectionandgenotypingfromexomesequencedata
AT sudmantpeterh copynumbervariationdetectionandgenotypingfromexomesequencedata
AT koarthur copynumbervariationdetectionandgenotypingfromexomesequencedata
AT oroakbrianj copynumbervariationdetectionandgenotypingfromexomesequencedata
AT maligmaika copynumbervariationdetectionandgenotypingfromexomesequencedata
AT coebradleyp copynumbervariationdetectionandgenotypingfromexomesequencedata
AT copynumbervariationdetectionandgenotypingfromexomesequencedata
AT quinlanaaronr copynumbervariationdetectionandgenotypingfromexomesequencedata
AT nickersondeboraha copynumbervariationdetectionandgenotypingfromexomesequencedata
AT eichlerevane copynumbervariationdetectionandgenotypingfromexomesequencedata