Cargando…

Accuracy of CNV Detection from GWAS Data

Several computer programs are available for detecting copy number variants (CNVs) using genome-wide SNP arrays. We evaluated the performance of four CNV detection software suites—Birdsuite, Partek, HelixTree, and PennCNV-Affy—in the identification of both rare and common CNVs. Each program's pe...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Dandan, Qian, Yudong, Akula, Nirmala, Alliey-Rodriguez, Ney, Tang, Jinsong, Gershon, Elliot S., Liu, Chunyu
Formato:	Texto
Lenguaje:	English
Publicado:	Public Library of Science 2011
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3020939/ https://www.ncbi.nlm.nih.gov/pubmed/21249187 http://dx.doi.org/10.1371/journal.pone.0014511

_version_	1782196335015886848
author	Zhang, Dandan Qian, Yudong Akula, Nirmala Alliey-Rodriguez, Ney Tang, Jinsong Gershon, Elliot S. Liu, Chunyu
author_facet	Zhang, Dandan Qian, Yudong Akula, Nirmala Alliey-Rodriguez, Ney Tang, Jinsong Gershon, Elliot S. Liu, Chunyu
author_sort	Zhang, Dandan
collection	PubMed
description	Several computer programs are available for detecting copy number variants (CNVs) using genome-wide SNP arrays. We evaluated the performance of four CNV detection software suites—Birdsuite, Partek, HelixTree, and PennCNV-Affy—in the identification of both rare and common CNVs. Each program's performance was assessed in two ways. The first was its recovery rate, i.e., its ability to call 893 CNVs previously identified in eight HapMap samples by paired-end sequencing of whole-genome fosmid clones, and 51,440 CNVs identified by array Comparative Genome Hybridization (aCGH) followed by validation procedures, in 90 HapMap CEU samples. The second evaluation was program performance calling rare and common CNVs in the Bipolar Genome Study (BiGS) data set (1001 bipolar cases and 1033 controls, all of European ancestry) as measured by the Affymetrix SNP 6.0 array. Accuracy in calling rare CNVs was assessed by positive predictive value, based on the proportion of rare CNVs validated by quantitative real-time PCR (qPCR), while accuracy in calling common CNVs was assessed by false positive/false negative rates based on qPCR validation results from a subset of common CNVs. Birdsuite recovered the highest percentages of known HapMap CNVs containing >20 markers in two reference CNV datasets. The recovery rate increased with decreased CNV frequency. In the tested rare CNV data, Birdsuite and Partek had higher positive predictive values than the other software suites. In a test of three common CNVs in the BiGS dataset, Birdsuite's call was 98.8% consistent with qPCR quantification in one CNV region, but the other two regions showed an unacceptable degree of accuracy. We found relatively poor consistency between the two “gold standards,” the sequence data of Kidd et al., and aCGH data of Conrad et al. Algorithms for calling CNVs especially common ones need substantial improvement, and a “gold standard” for detection of CNVs remains to be established.
format	Text
id	pubmed-3020939
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-30209392011-01-19 Accuracy of CNV Detection from GWAS Data Zhang, Dandan Qian, Yudong Akula, Nirmala Alliey-Rodriguez, Ney Tang, Jinsong Gershon, Elliot S. Liu, Chunyu PLoS One Research Article Several computer programs are available for detecting copy number variants (CNVs) using genome-wide SNP arrays. We evaluated the performance of four CNV detection software suites—Birdsuite, Partek, HelixTree, and PennCNV-Affy—in the identification of both rare and common CNVs. Each program's performance was assessed in two ways. The first was its recovery rate, i.e., its ability to call 893 CNVs previously identified in eight HapMap samples by paired-end sequencing of whole-genome fosmid clones, and 51,440 CNVs identified by array Comparative Genome Hybridization (aCGH) followed by validation procedures, in 90 HapMap CEU samples. The second evaluation was program performance calling rare and common CNVs in the Bipolar Genome Study (BiGS) data set (1001 bipolar cases and 1033 controls, all of European ancestry) as measured by the Affymetrix SNP 6.0 array. Accuracy in calling rare CNVs was assessed by positive predictive value, based on the proportion of rare CNVs validated by quantitative real-time PCR (qPCR), while accuracy in calling common CNVs was assessed by false positive/false negative rates based on qPCR validation results from a subset of common CNVs. Birdsuite recovered the highest percentages of known HapMap CNVs containing >20 markers in two reference CNV datasets. The recovery rate increased with decreased CNV frequency. In the tested rare CNV data, Birdsuite and Partek had higher positive predictive values than the other software suites. In a test of three common CNVs in the BiGS dataset, Birdsuite's call was 98.8% consistent with qPCR quantification in one CNV region, but the other two regions showed an unacceptable degree of accuracy. We found relatively poor consistency between the two “gold standards,” the sequence data of Kidd et al., and aCGH data of Conrad et al. Algorithms for calling CNVs especially common ones need substantial improvement, and a “gold standard” for detection of CNVs remains to be established. Public Library of Science 2011-01-13 /pmc/articles/PMC3020939/ /pubmed/21249187 http://dx.doi.org/10.1371/journal.pone.0014511 Text en Zhang et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Zhang, Dandan Qian, Yudong Akula, Nirmala Alliey-Rodriguez, Ney Tang, Jinsong Gershon, Elliot S. Liu, Chunyu Accuracy of CNV Detection from GWAS Data
title	Accuracy of CNV Detection from GWAS Data
title_full	Accuracy of CNV Detection from GWAS Data
title_fullStr	Accuracy of CNV Detection from GWAS Data
title_full_unstemmed	Accuracy of CNV Detection from GWAS Data
title_short	Accuracy of CNV Detection from GWAS Data
title_sort	accuracy of cnv detection from gwas data
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3020939/ https://www.ncbi.nlm.nih.gov/pubmed/21249187 http://dx.doi.org/10.1371/journal.pone.0014511
work_keys_str_mv	AT zhangdandan accuracyofcnvdetectionfromgwasdata AT qianyudong accuracyofcnvdetectionfromgwasdata AT akulanirmala accuracyofcnvdetectionfromgwasdata AT allieyrodriguezney accuracyofcnvdetectionfromgwasdata AT tangjinsong accuracyofcnvdetectionfromgwasdata AT accuracyofcnvdetectionfromgwasdata AT gershonelliots accuracyofcnvdetectionfromgwasdata AT liuchunyu accuracyofcnvdetectionfromgwasdata

Accuracy of CNV Detection from GWAS Data

Ejemplares similares