Cargando…

Comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays

Determination of copy number variants (CNVs) inferred in genome wide single nucleotide polymorphism arrays has shown increasing utility in genetic variant disease associations. Several CNV detection methods are available, but differences in CNV call thresholds and characteristics exist. We evaluated...

Descripción completa

Detalles Bibliográficos
Autores principales: Dellinger, Andrew E., Saw, Seang-Mei, Goh, Liang K., Seielstad, Mark, Young, Terri L., Li, Yi-Ju
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2875020/
https://www.ncbi.nlm.nih.gov/pubmed/20142258
http://dx.doi.org/10.1093/nar/gkq040
_version_ 1782181537775616000
author Dellinger, Andrew E.
Saw, Seang-Mei
Goh, Liang K.
Seielstad, Mark
Young, Terri L.
Li, Yi-Ju
author_facet Dellinger, Andrew E.
Saw, Seang-Mei
Goh, Liang K.
Seielstad, Mark
Young, Terri L.
Li, Yi-Ju
author_sort Dellinger, Andrew E.
collection PubMed
description Determination of copy number variants (CNVs) inferred in genome wide single nucleotide polymorphism arrays has shown increasing utility in genetic variant disease associations. Several CNV detection methods are available, but differences in CNV call thresholds and characteristics exist. We evaluated the relative performance of seven methods: circular binary segmentation, CNVFinder, cnvPartition, gain and loss of DNA, Nexus algorithms, PennCNV and QuantiSNP. Tested data included real and simulated Illumina HumHap 550 data from the Singapore cohort study of the risk factors for Myopia (SCORM) and simulated data from Affymetrix 6.0 and platform-independent distributions. The normalized singleton ratio (NSR) is proposed as a metric for parameter optimization before enacting full analysis. We used 10 SCORM samples for optimizing parameter settings for each method and then evaluated method performance at optimal parameters using 100 SCORM samples. The statistical power, false positive rates, and receiver operating characteristic (ROC) curve residuals were evaluated by simulation studies. Optimal parameters, as determined by NSR and ROC curve residuals, were consistent across datasets. QuantiSNP outperformed other methods based on ROC curve residuals over most datasets. Nexus Rank and SNPRank have low specificity and high power. Nexus Rank calls oversized CNVs. PennCNV detects one of the fewest numbers of CNVs.
format Text
id pubmed-2875020
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28750202010-08-11 Comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays Dellinger, Andrew E. Saw, Seang-Mei Goh, Liang K. Seielstad, Mark Young, Terri L. Li, Yi-Ju Nucleic Acids Res Methods Online Determination of copy number variants (CNVs) inferred in genome wide single nucleotide polymorphism arrays has shown increasing utility in genetic variant disease associations. Several CNV detection methods are available, but differences in CNV call thresholds and characteristics exist. We evaluated the relative performance of seven methods: circular binary segmentation, CNVFinder, cnvPartition, gain and loss of DNA, Nexus algorithms, PennCNV and QuantiSNP. Tested data included real and simulated Illumina HumHap 550 data from the Singapore cohort study of the risk factors for Myopia (SCORM) and simulated data from Affymetrix 6.0 and platform-independent distributions. The normalized singleton ratio (NSR) is proposed as a metric for parameter optimization before enacting full analysis. We used 10 SCORM samples for optimizing parameter settings for each method and then evaluated method performance at optimal parameters using 100 SCORM samples. The statistical power, false positive rates, and receiver operating characteristic (ROC) curve residuals were evaluated by simulation studies. Optimal parameters, as determined by NSR and ROC curve residuals, were consistent across datasets. QuantiSNP outperformed other methods based on ROC curve residuals over most datasets. Nexus Rank and SNPRank have low specificity and high power. Nexus Rank calls oversized CNVs. PennCNV detects one of the fewest numbers of CNVs. Oxford University Press 2010-05 2010-02-08 /pmc/articles/PMC2875020/ /pubmed/20142258 http://dx.doi.org/10.1093/nar/gkq040 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Dellinger, Andrew E.
Saw, Seang-Mei
Goh, Liang K.
Seielstad, Mark
Young, Terri L.
Li, Yi-Ju
Comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays
title Comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays
title_full Comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays
title_fullStr Comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays
title_full_unstemmed Comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays
title_short Comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays
title_sort comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2875020/
https://www.ncbi.nlm.nih.gov/pubmed/20142258
http://dx.doi.org/10.1093/nar/gkq040
work_keys_str_mv AT dellingerandrewe comparativeanalysesofsevenalgorithmsforcopynumbervariantidentificationfromsinglenucleotidepolymorphismarrays
AT sawseangmei comparativeanalysesofsevenalgorithmsforcopynumbervariantidentificationfromsinglenucleotidepolymorphismarrays
AT gohliangk comparativeanalysesofsevenalgorithmsforcopynumbervariantidentificationfromsinglenucleotidepolymorphismarrays
AT seielstadmark comparativeanalysesofsevenalgorithmsforcopynumbervariantidentificationfromsinglenucleotidepolymorphismarrays
AT youngterril comparativeanalysesofsevenalgorithmsforcopynumbervariantidentificationfromsinglenucleotidepolymorphismarrays
AT liyiju comparativeanalysesofsevenalgorithmsforcopynumbervariantidentificationfromsinglenucleotidepolymorphismarrays