Cargando…

Hybridization and amplification rate correction for affymetrix SNP arrays

BACKGROUND: Copy number variation (CNV) is essential to understand the pathology of many complex diseases at the DNA level. Affymetrix SNP arrays, which are widely used for CNV studies, significantly depend on accurate copy number (CN) estimation. Nevertheless, CN estimation may be biased by several...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Quan, Peng, Peichao, Qian, Minping, Wan, Lin, Deng, Minghua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3428662/
https://www.ncbi.nlm.nih.gov/pubmed/22691279
http://dx.doi.org/10.1186/1755-8794-5-24
_version_ 1782241722995048448
author Wang, Quan
Peng, Peichao
Qian, Minping
Wan, Lin
Deng, Minghua
author_facet Wang, Quan
Peng, Peichao
Qian, Minping
Wan, Lin
Deng, Minghua
author_sort Wang, Quan
collection PubMed
description BACKGROUND: Copy number variation (CNV) is essential to understand the pathology of many complex diseases at the DNA level. Affymetrix SNP arrays, which are widely used for CNV studies, significantly depend on accurate copy number (CN) estimation. Nevertheless, CN estimation may be biased by several factors, including cross-hybridization and training sample batch, as well as genomic waves of intensities induced by sequence-dependent hybridization rate and amplification efficiency. Since many available algorithms only address one or two of the three factors, a high false discovery rate (FDR) often results when identifying CNV. Therefore, we have developed a new CNV detection pipeline which is based on hybridization and amplification rate correction (CNVhac). METHODS: CNVhac first estimates the allelic concentrations (ACs) of target sequences by using the sample independent parameters trained through physicochemical hybridization law. Then the raw CN is estimated by taking the ratio of AC to the corresponding average AC from a reference sample set for one specific site. Finally, a hidden Markov model (HMM) segmentation process is implemented to detect CNV regions. RESULTS: Based on public HapMap data, the results show that CNVhac effectively smoothes the genomic waves and facilitates more accurate raw CN estimates compared to other methods. Moreover, CNVhac alleviates, to a certain extent, the sample dependence of inference and makes CNV calling with appreciable low FDRs. CONCLUSION: CNVhac is an effective approach to address the common difficulties in SNP array analysis, and the working principles of CNVhac can be easily extended to other platforms.
format Online
Article
Text
id pubmed-3428662
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34286622012-08-30 Hybridization and amplification rate correction for affymetrix SNP arrays Wang, Quan Peng, Peichao Qian, Minping Wan, Lin Deng, Minghua BMC Med Genomics Research Article BACKGROUND: Copy number variation (CNV) is essential to understand the pathology of many complex diseases at the DNA level. Affymetrix SNP arrays, which are widely used for CNV studies, significantly depend on accurate copy number (CN) estimation. Nevertheless, CN estimation may be biased by several factors, including cross-hybridization and training sample batch, as well as genomic waves of intensities induced by sequence-dependent hybridization rate and amplification efficiency. Since many available algorithms only address one or two of the three factors, a high false discovery rate (FDR) often results when identifying CNV. Therefore, we have developed a new CNV detection pipeline which is based on hybridization and amplification rate correction (CNVhac). METHODS: CNVhac first estimates the allelic concentrations (ACs) of target sequences by using the sample independent parameters trained through physicochemical hybridization law. Then the raw CN is estimated by taking the ratio of AC to the corresponding average AC from a reference sample set for one specific site. Finally, a hidden Markov model (HMM) segmentation process is implemented to detect CNV regions. RESULTS: Based on public HapMap data, the results show that CNVhac effectively smoothes the genomic waves and facilitates more accurate raw CN estimates compared to other methods. Moreover, CNVhac alleviates, to a certain extent, the sample dependence of inference and makes CNV calling with appreciable low FDRs. CONCLUSION: CNVhac is an effective approach to address the common difficulties in SNP array analysis, and the working principles of CNVhac can be easily extended to other platforms. BioMed Central 2012-06-12 /pmc/articles/PMC3428662/ /pubmed/22691279 http://dx.doi.org/10.1186/1755-8794-5-24 Text en Copyright ©2012 Wang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Wang, Quan
Peng, Peichao
Qian, Minping
Wan, Lin
Deng, Minghua
Hybridization and amplification rate correction for affymetrix SNP arrays
title Hybridization and amplification rate correction for affymetrix SNP arrays
title_full Hybridization and amplification rate correction for affymetrix SNP arrays
title_fullStr Hybridization and amplification rate correction for affymetrix SNP arrays
title_full_unstemmed Hybridization and amplification rate correction for affymetrix SNP arrays
title_short Hybridization and amplification rate correction for affymetrix SNP arrays
title_sort hybridization and amplification rate correction for affymetrix snp arrays
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3428662/
https://www.ncbi.nlm.nih.gov/pubmed/22691279
http://dx.doi.org/10.1186/1755-8794-5-24
work_keys_str_mv AT wangquan hybridizationandamplificationratecorrectionforaffymetrixsnparrays
AT pengpeichao hybridizationandamplificationratecorrectionforaffymetrixsnparrays
AT qianminping hybridizationandamplificationratecorrectionforaffymetrixsnparrays
AT wanlin hybridizationandamplificationratecorrectionforaffymetrixsnparrays
AT dengminghua hybridizationandamplificationratecorrectionforaffymetrixsnparrays