Cargando…
Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation
Affymetrix SNP arrays have been widely used for single-nucleotide polymorphism (SNP) genotype calling and DNA copy number variation inference. Although numerous methods have achieved high accuracy in these fields, most studies have paid little attention to the modeling of hybridization of probes to...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2761258/ https://www.ncbi.nlm.nih.gov/pubmed/19586935 http://dx.doi.org/10.1093/nar/gkp559 |
_version_ | 1782172815499198464 |
---|---|
author | Wan, Lin Sun, Kelian Ding, Qi Cui, Yuehua Li, Ming Wen, Yalu Elston, Robert C. Qian, Minping Fu, Wenjiang J |
author_facet | Wan, Lin Sun, Kelian Ding, Qi Cui, Yuehua Li, Ming Wen, Yalu Elston, Robert C. Qian, Minping Fu, Wenjiang J |
author_sort | Wan, Lin |
collection | PubMed |
description | Affymetrix SNP arrays have been widely used for single-nucleotide polymorphism (SNP) genotype calling and DNA copy number variation inference. Although numerous methods have achieved high accuracy in these fields, most studies have paid little attention to the modeling of hybridization of probes to off-target allele sequences, which can affect the accuracy greatly. In this study, we address this issue and demonstrate that hybridization with mismatch nucleotides (HWMMN) occurs in all SNP probe-sets and has a critical effect on the estimation of allelic concentrations (ACs). We study sequence binding through binding free energy and then binding affinity, and develop a probe intensity composite representation (PICR) model. The PICR model allows the estimation of ACs at a given SNP through statistical regression. Furthermore, we demonstrate with cell-line data of known true copy numbers that the PICR model can achieve reasonable accuracy in copy number estimation at a single SNP locus, by using the ratio of the estimated AC of each sample to that of the reference sample, and can reveal subtle genotype structure of SNPs at abnormal loci. We also demonstrate with HapMap data that the PICR model yields accurate SNP genotype calls consistently across samples, laboratories and even across array platforms. |
format | Text |
id | pubmed-2761258 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-27612582009-10-14 Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation Wan, Lin Sun, Kelian Ding, Qi Cui, Yuehua Li, Ming Wen, Yalu Elston, Robert C. Qian, Minping Fu, Wenjiang J Nucleic Acids Res Methods Online Affymetrix SNP arrays have been widely used for single-nucleotide polymorphism (SNP) genotype calling and DNA copy number variation inference. Although numerous methods have achieved high accuracy in these fields, most studies have paid little attention to the modeling of hybridization of probes to off-target allele sequences, which can affect the accuracy greatly. In this study, we address this issue and demonstrate that hybridization with mismatch nucleotides (HWMMN) occurs in all SNP probe-sets and has a critical effect on the estimation of allelic concentrations (ACs). We study sequence binding through binding free energy and then binding affinity, and develop a probe intensity composite representation (PICR) model. The PICR model allows the estimation of ACs at a given SNP through statistical regression. Furthermore, we demonstrate with cell-line data of known true copy numbers that the PICR model can achieve reasonable accuracy in copy number estimation at a single SNP locus, by using the ratio of the estimated AC of each sample to that of the reference sample, and can reveal subtle genotype structure of SNPs at abnormal loci. We also demonstrate with HapMap data that the PICR model yields accurate SNP genotype calls consistently across samples, laboratories and even across array platforms. Oxford University Press 2009-09 2009-07-07 /pmc/articles/PMC2761258/ /pubmed/19586935 http://dx.doi.org/10.1093/nar/gkp559 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Wan, Lin Sun, Kelian Ding, Qi Cui, Yuehua Li, Ming Wen, Yalu Elston, Robert C. Qian, Minping Fu, Wenjiang J Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation |
title | Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation |
title_full | Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation |
title_fullStr | Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation |
title_full_unstemmed | Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation |
title_short | Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation |
title_sort | hybridization modeling of oligonucleotide snp arrays for accurate dna copy number estimation |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2761258/ https://www.ncbi.nlm.nih.gov/pubmed/19586935 http://dx.doi.org/10.1093/nar/gkp559 |
work_keys_str_mv | AT wanlin hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation AT sunkelian hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation AT dingqi hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation AT cuiyuehua hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation AT liming hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation AT wenyalu hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation AT elstonrobertc hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation AT qianminping hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation AT fuwenjiangj hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation |