Cargando…

Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation

Affymetrix SNP arrays have been widely used for single-nucleotide polymorphism (SNP) genotype calling and DNA copy number variation inference. Although numerous methods have achieved high accuracy in these fields, most studies have paid little attention to the modeling of hybridization of probes to...

Descripción completa

Detalles Bibliográficos
Autores principales: Wan, Lin, Sun, Kelian, Ding, Qi, Cui, Yuehua, Li, Ming, Wen, Yalu, Elston, Robert C., Qian, Minping, Fu, Wenjiang J
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2761258/
https://www.ncbi.nlm.nih.gov/pubmed/19586935
http://dx.doi.org/10.1093/nar/gkp559
_version_ 1782172815499198464
author Wan, Lin
Sun, Kelian
Ding, Qi
Cui, Yuehua
Li, Ming
Wen, Yalu
Elston, Robert C.
Qian, Minping
Fu, Wenjiang J
author_facet Wan, Lin
Sun, Kelian
Ding, Qi
Cui, Yuehua
Li, Ming
Wen, Yalu
Elston, Robert C.
Qian, Minping
Fu, Wenjiang J
author_sort Wan, Lin
collection PubMed
description Affymetrix SNP arrays have been widely used for single-nucleotide polymorphism (SNP) genotype calling and DNA copy number variation inference. Although numerous methods have achieved high accuracy in these fields, most studies have paid little attention to the modeling of hybridization of probes to off-target allele sequences, which can affect the accuracy greatly. In this study, we address this issue and demonstrate that hybridization with mismatch nucleotides (HWMMN) occurs in all SNP probe-sets and has a critical effect on the estimation of allelic concentrations (ACs). We study sequence binding through binding free energy and then binding affinity, and develop a probe intensity composite representation (PICR) model. The PICR model allows the estimation of ACs at a given SNP through statistical regression. Furthermore, we demonstrate with cell-line data of known true copy numbers that the PICR model can achieve reasonable accuracy in copy number estimation at a single SNP locus, by using the ratio of the estimated AC of each sample to that of the reference sample, and can reveal subtle genotype structure of SNPs at abnormal loci. We also demonstrate with HapMap data that the PICR model yields accurate SNP genotype calls consistently across samples, laboratories and even across array platforms.
format Text
id pubmed-2761258
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-27612582009-10-14 Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation Wan, Lin Sun, Kelian Ding, Qi Cui, Yuehua Li, Ming Wen, Yalu Elston, Robert C. Qian, Minping Fu, Wenjiang J Nucleic Acids Res Methods Online Affymetrix SNP arrays have been widely used for single-nucleotide polymorphism (SNP) genotype calling and DNA copy number variation inference. Although numerous methods have achieved high accuracy in these fields, most studies have paid little attention to the modeling of hybridization of probes to off-target allele sequences, which can affect the accuracy greatly. In this study, we address this issue and demonstrate that hybridization with mismatch nucleotides (HWMMN) occurs in all SNP probe-sets and has a critical effect on the estimation of allelic concentrations (ACs). We study sequence binding through binding free energy and then binding affinity, and develop a probe intensity composite representation (PICR) model. The PICR model allows the estimation of ACs at a given SNP through statistical regression. Furthermore, we demonstrate with cell-line data of known true copy numbers that the PICR model can achieve reasonable accuracy in copy number estimation at a single SNP locus, by using the ratio of the estimated AC of each sample to that of the reference sample, and can reveal subtle genotype structure of SNPs at abnormal loci. We also demonstrate with HapMap data that the PICR model yields accurate SNP genotype calls consistently across samples, laboratories and even across array platforms. Oxford University Press 2009-09 2009-07-07 /pmc/articles/PMC2761258/ /pubmed/19586935 http://dx.doi.org/10.1093/nar/gkp559 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Wan, Lin
Sun, Kelian
Ding, Qi
Cui, Yuehua
Li, Ming
Wen, Yalu
Elston, Robert C.
Qian, Minping
Fu, Wenjiang J
Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation
title Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation
title_full Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation
title_fullStr Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation
title_full_unstemmed Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation
title_short Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation
title_sort hybridization modeling of oligonucleotide snp arrays for accurate dna copy number estimation
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2761258/
https://www.ncbi.nlm.nih.gov/pubmed/19586935
http://dx.doi.org/10.1093/nar/gkp559
work_keys_str_mv AT wanlin hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation
AT sunkelian hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation
AT dingqi hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation
AT cuiyuehua hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation
AT liming hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation
AT wenyalu hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation
AT elstonrobertc hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation
AT qianminping hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation
AT fuwenjiangj hybridizationmodelingofoligonucleotidesnparraysforaccuratednacopynumberestimation