Cargando…

Mismatch and G-Stack Modulated Probe Signals on SNP Microarrays

BACKGROUND: Single nucleotide polymorphism (SNP) arrays are important tools widely used for genotyping and copy number estimation. This technology utilizes the specific affinity of fragmented DNA for binding to surface-attached oligonucleotide DNA probes. We analyze the variability of the probe sign...

Descripción completa

Detalles Bibliográficos
Autores principales: Binder, Hans, Fasold, Mario, Glomb, Torsten
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2775684/
https://www.ncbi.nlm.nih.gov/pubmed/19924253
http://dx.doi.org/10.1371/journal.pone.0007862
_version_ 1782174010062143488
author Binder, Hans
Fasold, Mario
Glomb, Torsten
author_facet Binder, Hans
Fasold, Mario
Glomb, Torsten
author_sort Binder, Hans
collection PubMed
description BACKGROUND: Single nucleotide polymorphism (SNP) arrays are important tools widely used for genotyping and copy number estimation. This technology utilizes the specific affinity of fragmented DNA for binding to surface-attached oligonucleotide DNA probes. We analyze the variability of the probe signals of Affymetrix GeneChip SNP arrays as a function of the probe sequence to identify relevant sequence motifs which potentially cause systematic biases of genotyping and copy number estimates. METHODOLOGY/PRINCIPAL FINDINGS: The probe design of GeneChip SNP arrays enables us to disentangle different sources of intensity modulations such as the number of mismatches per duplex, matched and mismatched base pairings including nearest and next-nearest neighbors and their position along the probe sequence. The effect of probe sequence was estimated in terms of triple-motifs with central matches and mismatches which include all 256 combinations of possible base pairings. The probe/target interactions on the chip can be decomposed into nearest neighbor contributions which correlate well with free energy terms of DNA/DNA-interactions in solution. The effect of mismatches is about twice as large as that of canonical pairings. Runs of guanines (G) and the particular type of mismatched pairings formed in cross-allelic probe/target duplexes constitute sources of systematic biases of the probe signals with consequences for genotyping and copy number estimates. The poly-G effect seems to be related to the crowded arrangement of probes which facilitates complex formation of neighboring probes with at minimum three adjacent G's in their sequence. CONCLUSIONS: The applied method of “triple-averaging” represents a model-free approach to estimate the mean intensity contributions of different sequence motifs which can be applied in calibration algorithms to correct signal values for sequence effects. Rules for appropriate sequence corrections are suggested.
format Text
id pubmed-2775684
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-27756842009-11-19 Mismatch and G-Stack Modulated Probe Signals on SNP Microarrays Binder, Hans Fasold, Mario Glomb, Torsten PLoS One Research Article BACKGROUND: Single nucleotide polymorphism (SNP) arrays are important tools widely used for genotyping and copy number estimation. This technology utilizes the specific affinity of fragmented DNA for binding to surface-attached oligonucleotide DNA probes. We analyze the variability of the probe signals of Affymetrix GeneChip SNP arrays as a function of the probe sequence to identify relevant sequence motifs which potentially cause systematic biases of genotyping and copy number estimates. METHODOLOGY/PRINCIPAL FINDINGS: The probe design of GeneChip SNP arrays enables us to disentangle different sources of intensity modulations such as the number of mismatches per duplex, matched and mismatched base pairings including nearest and next-nearest neighbors and their position along the probe sequence. The effect of probe sequence was estimated in terms of triple-motifs with central matches and mismatches which include all 256 combinations of possible base pairings. The probe/target interactions on the chip can be decomposed into nearest neighbor contributions which correlate well with free energy terms of DNA/DNA-interactions in solution. The effect of mismatches is about twice as large as that of canonical pairings. Runs of guanines (G) and the particular type of mismatched pairings formed in cross-allelic probe/target duplexes constitute sources of systematic biases of the probe signals with consequences for genotyping and copy number estimates. The poly-G effect seems to be related to the crowded arrangement of probes which facilitates complex formation of neighboring probes with at minimum three adjacent G's in their sequence. CONCLUSIONS: The applied method of “triple-averaging” represents a model-free approach to estimate the mean intensity contributions of different sequence motifs which can be applied in calibration algorithms to correct signal values for sequence effects. Rules for appropriate sequence corrections are suggested. Public Library of Science 2009-11-17 /pmc/articles/PMC2775684/ /pubmed/19924253 http://dx.doi.org/10.1371/journal.pone.0007862 Text en Binder et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Binder, Hans
Fasold, Mario
Glomb, Torsten
Mismatch and G-Stack Modulated Probe Signals on SNP Microarrays
title Mismatch and G-Stack Modulated Probe Signals on SNP Microarrays
title_full Mismatch and G-Stack Modulated Probe Signals on SNP Microarrays
title_fullStr Mismatch and G-Stack Modulated Probe Signals on SNP Microarrays
title_full_unstemmed Mismatch and G-Stack Modulated Probe Signals on SNP Microarrays
title_short Mismatch and G-Stack Modulated Probe Signals on SNP Microarrays
title_sort mismatch and g-stack modulated probe signals on snp microarrays
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2775684/
https://www.ncbi.nlm.nih.gov/pubmed/19924253
http://dx.doi.org/10.1371/journal.pone.0007862
work_keys_str_mv AT binderhans mismatchandgstackmodulatedprobesignalsonsnpmicroarrays
AT fasoldmario mismatchandgstackmodulatedprobesignalsonsnpmicroarrays
AT glombtorsten mismatchandgstackmodulatedprobesignalsonsnpmicroarrays