Cargando…

G-stack modulated probe intensities on expression arrays - sequence corrections and signal calibration

BACKGROUND: The brightness of the probe spots on expression microarrays intends to measure the abundance of specific mRNA targets. Probes with runs of at least three guanines (G) in their sequence show abnormal high intensities which reflect rather probe effects than target concentrations. This G-bi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Fasold, Mario, Stadler, Peter F, Binder, Hans
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2010
Materias:	Research article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2884167/ https://www.ncbi.nlm.nih.gov/pubmed/20423484 http://dx.doi.org/10.1186/1471-2105-11-207

_version_	1782182311011287040
author	Fasold, Mario Stadler, Peter F Binder, Hans
author_facet	Fasold, Mario Stadler, Peter F Binder, Hans
author_sort	Fasold, Mario
collection	PubMed
description	BACKGROUND: The brightness of the probe spots on expression microarrays intends to measure the abundance of specific mRNA targets. Probes with runs of at least three guanines (G) in their sequence show abnormal high intensities which reflect rather probe effects than target concentrations. This G-bias requires correction prior to downstream expression analysis. RESULTS: Longer runs of three or more consecutive G along the probe sequence and in particular triple degenerated G at its solution end ((GGG)(1)-effect) are associated with exceptionally large probe intensities on GeneChip expression arrays. This intensity bias is related to non-specific hybridization and affects both perfect match and mismatch probes. The (GGG)(1)-effect tends to increase gradually for microarrays of later GeneChip generations. It was found for DNA/RNA as well as for DNA/DNA probe/target-hybridization chemistries. Amplification of sample RNA using T7-primers is associated with strong positive amplitudes of the G-bias whereas alternative amplification protocols using random primers give rise to much smaller and partly even negative amplitudes. We applied positional dependent sensitivity models to analyze the specifics of probe intensities in the context of all possible short sequence motifs of one to four adjacent nucleotides along the 25meric probe sequence. Most of the longer motifs are adequately described using a nearest-neighbor (NN) model. In contrast, runs of degenerated guanines require explicit consideration of next nearest neighbors (GGG terms). Preprocessing methods such as vsn, RMA, dChip, MAS5 and gcRMA only insufficiently remove the G-bias from data. CONCLUSIONS: Positional and motif dependent sensitivity models accounts for sequence effects of oligonucleotide probe intensities. We propose a positional dependent NN+GGG hybrid model to correct the intensity bias associated with probes containing poly-G motifs. It is implemented as a single-chip based calibration algorithm for GeneChips which can be applied in a pre-correction step prior to standard preprocessing.
format	Text
id	pubmed-2884167
institution	National Center for Biotechnology Information
language	English
publishDate	2010
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-28841672010-06-14 G-stack modulated probe intensities on expression arrays - sequence corrections and signal calibration Fasold, Mario Stadler, Peter F Binder, Hans BMC Bioinformatics Research article BACKGROUND: The brightness of the probe spots on expression microarrays intends to measure the abundance of specific mRNA targets. Probes with runs of at least three guanines (G) in their sequence show abnormal high intensities which reflect rather probe effects than target concentrations. This G-bias requires correction prior to downstream expression analysis. RESULTS: Longer runs of three or more consecutive G along the probe sequence and in particular triple degenerated G at its solution end ((GGG)(1)-effect) are associated with exceptionally large probe intensities on GeneChip expression arrays. This intensity bias is related to non-specific hybridization and affects both perfect match and mismatch probes. The (GGG)(1)-effect tends to increase gradually for microarrays of later GeneChip generations. It was found for DNA/RNA as well as for DNA/DNA probe/target-hybridization chemistries. Amplification of sample RNA using T7-primers is associated with strong positive amplitudes of the G-bias whereas alternative amplification protocols using random primers give rise to much smaller and partly even negative amplitudes. We applied positional dependent sensitivity models to analyze the specifics of probe intensities in the context of all possible short sequence motifs of one to four adjacent nucleotides along the 25meric probe sequence. Most of the longer motifs are adequately described using a nearest-neighbor (NN) model. In contrast, runs of degenerated guanines require explicit consideration of next nearest neighbors (GGG terms). Preprocessing methods such as vsn, RMA, dChip, MAS5 and gcRMA only insufficiently remove the G-bias from data. CONCLUSIONS: Positional and motif dependent sensitivity models accounts for sequence effects of oligonucleotide probe intensities. We propose a positional dependent NN+GGG hybrid model to correct the intensity bias associated with probes containing poly-G motifs. It is implemented as a single-chip based calibration algorithm for GeneChips which can be applied in a pre-correction step prior to standard preprocessing. BioMed Central 2010-04-27 /pmc/articles/PMC2884167/ /pubmed/20423484 http://dx.doi.org/10.1186/1471-2105-11-207 Text en Copyright ©2010 Fasold et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research article Fasold, Mario Stadler, Peter F Binder, Hans G-stack modulated probe intensities on expression arrays - sequence corrections and signal calibration
title	G-stack modulated probe intensities on expression arrays - sequence corrections and signal calibration
title_full	G-stack modulated probe intensities on expression arrays - sequence corrections and signal calibration
title_fullStr	G-stack modulated probe intensities on expression arrays - sequence corrections and signal calibration
title_full_unstemmed	G-stack modulated probe intensities on expression arrays - sequence corrections and signal calibration
title_short	G-stack modulated probe intensities on expression arrays - sequence corrections and signal calibration
title_sort	g-stack modulated probe intensities on expression arrays - sequence corrections and signal calibration
topic	Research article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2884167/ https://www.ncbi.nlm.nih.gov/pubmed/20423484 http://dx.doi.org/10.1186/1471-2105-11-207
work_keys_str_mv	AT fasoldmario gstackmodulatedprobeintensitiesonexpressionarrayssequencecorrectionsandsignalcalibration AT stadlerpeterf gstackmodulatedprobeintensitiesonexpressionarrayssequencecorrectionsandsignalcalibration AT binderhans gstackmodulatedprobeintensitiesonexpressionarrayssequencecorrectionsandsignalcalibration

G-stack modulated probe intensities on expression arrays - sequence corrections and signal calibration

Ejemplares similares