Cargando…

Probe set algorithms: is there a rational best bet?

Affymetrix microarrays have become a standard experimental platform for studies of mRNA expression profiling. Their success is due, in part, to the multiple oligonucleotide features (probes) against each transcript (probe set). This multiple testing allows for more robust background assessments and...

Descripción completa

Detalles Bibliográficos
Autores principales: Seo, Jinwook, Hoffman, Eric P
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1569879/
https://www.ncbi.nlm.nih.gov/pubmed/16942624
http://dx.doi.org/10.1186/1471-2105-7-395
_version_ 1782130228304281600
author Seo, Jinwook
Hoffman, Eric P
author_facet Seo, Jinwook
Hoffman, Eric P
author_sort Seo, Jinwook
collection PubMed
description Affymetrix microarrays have become a standard experimental platform for studies of mRNA expression profiling. Their success is due, in part, to the multiple oligonucleotide features (probes) against each transcript (probe set). This multiple testing allows for more robust background assessments and gene expression measures, and has permitted the development of many computational methods to translate image data into a single normalized "signal" for mRNA transcript abundance. There are now many probe set algorithms that have been developed, with a gradual movement away from chip-by-chip methods (MAS5), to project-based model-fitting methods (dCHIP, RMA, others). Data interpretation is often profoundly changed by choice of algorithm, with disoriented biologists questioning what the "accurate" interpretation of their experiment is. Here, we summarize the debate concerning probe set algorithms. We provide examples of how changes in mismatch weight, normalizations, and construction of expression ratios each dramatically change data interpretation. All interpretations can be considered as computationally appropriate, but with varying biological credibility. We also illustrate the performance of two new hybrid algorithms (PLIER, GC-RMA) relative to more traditional algorithms (dCHIP, MAS5, Probe Profiler PCA, RMA) using an interactive power analysis tool. PLIER appears superior to other algorithms in avoiding false positives with poorly performing probe sets. Based on our interpretation of the literature, and examples presented here, we suggest that the variability in performance of probe set algorithms is more dependent upon assumptions regarding "background", than on calculations of "signal". We argue that "background" is an enormously complex variable that can only be vaguely quantified, and thus the "best" probe set algorithm will vary from project to project.
format Text
id pubmed-1569879
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15698792006-09-16 Probe set algorithms: is there a rational best bet? Seo, Jinwook Hoffman, Eric P BMC Bioinformatics Commentary Affymetrix microarrays have become a standard experimental platform for studies of mRNA expression profiling. Their success is due, in part, to the multiple oligonucleotide features (probes) against each transcript (probe set). This multiple testing allows for more robust background assessments and gene expression measures, and has permitted the development of many computational methods to translate image data into a single normalized "signal" for mRNA transcript abundance. There are now many probe set algorithms that have been developed, with a gradual movement away from chip-by-chip methods (MAS5), to project-based model-fitting methods (dCHIP, RMA, others). Data interpretation is often profoundly changed by choice of algorithm, with disoriented biologists questioning what the "accurate" interpretation of their experiment is. Here, we summarize the debate concerning probe set algorithms. We provide examples of how changes in mismatch weight, normalizations, and construction of expression ratios each dramatically change data interpretation. All interpretations can be considered as computationally appropriate, but with varying biological credibility. We also illustrate the performance of two new hybrid algorithms (PLIER, GC-RMA) relative to more traditional algorithms (dCHIP, MAS5, Probe Profiler PCA, RMA) using an interactive power analysis tool. PLIER appears superior to other algorithms in avoiding false positives with poorly performing probe sets. Based on our interpretation of the literature, and examples presented here, we suggest that the variability in performance of probe set algorithms is more dependent upon assumptions regarding "background", than on calculations of "signal". We argue that "background" is an enormously complex variable that can only be vaguely quantified, and thus the "best" probe set algorithm will vary from project to project. BioMed Central 2006-08-30 /pmc/articles/PMC1569879/ /pubmed/16942624 http://dx.doi.org/10.1186/1471-2105-7-395 Text en Copyright © 2006 Seo and Hoffman; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Commentary
Seo, Jinwook
Hoffman, Eric P
Probe set algorithms: is there a rational best bet?
title Probe set algorithms: is there a rational best bet?
title_full Probe set algorithms: is there a rational best bet?
title_fullStr Probe set algorithms: is there a rational best bet?
title_full_unstemmed Probe set algorithms: is there a rational best bet?
title_short Probe set algorithms: is there a rational best bet?
title_sort probe set algorithms: is there a rational best bet?
topic Commentary
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1569879/
https://www.ncbi.nlm.nih.gov/pubmed/16942624
http://dx.doi.org/10.1186/1471-2105-7-395
work_keys_str_mv AT seojinwook probesetalgorithmsistherearationalbestbet
AT hoffmanericp probesetalgorithmsistherearationalbestbet