Cargando…

Empirical Bayes models for multiple probe type microarrays at the probe level

BACKGROUND: When analyzing microarray data a primary objective is often to find differentially expressed genes. With empirical Bayes and penalized t-tests the sample variances are adjusted towards a global estimate, producing more stable results compared to ordinary t-tests. However, for Affymetrix...

Descripción completa

Detalles Bibliográficos
Autores principales: Åstrand, Magnus, Mostad, Petter, Rudemo, Mats
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2358895/
https://www.ncbi.nlm.nih.gov/pubmed/18366694
http://dx.doi.org/10.1186/1471-2105-9-156
_version_ 1782152864921026560
author Åstrand, Magnus
Mostad, Petter
Rudemo, Mats
author_facet Åstrand, Magnus
Mostad, Petter
Rudemo, Mats
author_sort Åstrand, Magnus
collection PubMed
description BACKGROUND: When analyzing microarray data a primary objective is often to find differentially expressed genes. With empirical Bayes and penalized t-tests the sample variances are adjusted towards a global estimate, producing more stable results compared to ordinary t-tests. However, for Affymetrix type data a clear dependency between variability and intensity-level generally exists, even for logged intensities, most clearly for data at the probe level but also for probe-set summarizes such as the MAS5 expression index. As a consequence, adjustment towards a global estimate results in an intensity-level dependent false positive rate. RESULTS: We propose two new methods for finding differentially expressed genes, Probe level Locally moderated Weighted median-t (PLW) and Locally Moderated Weighted-t (LMW). Both methods use an empirical Bayes model taking the dependency between variability and intensity-level into account. A global covariance matrix is also used allowing for differing variances between arrays as well as array-to-array correlations. PLW is specially designed for Affymetrix type arrays (or other multiple-probe arrays). Instead of making inference on probe-set summaries, comparisons are made separately for each perfect-match probe and are then summarized into one score for the probe-set. CONCLUSION: The proposed methods are compared to 14 existing methods using five spike-in data sets. For RMA and GCRMA processed data, PLW has the most accurate ranking of regulated genes in four out of the five data sets, and LMW consistently performs better than all examined moderated t-tests when used on RMA, GCRMA, and MAS5 expression indexes.
format Text
id pubmed-2358895
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23588952008-04-29 Empirical Bayes models for multiple probe type microarrays at the probe level Åstrand, Magnus Mostad, Petter Rudemo, Mats BMC Bioinformatics Methodology Article BACKGROUND: When analyzing microarray data a primary objective is often to find differentially expressed genes. With empirical Bayes and penalized t-tests the sample variances are adjusted towards a global estimate, producing more stable results compared to ordinary t-tests. However, for Affymetrix type data a clear dependency between variability and intensity-level generally exists, even for logged intensities, most clearly for data at the probe level but also for probe-set summarizes such as the MAS5 expression index. As a consequence, adjustment towards a global estimate results in an intensity-level dependent false positive rate. RESULTS: We propose two new methods for finding differentially expressed genes, Probe level Locally moderated Weighted median-t (PLW) and Locally Moderated Weighted-t (LMW). Both methods use an empirical Bayes model taking the dependency between variability and intensity-level into account. A global covariance matrix is also used allowing for differing variances between arrays as well as array-to-array correlations. PLW is specially designed for Affymetrix type arrays (or other multiple-probe arrays). Instead of making inference on probe-set summaries, comparisons are made separately for each perfect-match probe and are then summarized into one score for the probe-set. CONCLUSION: The proposed methods are compared to 14 existing methods using five spike-in data sets. For RMA and GCRMA processed data, PLW has the most accurate ranking of regulated genes in four out of the five data sets, and LMW consistently performs better than all examined moderated t-tests when used on RMA, GCRMA, and MAS5 expression indexes. BioMed Central 2008-03-20 /pmc/articles/PMC2358895/ /pubmed/18366694 http://dx.doi.org/10.1186/1471-2105-9-156 Text en Copyright © 2008 Åstrand et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Åstrand, Magnus
Mostad, Petter
Rudemo, Mats
Empirical Bayes models for multiple probe type microarrays at the probe level
title Empirical Bayes models for multiple probe type microarrays at the probe level
title_full Empirical Bayes models for multiple probe type microarrays at the probe level
title_fullStr Empirical Bayes models for multiple probe type microarrays at the probe level
title_full_unstemmed Empirical Bayes models for multiple probe type microarrays at the probe level
title_short Empirical Bayes models for multiple probe type microarrays at the probe level
title_sort empirical bayes models for multiple probe type microarrays at the probe level
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2358895/
https://www.ncbi.nlm.nih.gov/pubmed/18366694
http://dx.doi.org/10.1186/1471-2105-9-156
work_keys_str_mv AT astrandmagnus empiricalbayesmodelsformultipleprobetypemicroarraysattheprobelevel
AT mostadpetter empiricalbayesmodelsformultipleprobetypemicroarraysattheprobelevel
AT rudemomats empiricalbayesmodelsformultipleprobetypemicroarraysattheprobelevel