Cargando…

Mulcom: a multiple comparison statistical test for microarray data in Bioconductor

BACKGROUND: Many microarray experiments search for genes with differential expression between a common "reference" group and multiple "test" groups. In such cases currently employed statistical approaches based on t-tests or close derivatives have limited efficacy, mainly because...

Descripción completa

Detalles Bibliográficos
Autores principales: Isella, Claudio, Renzulli, Tommaso, Corà, Davide, Medico, Enzo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3230912/
https://www.ncbi.nlm.nih.gov/pubmed/21955789
http://dx.doi.org/10.1186/1471-2105-12-382
_version_ 1782218105476349952
author Isella, Claudio
Renzulli, Tommaso
Corà, Davide
Medico, Enzo
author_facet Isella, Claudio
Renzulli, Tommaso
Corà, Davide
Medico, Enzo
author_sort Isella, Claudio
collection PubMed
description BACKGROUND: Many microarray experiments search for genes with differential expression between a common "reference" group and multiple "test" groups. In such cases currently employed statistical approaches based on t-tests or close derivatives have limited efficacy, mainly because estimation of the standard error is done on only two groups at a time. Alternative approaches based on ANOVA correctly capture within-group variance from all the groups, but then do not confront single test groups with the reference. Ideally, a t-test better suited for this type of data would compare each test group with the reference, but use within-group variance calculated from all the groups. RESULTS: We implemented an R-Bioconductor package named Mulcom, with a statistical test derived from the Dunnett's t-test, designed to compare multiple test groups individually against a common reference. Interestingly, the Dunnett's test uses for the denominator of each comparison a within-group standard error aggregated from all the experimental groups. In addition to the basic Dunnett's t value, the package includes an optional minimal fold-change threshold, m. Due to the automated, permutation-based estimation of False Discovery Rate (FDR), the package also permits fast optimization of the test, to obtain the maximum number of significant genes at a given FDR value. When applied to a time-course experiment profiled in parallel on two microarray platforms, and compared with two commonly used tests, Mulcom displayed better concordance of significant genes in the two array platforms (39% vs. 26% or 15%), and higher enrichment in functional annotation to categories related to the biology of the experiment (p value < 0.001 in 4 categories vs. 3). CONCLUSIONS: The Mulcom package provides a powerful tool for the identification of differentially expressed genes when several experimental conditions are compared against a common reference. The results of the practical example presented here show that lists of differentially expressed genes generated by Mulcom are particularly consistent across microarray platforms and enriched in genes belonging to functionally significant groups.
format Online
Article
Text
id pubmed-3230912
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32309122011-12-07 Mulcom: a multiple comparison statistical test for microarray data in Bioconductor Isella, Claudio Renzulli, Tommaso Corà, Davide Medico, Enzo BMC Bioinformatics Software BACKGROUND: Many microarray experiments search for genes with differential expression between a common "reference" group and multiple "test" groups. In such cases currently employed statistical approaches based on t-tests or close derivatives have limited efficacy, mainly because estimation of the standard error is done on only two groups at a time. Alternative approaches based on ANOVA correctly capture within-group variance from all the groups, but then do not confront single test groups with the reference. Ideally, a t-test better suited for this type of data would compare each test group with the reference, but use within-group variance calculated from all the groups. RESULTS: We implemented an R-Bioconductor package named Mulcom, with a statistical test derived from the Dunnett's t-test, designed to compare multiple test groups individually against a common reference. Interestingly, the Dunnett's test uses for the denominator of each comparison a within-group standard error aggregated from all the experimental groups. In addition to the basic Dunnett's t value, the package includes an optional minimal fold-change threshold, m. Due to the automated, permutation-based estimation of False Discovery Rate (FDR), the package also permits fast optimization of the test, to obtain the maximum number of significant genes at a given FDR value. When applied to a time-course experiment profiled in parallel on two microarray platforms, and compared with two commonly used tests, Mulcom displayed better concordance of significant genes in the two array platforms (39% vs. 26% or 15%), and higher enrichment in functional annotation to categories related to the biology of the experiment (p value < 0.001 in 4 categories vs. 3). CONCLUSIONS: The Mulcom package provides a powerful tool for the identification of differentially expressed genes when several experimental conditions are compared against a common reference. The results of the practical example presented here show that lists of differentially expressed genes generated by Mulcom are particularly consistent across microarray platforms and enriched in genes belonging to functionally significant groups. BioMed Central 2011-09-28 /pmc/articles/PMC3230912/ /pubmed/21955789 http://dx.doi.org/10.1186/1471-2105-12-382 Text en Copyright © 2011 Isella et al; licensee BioMed Central Ltd. https://creativecommons.org/licenses/by/2.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0 (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Isella, Claudio
Renzulli, Tommaso
Corà, Davide
Medico, Enzo
Mulcom: a multiple comparison statistical test for microarray data in Bioconductor
title Mulcom: a multiple comparison statistical test for microarray data in Bioconductor
title_full Mulcom: a multiple comparison statistical test for microarray data in Bioconductor
title_fullStr Mulcom: a multiple comparison statistical test for microarray data in Bioconductor
title_full_unstemmed Mulcom: a multiple comparison statistical test for microarray data in Bioconductor
title_short Mulcom: a multiple comparison statistical test for microarray data in Bioconductor
title_sort mulcom: a multiple comparison statistical test for microarray data in bioconductor
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3230912/
https://www.ncbi.nlm.nih.gov/pubmed/21955789
http://dx.doi.org/10.1186/1471-2105-12-382
work_keys_str_mv AT isellaclaudio mulcomamultiplecomparisonstatisticaltestformicroarraydatainbioconductor
AT renzullitommaso mulcomamultiplecomparisonstatisticaltestformicroarraydatainbioconductor
AT coradavide mulcomamultiplecomparisonstatisticaltestformicroarraydatainbioconductor
AT medicoenzo mulcomamultiplecomparisonstatisticaltestformicroarraydatainbioconductor