Cargando…

Functional Analysis: Evaluation of Response Intensities - Tailoring ANOVA for Lists of Expression Subsets

BACKGROUND: Microarray data is frequently used to characterize the expression profile of a whole genome and to compare the characteristics of that genome under several conditions. Geneset analysis methods have been described previously to analyze the expression values of several genes related by kno...

Descripción completa

Detalles Bibliográficos
Autores principales: Berger, Fabrice, De Meulder, Bertrand, Gaigneaux, Anthoula, Depiereux, Sophie, Bareke, Eric, Pierre, Michael, De Hertogh, Benoît, Delorenzi, Mauro, Depiereux, Eric
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2964684/
https://www.ncbi.nlm.nih.gov/pubmed/20942918
http://dx.doi.org/10.1186/1471-2105-11-510
_version_ 1782189411322036224
author Berger, Fabrice
De Meulder, Bertrand
Gaigneaux, Anthoula
Depiereux, Sophie
Bareke, Eric
Pierre, Michael
De Hertogh, Benoît
Delorenzi, Mauro
Depiereux, Eric
author_facet Berger, Fabrice
De Meulder, Bertrand
Gaigneaux, Anthoula
Depiereux, Sophie
Bareke, Eric
Pierre, Michael
De Hertogh, Benoît
Delorenzi, Mauro
Depiereux, Eric
author_sort Berger, Fabrice
collection PubMed
description BACKGROUND: Microarray data is frequently used to characterize the expression profile of a whole genome and to compare the characteristics of that genome under several conditions. Geneset analysis methods have been described previously to analyze the expression values of several genes related by known biological criteria (metabolic pathway, pathology signature, co-regulation by a common factor, etc.) at the same time and the cost of these methods allows for the use of more values to help discover the underlying biological mechanisms. RESULTS: As several methods assume different null hypotheses, we propose to reformulate the main question that biologists seek to answer. To determine which genesets are associated with expression values that differ between two experiments, we focused on three ad hoc criteria: expression levels, the direction of individual gene expression changes (up or down regulation), and correlations between genes. We introduce the FAERI methodology, tailored from a two-way ANOVA to examine these criteria. The significance of the results was evaluated according to the self-contained null hypothesis, using label sampling or by inferring the null distribution from normally distributed random data. Evaluations performed on simulated data revealed that FAERI outperforms currently available methods for each type of set tested. We then applied the FAERI method to analyze three real-world datasets on hypoxia response. FAERI was able to detect more genesets than other methodologies, and the genesets selected were coherent with current knowledge of cellular response to hypoxia. Moreover, the genesets selected by FAERI were confirmed when the analysis was repeated on two additional related datasets. CONCLUSIONS: The expression values of genesets are associated with several biological effects. The underlying mathematical structure of the genesets allows for analysis of data from several genes at the same time. Focusing on expression levels, the direction of the expression changes, and correlations, we showed that two-step data reduction allowed us to significantly improve the performance of geneset analysis using a modified two-way ANOVA procedure, and to detect genesets that current methods fail to detect.
format Text
id pubmed-2964684
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29646842010-10-29 Functional Analysis: Evaluation of Response Intensities - Tailoring ANOVA for Lists of Expression Subsets Berger, Fabrice De Meulder, Bertrand Gaigneaux, Anthoula Depiereux, Sophie Bareke, Eric Pierre, Michael De Hertogh, Benoît Delorenzi, Mauro Depiereux, Eric BMC Bioinformatics Research Article BACKGROUND: Microarray data is frequently used to characterize the expression profile of a whole genome and to compare the characteristics of that genome under several conditions. Geneset analysis methods have been described previously to analyze the expression values of several genes related by known biological criteria (metabolic pathway, pathology signature, co-regulation by a common factor, etc.) at the same time and the cost of these methods allows for the use of more values to help discover the underlying biological mechanisms. RESULTS: As several methods assume different null hypotheses, we propose to reformulate the main question that biologists seek to answer. To determine which genesets are associated with expression values that differ between two experiments, we focused on three ad hoc criteria: expression levels, the direction of individual gene expression changes (up or down regulation), and correlations between genes. We introduce the FAERI methodology, tailored from a two-way ANOVA to examine these criteria. The significance of the results was evaluated according to the self-contained null hypothesis, using label sampling or by inferring the null distribution from normally distributed random data. Evaluations performed on simulated data revealed that FAERI outperforms currently available methods for each type of set tested. We then applied the FAERI method to analyze three real-world datasets on hypoxia response. FAERI was able to detect more genesets than other methodologies, and the genesets selected were coherent with current knowledge of cellular response to hypoxia. Moreover, the genesets selected by FAERI were confirmed when the analysis was repeated on two additional related datasets. CONCLUSIONS: The expression values of genesets are associated with several biological effects. The underlying mathematical structure of the genesets allows for analysis of data from several genes at the same time. Focusing on expression levels, the direction of the expression changes, and correlations, we showed that two-step data reduction allowed us to significantly improve the performance of geneset analysis using a modified two-way ANOVA procedure, and to detect genesets that current methods fail to detect. BioMed Central 2010-10-13 /pmc/articles/PMC2964684/ /pubmed/20942918 http://dx.doi.org/10.1186/1471-2105-11-510 Text en Copyright ©2010 Berger et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Berger, Fabrice
De Meulder, Bertrand
Gaigneaux, Anthoula
Depiereux, Sophie
Bareke, Eric
Pierre, Michael
De Hertogh, Benoît
Delorenzi, Mauro
Depiereux, Eric
Functional Analysis: Evaluation of Response Intensities - Tailoring ANOVA for Lists of Expression Subsets
title Functional Analysis: Evaluation of Response Intensities - Tailoring ANOVA for Lists of Expression Subsets
title_full Functional Analysis: Evaluation of Response Intensities - Tailoring ANOVA for Lists of Expression Subsets
title_fullStr Functional Analysis: Evaluation of Response Intensities - Tailoring ANOVA for Lists of Expression Subsets
title_full_unstemmed Functional Analysis: Evaluation of Response Intensities - Tailoring ANOVA for Lists of Expression Subsets
title_short Functional Analysis: Evaluation of Response Intensities - Tailoring ANOVA for Lists of Expression Subsets
title_sort functional analysis: evaluation of response intensities - tailoring anova for lists of expression subsets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2964684/
https://www.ncbi.nlm.nih.gov/pubmed/20942918
http://dx.doi.org/10.1186/1471-2105-11-510
work_keys_str_mv AT bergerfabrice functionalanalysisevaluationofresponseintensitiestailoringanovaforlistsofexpressionsubsets
AT demeulderbertrand functionalanalysisevaluationofresponseintensitiestailoringanovaforlistsofexpressionsubsets
AT gaigneauxanthoula functionalanalysisevaluationofresponseintensitiestailoringanovaforlistsofexpressionsubsets
AT depiereuxsophie functionalanalysisevaluationofresponseintensitiestailoringanovaforlistsofexpressionsubsets
AT barekeeric functionalanalysisevaluationofresponseintensitiestailoringanovaforlistsofexpressionsubsets
AT pierremichael functionalanalysisevaluationofresponseintensitiestailoringanovaforlistsofexpressionsubsets
AT dehertoghbenoit functionalanalysisevaluationofresponseintensitiestailoringanovaforlistsofexpressionsubsets
AT delorenzimauro functionalanalysisevaluationofresponseintensitiestailoringanovaforlistsofexpressionsubsets
AT depiereuxeric functionalanalysisevaluationofresponseintensitiestailoringanovaforlistsofexpressionsubsets