Cargando…

Quantitative set analysis for gene expression: a method to quantify gene set differential expression including gene-gene correlations

Enrichment analysis of gene sets is a popular approach that provides a functional interpretation of genome-wide expression data. Existing tests are affected by inter-gene correlations, resulting in a high Type I error. The most widely used test, Gene Set Enrichment Analysis, relies on computationall...

Descripción completa

Detalles Bibliográficos
Autores principales: Yaari, Gur, Bolen, Christopher R., Thakar, Juilee, Kleinstein, Steven H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3794608/
https://www.ncbi.nlm.nih.gov/pubmed/23921631
http://dx.doi.org/10.1093/nar/gkt660
_version_ 1782287229123559424
author Yaari, Gur
Bolen, Christopher R.
Thakar, Juilee
Kleinstein, Steven H.
author_facet Yaari, Gur
Bolen, Christopher R.
Thakar, Juilee
Kleinstein, Steven H.
author_sort Yaari, Gur
collection PubMed
description Enrichment analysis of gene sets is a popular approach that provides a functional interpretation of genome-wide expression data. Existing tests are affected by inter-gene correlations, resulting in a high Type I error. The most widely used test, Gene Set Enrichment Analysis, relies on computationally intensive permutations of sample labels to generate a null distribution that preserves gene–gene correlations. A more recent approach, CAMERA, attempts to correct for these correlations by estimating a variance inflation factor directly from the data. Although these methods generate P-values for detecting gene set activity, they are unable to produce confidence intervals or allow for post hoc comparisons. We have developed a new computational framework for Quantitative Set Analysis of Gene Expression (QuSAGE). QuSAGE accounts for inter-gene correlations, improves the estimation of the variance inflation factor and, rather than evaluating the deviation from a null hypothesis with a P-value, it quantifies gene-set activity with a complete probability density function. From this probability density function, P-values and confidence intervals can be extracted and post hoc analysis can be carried out while maintaining statistical traceability. Compared with Gene Set Enrichment Analysis and CAMERA, QuSAGE exhibits better sensitivity and specificity on real data profiling the response to interferon therapy (in chronic Hepatitis C virus patients) and Influenza A virus infection. QuSAGE is available as an R package, which includes the core functions for the method as well as functions to plot and visualize the results.
format Online
Article
Text
id pubmed-3794608
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-37946082013-10-21 Quantitative set analysis for gene expression: a method to quantify gene set differential expression including gene-gene correlations Yaari, Gur Bolen, Christopher R. Thakar, Juilee Kleinstein, Steven H. Nucleic Acids Res Methods Online Enrichment analysis of gene sets is a popular approach that provides a functional interpretation of genome-wide expression data. Existing tests are affected by inter-gene correlations, resulting in a high Type I error. The most widely used test, Gene Set Enrichment Analysis, relies on computationally intensive permutations of sample labels to generate a null distribution that preserves gene–gene correlations. A more recent approach, CAMERA, attempts to correct for these correlations by estimating a variance inflation factor directly from the data. Although these methods generate P-values for detecting gene set activity, they are unable to produce confidence intervals or allow for post hoc comparisons. We have developed a new computational framework for Quantitative Set Analysis of Gene Expression (QuSAGE). QuSAGE accounts for inter-gene correlations, improves the estimation of the variance inflation factor and, rather than evaluating the deviation from a null hypothesis with a P-value, it quantifies gene-set activity with a complete probability density function. From this probability density function, P-values and confidence intervals can be extracted and post hoc analysis can be carried out while maintaining statistical traceability. Compared with Gene Set Enrichment Analysis and CAMERA, QuSAGE exhibits better sensitivity and specificity on real data profiling the response to interferon therapy (in chronic Hepatitis C virus patients) and Influenza A virus infection. QuSAGE is available as an R package, which includes the core functions for the method as well as functions to plot and visualize the results. Oxford University Press 2013-10 2013-08-05 /pmc/articles/PMC3794608/ /pubmed/23921631 http://dx.doi.org/10.1093/nar/gkt660 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Yaari, Gur
Bolen, Christopher R.
Thakar, Juilee
Kleinstein, Steven H.
Quantitative set analysis for gene expression: a method to quantify gene set differential expression including gene-gene correlations
title Quantitative set analysis for gene expression: a method to quantify gene set differential expression including gene-gene correlations
title_full Quantitative set analysis for gene expression: a method to quantify gene set differential expression including gene-gene correlations
title_fullStr Quantitative set analysis for gene expression: a method to quantify gene set differential expression including gene-gene correlations
title_full_unstemmed Quantitative set analysis for gene expression: a method to quantify gene set differential expression including gene-gene correlations
title_short Quantitative set analysis for gene expression: a method to quantify gene set differential expression including gene-gene correlations
title_sort quantitative set analysis for gene expression: a method to quantify gene set differential expression including gene-gene correlations
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3794608/
https://www.ncbi.nlm.nih.gov/pubmed/23921631
http://dx.doi.org/10.1093/nar/gkt660
work_keys_str_mv AT yaarigur quantitativesetanalysisforgeneexpressionamethodtoquantifygenesetdifferentialexpressionincludinggenegenecorrelations
AT bolenchristopherr quantitativesetanalysisforgeneexpressionamethodtoquantifygenesetdifferentialexpressionincludinggenegenecorrelations
AT thakarjuilee quantitativesetanalysisforgeneexpressionamethodtoquantifygenesetdifferentialexpressionincludinggenegenecorrelations
AT kleinsteinstevenh quantitativesetanalysisforgeneexpressionamethodtoquantifygenesetdifferentialexpressionincludinggenegenecorrelations