Cargando…

Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods

Gene set methods aim to assess the overall evidence of association of a set of genes with a phenotype, such as disease or a quantitative trait. Multiple approaches for gene set analysis of expression data have been proposed. They can be divided into two types: competitive and self-contained. Benefit...

Descripción completa

Detalles Bibliográficos
Autores principales: Fridley, Brooke L., Jenkins, Gregory D., Biernacka, Joanna M.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2941449/
https://www.ncbi.nlm.nih.gov/pubmed/20862301
http://dx.doi.org/10.1371/journal.pone.0012693
_version_ 1782186898521849856
author Fridley, Brooke L.
Jenkins, Gregory D.
Biernacka, Joanna M.
author_facet Fridley, Brooke L.
Jenkins, Gregory D.
Biernacka, Joanna M.
author_sort Fridley, Brooke L.
collection PubMed
description Gene set methods aim to assess the overall evidence of association of a set of genes with a phenotype, such as disease or a quantitative trait. Multiple approaches for gene set analysis of expression data have been proposed. They can be divided into two types: competitive and self-contained. Benefits of self-contained methods include that they can be used for genome-wide, candidate gene, or pathway studies, and have been reported to be more powerful than competitive methods. We therefore investigated ten self-contained methods that can be used for continuous, discrete and time-to-event phenotypes. To assess the power and type I error rate for the various previously proposed and novel approaches, an extensive simulation study was completed in which the scenarios varied according to: number of genes in a gene set, number of genes associated with the phenotype, effect sizes, correlation between expression of genes within a gene set, and the sample size. In addition to the simulated data, the various methods were applied to a pharmacogenomic study of the drug gemcitabine. Simulation results demonstrated that overall Fisher's method and the global model with random effects have the highest power for a wide range of scenarios, while the analysis based on the first principal component and Kolmogorov-Smirnov test tended to have lowest power. The methods investigated here are likely to play an important role in identifying pathways that contribute to complex traits.
format Text
id pubmed-2941449
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-29414492010-09-22 Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods Fridley, Brooke L. Jenkins, Gregory D. Biernacka, Joanna M. PLoS One Research Article Gene set methods aim to assess the overall evidence of association of a set of genes with a phenotype, such as disease or a quantitative trait. Multiple approaches for gene set analysis of expression data have been proposed. They can be divided into two types: competitive and self-contained. Benefits of self-contained methods include that they can be used for genome-wide, candidate gene, or pathway studies, and have been reported to be more powerful than competitive methods. We therefore investigated ten self-contained methods that can be used for continuous, discrete and time-to-event phenotypes. To assess the power and type I error rate for the various previously proposed and novel approaches, an extensive simulation study was completed in which the scenarios varied according to: number of genes in a gene set, number of genes associated with the phenotype, effect sizes, correlation between expression of genes within a gene set, and the sample size. In addition to the simulated data, the various methods were applied to a pharmacogenomic study of the drug gemcitabine. Simulation results demonstrated that overall Fisher's method and the global model with random effects have the highest power for a wide range of scenarios, while the analysis based on the first principal component and Kolmogorov-Smirnov test tended to have lowest power. The methods investigated here are likely to play an important role in identifying pathways that contribute to complex traits. Public Library of Science 2010-09-17 /pmc/articles/PMC2941449/ /pubmed/20862301 http://dx.doi.org/10.1371/journal.pone.0012693 Text en Fridley et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Fridley, Brooke L.
Jenkins, Gregory D.
Biernacka, Joanna M.
Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods
title Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods
title_full Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods
title_fullStr Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods
title_full_unstemmed Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods
title_short Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods
title_sort self-contained gene-set analysis of expression data: an evaluation of existing and novel methods
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2941449/
https://www.ncbi.nlm.nih.gov/pubmed/20862301
http://dx.doi.org/10.1371/journal.pone.0012693
work_keys_str_mv AT fridleybrookel selfcontainedgenesetanalysisofexpressiondataanevaluationofexistingandnovelmethods
AT jenkinsgregoryd selfcontainedgenesetanalysisofexpressiondataanevaluationofexistingandnovelmethods
AT biernackajoannam selfcontainedgenesetanalysisofexpressiondataanevaluationofexistingandnovelmethods