Cargando…

Comparison of Pathway Analysis Approaches Using Lung Cancer GWAS Data Sets

Pathway analysis has been proposed as a complement to single SNP analyses in GWAS. This study compared pathway analysis methods using two lung cancer GWAS data sets based on four studies: one a combined data set from Central Europe and Toronto (CETO); the other a combined data set from Germany and M...

Descripción completa

Detalles Bibliográficos
Autores principales: Fehringer, Gordon, Liu, Geoffrey, Briollais, Laurent, Brennan, Paul, Amos, Christopher I., Spitz, Margaret R., Bickeböller, Heike, Wichmann, H. Erich, Risch, Angela, Hung, Rayjean J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3283683/
https://www.ncbi.nlm.nih.gov/pubmed/22363742
http://dx.doi.org/10.1371/journal.pone.0031816
_version_ 1782224234727079936
author Fehringer, Gordon
Liu, Geoffrey
Briollais, Laurent
Brennan, Paul
Amos, Christopher I.
Spitz, Margaret R.
Bickeböller, Heike
Wichmann, H. Erich
Risch, Angela
Hung, Rayjean J.
author_facet Fehringer, Gordon
Liu, Geoffrey
Briollais, Laurent
Brennan, Paul
Amos, Christopher I.
Spitz, Margaret R.
Bickeböller, Heike
Wichmann, H. Erich
Risch, Angela
Hung, Rayjean J.
author_sort Fehringer, Gordon
collection PubMed
description Pathway analysis has been proposed as a complement to single SNP analyses in GWAS. This study compared pathway analysis methods using two lung cancer GWAS data sets based on four studies: one a combined data set from Central Europe and Toronto (CETO); the other a combined data set from Germany and MD Anderson (GRMD). We searched the literature for pathway analysis methods that were widely used, representative of other methods, and had available software for performing analysis. We selected the programs EASE, which uses a modified Fishers Exact calculation to test for pathway associations, GenGen (a version of Gene Set Enrichment Analysis (GSEA)), which uses a Kolmogorov-Smirnov-like running sum statistic as the test statistic, and SLAT, which uses a p-value combination approach. We also included a modified version of the SUMSTAT method (mSUMSTAT), which tests for association by averaging χ(2) statistics from genotype association tests. There were nearly 18000 genes available for analysis, following mapping of more than 300,000 SNPs from each data set. These were mapped to 421 GO level 4 gene sets for pathway analysis. Among the methods designed to be robust to biases related to gene size and pathway SNP correlation (GenGen, mSUMSTAT and SLAT), the mSUMSTAT approach identified the most significant pathways (8 in CETO and 1 in GRMD). This included a highly plausible association for the acetylcholine receptor activity pathway in both CETO (FDR≤0.001) and GRMD (FDR = 0.009), although two strong association signals at a single gene cluster (CHRNA3-CHRNA5-CHRNB4) drive this result, complicating its interpretation. Few other replicated associations were found using any of these methods. Difficulty in replicating associations hindered our comparison, but results suggest mSUMSTAT has advantages over the other approaches, and may be a useful pathway analysis tool to use alongside other methods such as the commonly used GSEA (GenGen) approach.
format Online
Article
Text
id pubmed-3283683
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32836832012-02-23 Comparison of Pathway Analysis Approaches Using Lung Cancer GWAS Data Sets Fehringer, Gordon Liu, Geoffrey Briollais, Laurent Brennan, Paul Amos, Christopher I. Spitz, Margaret R. Bickeböller, Heike Wichmann, H. Erich Risch, Angela Hung, Rayjean J. PLoS One Research Article Pathway analysis has been proposed as a complement to single SNP analyses in GWAS. This study compared pathway analysis methods using two lung cancer GWAS data sets based on four studies: one a combined data set from Central Europe and Toronto (CETO); the other a combined data set from Germany and MD Anderson (GRMD). We searched the literature for pathway analysis methods that were widely used, representative of other methods, and had available software for performing analysis. We selected the programs EASE, which uses a modified Fishers Exact calculation to test for pathway associations, GenGen (a version of Gene Set Enrichment Analysis (GSEA)), which uses a Kolmogorov-Smirnov-like running sum statistic as the test statistic, and SLAT, which uses a p-value combination approach. We also included a modified version of the SUMSTAT method (mSUMSTAT), which tests for association by averaging χ(2) statistics from genotype association tests. There were nearly 18000 genes available for analysis, following mapping of more than 300,000 SNPs from each data set. These were mapped to 421 GO level 4 gene sets for pathway analysis. Among the methods designed to be robust to biases related to gene size and pathway SNP correlation (GenGen, mSUMSTAT and SLAT), the mSUMSTAT approach identified the most significant pathways (8 in CETO and 1 in GRMD). This included a highly plausible association for the acetylcholine receptor activity pathway in both CETO (FDR≤0.001) and GRMD (FDR = 0.009), although two strong association signals at a single gene cluster (CHRNA3-CHRNA5-CHRNB4) drive this result, complicating its interpretation. Few other replicated associations were found using any of these methods. Difficulty in replicating associations hindered our comparison, but results suggest mSUMSTAT has advantages over the other approaches, and may be a useful pathway analysis tool to use alongside other methods such as the commonly used GSEA (GenGen) approach. Public Library of Science 2012-02-21 /pmc/articles/PMC3283683/ /pubmed/22363742 http://dx.doi.org/10.1371/journal.pone.0031816 Text en Fehringer et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Fehringer, Gordon
Liu, Geoffrey
Briollais, Laurent
Brennan, Paul
Amos, Christopher I.
Spitz, Margaret R.
Bickeböller, Heike
Wichmann, H. Erich
Risch, Angela
Hung, Rayjean J.
Comparison of Pathway Analysis Approaches Using Lung Cancer GWAS Data Sets
title Comparison of Pathway Analysis Approaches Using Lung Cancer GWAS Data Sets
title_full Comparison of Pathway Analysis Approaches Using Lung Cancer GWAS Data Sets
title_fullStr Comparison of Pathway Analysis Approaches Using Lung Cancer GWAS Data Sets
title_full_unstemmed Comparison of Pathway Analysis Approaches Using Lung Cancer GWAS Data Sets
title_short Comparison of Pathway Analysis Approaches Using Lung Cancer GWAS Data Sets
title_sort comparison of pathway analysis approaches using lung cancer gwas data sets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3283683/
https://www.ncbi.nlm.nih.gov/pubmed/22363742
http://dx.doi.org/10.1371/journal.pone.0031816
work_keys_str_mv AT fehringergordon comparisonofpathwayanalysisapproachesusinglungcancergwasdatasets
AT liugeoffrey comparisonofpathwayanalysisapproachesusinglungcancergwasdatasets
AT briollaislaurent comparisonofpathwayanalysisapproachesusinglungcancergwasdatasets
AT brennanpaul comparisonofpathwayanalysisapproachesusinglungcancergwasdatasets
AT amoschristopheri comparisonofpathwayanalysisapproachesusinglungcancergwasdatasets
AT spitzmargaretr comparisonofpathwayanalysisapproachesusinglungcancergwasdatasets
AT bickebollerheike comparisonofpathwayanalysisapproachesusinglungcancergwasdatasets
AT wichmannherich comparisonofpathwayanalysisapproachesusinglungcancergwasdatasets
AT rischangela comparisonofpathwayanalysisapproachesusinglungcancergwasdatasets
AT hungrayjeanj comparisonofpathwayanalysisapproachesusinglungcancergwasdatasets