Cargando…

Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS

Gene set analysis (GSA) is useful in interpreting a genome-wide association study (GWAS) result in terms of biological mechanism. We compared the performance of two different GSA implementations that accept GWAS p-values of single nucleotide polymorphisms (SNPs) or gene-by-gene summaries thereof, GS...

Descripción completa

Detalles Bibliográficos
Autores principales: Kwon, Ji-sun, Kim, Jihye, Nam, Dougu, Kim, Sangsoo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korea Genome Organization 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3480679/
https://www.ncbi.nlm.nih.gov/pubmed/23105940
http://dx.doi.org/10.5808/GI.2012.10.2.123
_version_ 1782247596742410240
author Kwon, Ji-sun
Kim, Jihye
Nam, Dougu
Kim, Sangsoo
author_facet Kwon, Ji-sun
Kim, Jihye
Nam, Dougu
Kim, Sangsoo
author_sort Kwon, Ji-sun
collection PubMed
description Gene set analysis (GSA) is useful in interpreting a genome-wide association study (GWAS) result in terms of biological mechanism. We compared the performance of two different GSA implementations that accept GWAS p-values of single nucleotide polymorphisms (SNPs) or gene-by-gene summaries thereof, GSA-SNP and i-GSEA4GWAS, under the same settings of inputs and parameters. GSA runs were made with two sets of p-values from a Korean type 2 diabetes mellitus GWAS study: 259,188 and 1,152,947 SNPs of the original and imputed genotype datasets, respectively. When Gene Ontology terms were used as gene sets, i-GSEA4GWAS produced 283 and 1,070 hits for the unimputed and imputed datasets, respectively. On the other hand, GSA-SNP reported 94 and 38 hits, respectively, for both datasets. Similar, but to a lesser degree, trends were observed with Kyoto Encyclopedia of Genes and Genomes (KEGG) gene sets as well. The huge number of hits by i-GSEA4GWAS for the imputed dataset was probably an artifact due to the scaling step in the algorithm. The decrease in hits by GSA-SNP for the imputed dataset may be due to the fact that it relies on Z-statistics, which is sensitive to variations in the background level of associations. Judicious evaluation of the GSA outcomes, perhaps based on multiple programs, is recommended.
format Online
Article
Text
id pubmed-3480679
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Korea Genome Organization
record_format MEDLINE/PubMed
spelling pubmed-34806792012-10-26 Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS Kwon, Ji-sun Kim, Jihye Nam, Dougu Kim, Sangsoo Genomics Inf Article Gene set analysis (GSA) is useful in interpreting a genome-wide association study (GWAS) result in terms of biological mechanism. We compared the performance of two different GSA implementations that accept GWAS p-values of single nucleotide polymorphisms (SNPs) or gene-by-gene summaries thereof, GSA-SNP and i-GSEA4GWAS, under the same settings of inputs and parameters. GSA runs were made with two sets of p-values from a Korean type 2 diabetes mellitus GWAS study: 259,188 and 1,152,947 SNPs of the original and imputed genotype datasets, respectively. When Gene Ontology terms were used as gene sets, i-GSEA4GWAS produced 283 and 1,070 hits for the unimputed and imputed datasets, respectively. On the other hand, GSA-SNP reported 94 and 38 hits, respectively, for both datasets. Similar, but to a lesser degree, trends were observed with Kyoto Encyclopedia of Genes and Genomes (KEGG) gene sets as well. The huge number of hits by i-GSEA4GWAS for the imputed dataset was probably an artifact due to the scaling step in the algorithm. The decrease in hits by GSA-SNP for the imputed dataset may be due to the fact that it relies on Z-statistics, which is sensitive to variations in the background level of associations. Judicious evaluation of the GSA outcomes, perhaps based on multiple programs, is recommended. Korea Genome Organization 2012-06 2012-06-30 /pmc/articles/PMC3480679/ /pubmed/23105940 http://dx.doi.org/10.5808/GI.2012.10.2.123 Text en Copyright © 2012 by The Korea Genome Organization http://creativecommons.org/licenses/by-nc/3.0 It is identical to the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/).
spellingShingle Article
Kwon, Ji-sun
Kim, Jihye
Nam, Dougu
Kim, Sangsoo
Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS
title Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS
title_full Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS
title_fullStr Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS
title_full_unstemmed Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS
title_short Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS
title_sort performance comparison of two gene set analysis methods for genome-wide association study results: gsa-snp vs i-gsea4gwas
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3480679/
https://www.ncbi.nlm.nih.gov/pubmed/23105940
http://dx.doi.org/10.5808/GI.2012.10.2.123
work_keys_str_mv AT kwonjisun performancecomparisonoftwogenesetanalysismethodsforgenomewideassociationstudyresultsgsasnpvsigsea4gwas
AT kimjihye performancecomparisonoftwogenesetanalysismethodsforgenomewideassociationstudyresultsgsasnpvsigsea4gwas
AT namdougu performancecomparisonoftwogenesetanalysismethodsforgenomewideassociationstudyresultsgsasnpvsigsea4gwas
AT kimsangsoo performancecomparisonoftwogenesetanalysismethodsforgenomewideassociationstudyresultsgsasnpvsigsea4gwas