Cargando…
Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS
Gene set analysis (GSA) is useful in interpreting a genome-wide association study (GWAS) result in terms of biological mechanism. We compared the performance of two different GSA implementations that accept GWAS p-values of single nucleotide polymorphisms (SNPs) or gene-by-gene summaries thereof, GS...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Korea Genome Organization
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3480679/ https://www.ncbi.nlm.nih.gov/pubmed/23105940 http://dx.doi.org/10.5808/GI.2012.10.2.123 |
_version_ | 1782247596742410240 |
---|---|
author | Kwon, Ji-sun Kim, Jihye Nam, Dougu Kim, Sangsoo |
author_facet | Kwon, Ji-sun Kim, Jihye Nam, Dougu Kim, Sangsoo |
author_sort | Kwon, Ji-sun |
collection | PubMed |
description | Gene set analysis (GSA) is useful in interpreting a genome-wide association study (GWAS) result in terms of biological mechanism. We compared the performance of two different GSA implementations that accept GWAS p-values of single nucleotide polymorphisms (SNPs) or gene-by-gene summaries thereof, GSA-SNP and i-GSEA4GWAS, under the same settings of inputs and parameters. GSA runs were made with two sets of p-values from a Korean type 2 diabetes mellitus GWAS study: 259,188 and 1,152,947 SNPs of the original and imputed genotype datasets, respectively. When Gene Ontology terms were used as gene sets, i-GSEA4GWAS produced 283 and 1,070 hits for the unimputed and imputed datasets, respectively. On the other hand, GSA-SNP reported 94 and 38 hits, respectively, for both datasets. Similar, but to a lesser degree, trends were observed with Kyoto Encyclopedia of Genes and Genomes (KEGG) gene sets as well. The huge number of hits by i-GSEA4GWAS for the imputed dataset was probably an artifact due to the scaling step in the algorithm. The decrease in hits by GSA-SNP for the imputed dataset may be due to the fact that it relies on Z-statistics, which is sensitive to variations in the background level of associations. Judicious evaluation of the GSA outcomes, perhaps based on multiple programs, is recommended. |
format | Online Article Text |
id | pubmed-3480679 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Korea Genome Organization |
record_format | MEDLINE/PubMed |
spelling | pubmed-34806792012-10-26 Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS Kwon, Ji-sun Kim, Jihye Nam, Dougu Kim, Sangsoo Genomics Inf Article Gene set analysis (GSA) is useful in interpreting a genome-wide association study (GWAS) result in terms of biological mechanism. We compared the performance of two different GSA implementations that accept GWAS p-values of single nucleotide polymorphisms (SNPs) or gene-by-gene summaries thereof, GSA-SNP and i-GSEA4GWAS, under the same settings of inputs and parameters. GSA runs were made with two sets of p-values from a Korean type 2 diabetes mellitus GWAS study: 259,188 and 1,152,947 SNPs of the original and imputed genotype datasets, respectively. When Gene Ontology terms were used as gene sets, i-GSEA4GWAS produced 283 and 1,070 hits for the unimputed and imputed datasets, respectively. On the other hand, GSA-SNP reported 94 and 38 hits, respectively, for both datasets. Similar, but to a lesser degree, trends were observed with Kyoto Encyclopedia of Genes and Genomes (KEGG) gene sets as well. The huge number of hits by i-GSEA4GWAS for the imputed dataset was probably an artifact due to the scaling step in the algorithm. The decrease in hits by GSA-SNP for the imputed dataset may be due to the fact that it relies on Z-statistics, which is sensitive to variations in the background level of associations. Judicious evaluation of the GSA outcomes, perhaps based on multiple programs, is recommended. Korea Genome Organization 2012-06 2012-06-30 /pmc/articles/PMC3480679/ /pubmed/23105940 http://dx.doi.org/10.5808/GI.2012.10.2.123 Text en Copyright © 2012 by The Korea Genome Organization http://creativecommons.org/licenses/by-nc/3.0 It is identical to the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/). |
spellingShingle | Article Kwon, Ji-sun Kim, Jihye Nam, Dougu Kim, Sangsoo Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS |
title | Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS |
title_full | Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS |
title_fullStr | Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS |
title_full_unstemmed | Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS |
title_short | Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS |
title_sort | performance comparison of two gene set analysis methods for genome-wide association study results: gsa-snp vs i-gsea4gwas |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3480679/ https://www.ncbi.nlm.nih.gov/pubmed/23105940 http://dx.doi.org/10.5808/GI.2012.10.2.123 |
work_keys_str_mv | AT kwonjisun performancecomparisonoftwogenesetanalysismethodsforgenomewideassociationstudyresultsgsasnpvsigsea4gwas AT kimjihye performancecomparisonoftwogenesetanalysismethodsforgenomewideassociationstudyresultsgsasnpvsigsea4gwas AT namdougu performancecomparisonoftwogenesetanalysismethodsforgenomewideassociationstudyresultsgsasnpvsigsea4gwas AT kimsangsoo performancecomparisonoftwogenesetanalysismethodsforgenomewideassociationstudyresultsgsasnpvsigsea4gwas |