Cargando…

SNP-PRAGE: SNP-based parametric robust analysis of gene set enrichment

BACKGROUND: The current genome-wide association (GWA) analysis mainly focuses on the single genetic variant, which may not reveal some the genetic variants that have small individual effects but large joint effects. Considering the multiple SNPs jointly in Genome-wide association (GWA) analysis can...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Jaehoon, Ahn, Soyeon, Oh, Sohee, Weir, Bruce, Park, Taesung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287477/
https://www.ncbi.nlm.nih.gov/pubmed/22784568
http://dx.doi.org/10.1186/1752-0509-5-S2-S11
_version_ 1782224672083935232
author Lee, Jaehoon
Ahn, Soyeon
Oh, Sohee
Weir, Bruce
Park, Taesung
author_facet Lee, Jaehoon
Ahn, Soyeon
Oh, Sohee
Weir, Bruce
Park, Taesung
author_sort Lee, Jaehoon
collection PubMed
description BACKGROUND: The current genome-wide association (GWA) analysis mainly focuses on the single genetic variant, which may not reveal some the genetic variants that have small individual effects but large joint effects. Considering the multiple SNPs jointly in Genome-wide association (GWA) analysis can increase power. When multiple SNPs are jointly considered, the corresponding SNP-level association measures are likely to be correlated due to the linkage disequilibrium (LD) among SNPs. METHODS: We propose SNP-based parametric robust analysis of gene-set enrichment (SNP-PRAGE) method which handles correlation adequately among association measures of SNPs, and minimizes computing effort by the parametric assumption. SNP-PRAGE first obtains gene-level association measures from SNP-level association measures by incorporating the size of corresponding (or nearby) genes and the LD structure among SNPs. Afterward, SNP-PRAGE acquires the gene-set level summary of genes that undergo the same biological knowledge. This two-step summarization makes the within-set association measures to be independent from each other, and therefore the central limit theorem can be adequately applied for the parametric model. RESULTS & CONCLUSIONS: We applied SNP-PRAGE to two GWA data sets: hypertension data of 8,842 samples from the Korean population and bipolar disorder data of 4,806 samples from the Wellcome Trust Case Control Consortium (WTCCC). We found two enriched gene sets for hypertension and three enriched gene sets for bipolar disorder. By a simulation study, we compared our method to other gene set methods, and we found SNP-PRAGE reduced many false positives notably while requiring much less computational efforts than other permutation-based gene set approaches.
format Online
Article
Text
id pubmed-3287477
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32874772012-02-28 SNP-PRAGE: SNP-based parametric robust analysis of gene set enrichment Lee, Jaehoon Ahn, Soyeon Oh, Sohee Weir, Bruce Park, Taesung BMC Syst Biol Proceedings BACKGROUND: The current genome-wide association (GWA) analysis mainly focuses on the single genetic variant, which may not reveal some the genetic variants that have small individual effects but large joint effects. Considering the multiple SNPs jointly in Genome-wide association (GWA) analysis can increase power. When multiple SNPs are jointly considered, the corresponding SNP-level association measures are likely to be correlated due to the linkage disequilibrium (LD) among SNPs. METHODS: We propose SNP-based parametric robust analysis of gene-set enrichment (SNP-PRAGE) method which handles correlation adequately among association measures of SNPs, and minimizes computing effort by the parametric assumption. SNP-PRAGE first obtains gene-level association measures from SNP-level association measures by incorporating the size of corresponding (or nearby) genes and the LD structure among SNPs. Afterward, SNP-PRAGE acquires the gene-set level summary of genes that undergo the same biological knowledge. This two-step summarization makes the within-set association measures to be independent from each other, and therefore the central limit theorem can be adequately applied for the parametric model. RESULTS & CONCLUSIONS: We applied SNP-PRAGE to two GWA data sets: hypertension data of 8,842 samples from the Korean population and bipolar disorder data of 4,806 samples from the Wellcome Trust Case Control Consortium (WTCCC). We found two enriched gene sets for hypertension and three enriched gene sets for bipolar disorder. By a simulation study, we compared our method to other gene set methods, and we found SNP-PRAGE reduced many false positives notably while requiring much less computational efforts than other permutation-based gene set approaches. BioMed Central 2011-12-14 /pmc/articles/PMC3287477/ /pubmed/22784568 http://dx.doi.org/10.1186/1752-0509-5-S2-S11 Text en Copyright ©2011 Lee et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Lee, Jaehoon
Ahn, Soyeon
Oh, Sohee
Weir, Bruce
Park, Taesung
SNP-PRAGE: SNP-based parametric robust analysis of gene set enrichment
title SNP-PRAGE: SNP-based parametric robust analysis of gene set enrichment
title_full SNP-PRAGE: SNP-based parametric robust analysis of gene set enrichment
title_fullStr SNP-PRAGE: SNP-based parametric robust analysis of gene set enrichment
title_full_unstemmed SNP-PRAGE: SNP-based parametric robust analysis of gene set enrichment
title_short SNP-PRAGE: SNP-based parametric robust analysis of gene set enrichment
title_sort snp-prage: snp-based parametric robust analysis of gene set enrichment
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287477/
https://www.ncbi.nlm.nih.gov/pubmed/22784568
http://dx.doi.org/10.1186/1752-0509-5-S2-S11
work_keys_str_mv AT leejaehoon snppragesnpbasedparametricrobustanalysisofgenesetenrichment
AT ahnsoyeon snppragesnpbasedparametricrobustanalysisofgenesetenrichment
AT ohsohee snppragesnpbasedparametricrobustanalysisofgenesetenrichment
AT weirbruce snppragesnpbasedparametricrobustanalysisofgenesetenrichment
AT parktaesung snppragesnpbasedparametricrobustanalysisofgenesetenrichment