Cargando…
Weighted SNP Set Analysis in Genome-Wide Association Study
Genome-wide association studies (GWAS) are popular for identifying genetic variants which are associated with disease risk. Many approaches have been proposed to test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously which considering disadvantages of methods in single locus...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3786949/ https://www.ncbi.nlm.nih.gov/pubmed/24098741 http://dx.doi.org/10.1371/journal.pone.0075897 |
_version_ | 1782477804846186496 |
---|---|
author | Dai, Hui Zhao, Yang Qian, Cheng Cai, Min Zhang, Ruyang Chu, Minjie Dai, Juncheng Hu, Zhibin Shen, Hongbing Chen, Feng |
author_facet | Dai, Hui Zhao, Yang Qian, Cheng Cai, Min Zhang, Ruyang Chu, Minjie Dai, Juncheng Hu, Zhibin Shen, Hongbing Chen, Feng |
author_sort | Dai, Hui |
collection | PubMed |
description | Genome-wide association studies (GWAS) are popular for identifying genetic variants which are associated with disease risk. Many approaches have been proposed to test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously which considering disadvantages of methods in single locus association analysis. Kernel machine based SNP set analysis is more powerful than single locus analysis, which borrows information from SNPs correlated with causal or tag SNPs. Four types of kernel machine functions and principal component based approach (PCA) were also compared. However, given the loss of power caused by low minor allele frequencies (MAF), we conducted an extension work on PCA and used a new method called weighted PCA (wPCA). Comparative analysis was performed for weighted principal component analysis (wPCA), logistic kernel machine based test (LKM) and principal component analysis (PCA) based on SNP set in the case of different minor allele frequencies (MAF) and linkage disequilibrium (LD) structures. We also applied the three methods to analyze two SNP sets extracted from a real GWAS dataset of non-small cell lung cancer in Han Chinese population. Simulation results show that when the MAF of the causal SNP is low, weighted principal component and weighted IBS are more powerful than PCA and other kernel machine functions at different LD structures and different numbers of causal SNPs. Application of the three methods to a real GWAS dataset indicates that wPCA and wIBS have better performance than the linear kernel, IBS kernel and PCA. |
format | Online Article Text |
id | pubmed-3786949 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-37869492013-10-04 Weighted SNP Set Analysis in Genome-Wide Association Study Dai, Hui Zhao, Yang Qian, Cheng Cai, Min Zhang, Ruyang Chu, Minjie Dai, Juncheng Hu, Zhibin Shen, Hongbing Chen, Feng PLoS One Research Article Genome-wide association studies (GWAS) are popular for identifying genetic variants which are associated with disease risk. Many approaches have been proposed to test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously which considering disadvantages of methods in single locus association analysis. Kernel machine based SNP set analysis is more powerful than single locus analysis, which borrows information from SNPs correlated with causal or tag SNPs. Four types of kernel machine functions and principal component based approach (PCA) were also compared. However, given the loss of power caused by low minor allele frequencies (MAF), we conducted an extension work on PCA and used a new method called weighted PCA (wPCA). Comparative analysis was performed for weighted principal component analysis (wPCA), logistic kernel machine based test (LKM) and principal component analysis (PCA) based on SNP set in the case of different minor allele frequencies (MAF) and linkage disequilibrium (LD) structures. We also applied the three methods to analyze two SNP sets extracted from a real GWAS dataset of non-small cell lung cancer in Han Chinese population. Simulation results show that when the MAF of the causal SNP is low, weighted principal component and weighted IBS are more powerful than PCA and other kernel machine functions at different LD structures and different numbers of causal SNPs. Application of the three methods to a real GWAS dataset indicates that wPCA and wIBS have better performance than the linear kernel, IBS kernel and PCA. Public Library of Science 2013-09-30 /pmc/articles/PMC3786949/ /pubmed/24098741 http://dx.doi.org/10.1371/journal.pone.0075897 Text en © 2013 Dai et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Dai, Hui Zhao, Yang Qian, Cheng Cai, Min Zhang, Ruyang Chu, Minjie Dai, Juncheng Hu, Zhibin Shen, Hongbing Chen, Feng Weighted SNP Set Analysis in Genome-Wide Association Study |
title | Weighted SNP Set Analysis in Genome-Wide Association Study |
title_full | Weighted SNP Set Analysis in Genome-Wide Association Study |
title_fullStr | Weighted SNP Set Analysis in Genome-Wide Association Study |
title_full_unstemmed | Weighted SNP Set Analysis in Genome-Wide Association Study |
title_short | Weighted SNP Set Analysis in Genome-Wide Association Study |
title_sort | weighted snp set analysis in genome-wide association study |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3786949/ https://www.ncbi.nlm.nih.gov/pubmed/24098741 http://dx.doi.org/10.1371/journal.pone.0075897 |
work_keys_str_mv | AT daihui weightedsnpsetanalysisingenomewideassociationstudy AT zhaoyang weightedsnpsetanalysisingenomewideassociationstudy AT qiancheng weightedsnpsetanalysisingenomewideassociationstudy AT caimin weightedsnpsetanalysisingenomewideassociationstudy AT zhangruyang weightedsnpsetanalysisingenomewideassociationstudy AT chuminjie weightedsnpsetanalysisingenomewideassociationstudy AT daijuncheng weightedsnpsetanalysisingenomewideassociationstudy AT huzhibin weightedsnpsetanalysisingenomewideassociationstudy AT shenhongbing weightedsnpsetanalysisingenomewideassociationstudy AT chenfeng weightedsnpsetanalysisingenomewideassociationstudy |