Cargando…

Weighted SNP Set Analysis in Genome-Wide Association Study

Genome-wide association studies (GWAS) are popular for identifying genetic variants which are associated with disease risk. Many approaches have been proposed to test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously which considering disadvantages of methods in single locus...

Descripción completa

Detalles Bibliográficos
Autores principales: Dai, Hui, Zhao, Yang, Qian, Cheng, Cai, Min, Zhang, Ruyang, Chu, Minjie, Dai, Juncheng, Hu, Zhibin, Shen, Hongbing, Chen, Feng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3786949/
https://www.ncbi.nlm.nih.gov/pubmed/24098741
http://dx.doi.org/10.1371/journal.pone.0075897
_version_ 1782477804846186496
author Dai, Hui
Zhao, Yang
Qian, Cheng
Cai, Min
Zhang, Ruyang
Chu, Minjie
Dai, Juncheng
Hu, Zhibin
Shen, Hongbing
Chen, Feng
author_facet Dai, Hui
Zhao, Yang
Qian, Cheng
Cai, Min
Zhang, Ruyang
Chu, Minjie
Dai, Juncheng
Hu, Zhibin
Shen, Hongbing
Chen, Feng
author_sort Dai, Hui
collection PubMed
description Genome-wide association studies (GWAS) are popular for identifying genetic variants which are associated with disease risk. Many approaches have been proposed to test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously which considering disadvantages of methods in single locus association analysis. Kernel machine based SNP set analysis is more powerful than single locus analysis, which borrows information from SNPs correlated with causal or tag SNPs. Four types of kernel machine functions and principal component based approach (PCA) were also compared. However, given the loss of power caused by low minor allele frequencies (MAF), we conducted an extension work on PCA and used a new method called weighted PCA (wPCA). Comparative analysis was performed for weighted principal component analysis (wPCA), logistic kernel machine based test (LKM) and principal component analysis (PCA) based on SNP set in the case of different minor allele frequencies (MAF) and linkage disequilibrium (LD) structures. We also applied the three methods to analyze two SNP sets extracted from a real GWAS dataset of non-small cell lung cancer in Han Chinese population. Simulation results show that when the MAF of the causal SNP is low, weighted principal component and weighted IBS are more powerful than PCA and other kernel machine functions at different LD structures and different numbers of causal SNPs. Application of the three methods to a real GWAS dataset indicates that wPCA and wIBS have better performance than the linear kernel, IBS kernel and PCA.
format Online
Article
Text
id pubmed-3786949
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37869492013-10-04 Weighted SNP Set Analysis in Genome-Wide Association Study Dai, Hui Zhao, Yang Qian, Cheng Cai, Min Zhang, Ruyang Chu, Minjie Dai, Juncheng Hu, Zhibin Shen, Hongbing Chen, Feng PLoS One Research Article Genome-wide association studies (GWAS) are popular for identifying genetic variants which are associated with disease risk. Many approaches have been proposed to test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously which considering disadvantages of methods in single locus association analysis. Kernel machine based SNP set analysis is more powerful than single locus analysis, which borrows information from SNPs correlated with causal or tag SNPs. Four types of kernel machine functions and principal component based approach (PCA) were also compared. However, given the loss of power caused by low minor allele frequencies (MAF), we conducted an extension work on PCA and used a new method called weighted PCA (wPCA). Comparative analysis was performed for weighted principal component analysis (wPCA), logistic kernel machine based test (LKM) and principal component analysis (PCA) based on SNP set in the case of different minor allele frequencies (MAF) and linkage disequilibrium (LD) structures. We also applied the three methods to analyze two SNP sets extracted from a real GWAS dataset of non-small cell lung cancer in Han Chinese population. Simulation results show that when the MAF of the causal SNP is low, weighted principal component and weighted IBS are more powerful than PCA and other kernel machine functions at different LD structures and different numbers of causal SNPs. Application of the three methods to a real GWAS dataset indicates that wPCA and wIBS have better performance than the linear kernel, IBS kernel and PCA. Public Library of Science 2013-09-30 /pmc/articles/PMC3786949/ /pubmed/24098741 http://dx.doi.org/10.1371/journal.pone.0075897 Text en © 2013 Dai et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Dai, Hui
Zhao, Yang
Qian, Cheng
Cai, Min
Zhang, Ruyang
Chu, Minjie
Dai, Juncheng
Hu, Zhibin
Shen, Hongbing
Chen, Feng
Weighted SNP Set Analysis in Genome-Wide Association Study
title Weighted SNP Set Analysis in Genome-Wide Association Study
title_full Weighted SNP Set Analysis in Genome-Wide Association Study
title_fullStr Weighted SNP Set Analysis in Genome-Wide Association Study
title_full_unstemmed Weighted SNP Set Analysis in Genome-Wide Association Study
title_short Weighted SNP Set Analysis in Genome-Wide Association Study
title_sort weighted snp set analysis in genome-wide association study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3786949/
https://www.ncbi.nlm.nih.gov/pubmed/24098741
http://dx.doi.org/10.1371/journal.pone.0075897
work_keys_str_mv AT daihui weightedsnpsetanalysisingenomewideassociationstudy
AT zhaoyang weightedsnpsetanalysisingenomewideassociationstudy
AT qiancheng weightedsnpsetanalysisingenomewideassociationstudy
AT caimin weightedsnpsetanalysisingenomewideassociationstudy
AT zhangruyang weightedsnpsetanalysisingenomewideassociationstudy
AT chuminjie weightedsnpsetanalysisingenomewideassociationstudy
AT daijuncheng weightedsnpsetanalysisingenomewideassociationstudy
AT huzhibin weightedsnpsetanalysisingenomewideassociationstudy
AT shenhongbing weightedsnpsetanalysisingenomewideassociationstudy
AT chenfeng weightedsnpsetanalysisingenomewideassociationstudy