Cargando…

A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families

BACKGROUND: Genome-wide association studies (GWAS) have become a common approach to identifying single nucleotide polymorphisms (SNPs) associated with complex diseases. As complex diseases are caused by the joint effects of multiple genes, while the effect of individual gene or SNP is modest, a meth...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yi-Ting, Sung, Pei-Yuan, Lin, Peng-Lin, Yu, Ya-Wen, Chung, Ren-Hua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4433014/
https://www.ncbi.nlm.nih.gov/pubmed/25975968
http://dx.doi.org/10.1186/s12864-015-1620-3
_version_ 1782371569433051136
author Wang, Yi-Ting
Sung, Pei-Yuan
Lin, Peng-Lin
Yu, Ya-Wen
Chung, Ren-Hua
author_facet Wang, Yi-Ting
Sung, Pei-Yuan
Lin, Peng-Lin
Yu, Ya-Wen
Chung, Ren-Hua
author_sort Wang, Yi-Ting
collection PubMed
description BACKGROUND: Genome-wide association studies (GWAS) have become a common approach to identifying single nucleotide polymorphisms (SNPs) associated with complex diseases. As complex diseases are caused by the joint effects of multiple genes, while the effect of individual gene or SNP is modest, a method considering the joint effects of multiple SNPs can be more powerful than testing individual SNPs. The multi-SNP analysis aims to test association based on a SNP set, usually defined based on biological knowledge such as gene or pathway, which may contain only a portion of SNPs with effects on the disease. Therefore, a challenge for the multi-SNP analysis is how to effectively select a subset of SNPs with promising association signals from the SNP set. RESULTS: We developed the Optimal P-value Threshold Pedigree Disequilibrium Test (OPTPDT). The OPTPDT uses general nuclear families. A variable p-value threshold algorithm is used to determine an optimal p-value threshold for selecting a subset of SNPs. A permutation procedure is used to assess the significance of the test. We used simulations to verify that the OPTPDT has correct type I error rates. Our power studies showed that the OPTPDT can be more powerful than the set-based test in PLINK, the multi-SNP FBAT test, and the p-value based test GATES. We applied the OPTPDT to a family-based autism GWAS dataset for gene-based association analysis and identified MACROD2-AS1 with genome-wide significance (p-value= 2.5 × 10(− 6)). CONCLUSIONS: Our simulation results suggested that the OPTPDT is a valid and powerful test. The OPTPDT will be helpful for gene-based or pathway association analysis. The method is ideal for the secondary analysis of existing GWAS datasets, which may identify a set of SNPs with joint effects on the disease. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1620-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4433014
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44330142015-05-16 A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families Wang, Yi-Ting Sung, Pei-Yuan Lin, Peng-Lin Yu, Ya-Wen Chung, Ren-Hua BMC Genomics Methodology Article BACKGROUND: Genome-wide association studies (GWAS) have become a common approach to identifying single nucleotide polymorphisms (SNPs) associated with complex diseases. As complex diseases are caused by the joint effects of multiple genes, while the effect of individual gene or SNP is modest, a method considering the joint effects of multiple SNPs can be more powerful than testing individual SNPs. The multi-SNP analysis aims to test association based on a SNP set, usually defined based on biological knowledge such as gene or pathway, which may contain only a portion of SNPs with effects on the disease. Therefore, a challenge for the multi-SNP analysis is how to effectively select a subset of SNPs with promising association signals from the SNP set. RESULTS: We developed the Optimal P-value Threshold Pedigree Disequilibrium Test (OPTPDT). The OPTPDT uses general nuclear families. A variable p-value threshold algorithm is used to determine an optimal p-value threshold for selecting a subset of SNPs. A permutation procedure is used to assess the significance of the test. We used simulations to verify that the OPTPDT has correct type I error rates. Our power studies showed that the OPTPDT can be more powerful than the set-based test in PLINK, the multi-SNP FBAT test, and the p-value based test GATES. We applied the OPTPDT to a family-based autism GWAS dataset for gene-based association analysis and identified MACROD2-AS1 with genome-wide significance (p-value= 2.5 × 10(− 6)). CONCLUSIONS: Our simulation results suggested that the OPTPDT is a valid and powerful test. The OPTPDT will be helpful for gene-based or pathway association analysis. The method is ideal for the secondary analysis of existing GWAS datasets, which may identify a set of SNPs with joint effects on the disease. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1620-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-05-15 /pmc/articles/PMC4433014/ /pubmed/25975968 http://dx.doi.org/10.1186/s12864-015-1620-3 Text en © Wang et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Wang, Yi-Ting
Sung, Pei-Yuan
Lin, Peng-Lin
Yu, Ya-Wen
Chung, Ren-Hua
A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families
title A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families
title_full A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families
title_fullStr A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families
title_full_unstemmed A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families
title_short A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families
title_sort multi-snp association test for complex diseases incorporating an optimal p-value threshold algorithm in nuclear families
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4433014/
https://www.ncbi.nlm.nih.gov/pubmed/25975968
http://dx.doi.org/10.1186/s12864-015-1620-3
work_keys_str_mv AT wangyiting amultisnpassociationtestforcomplexdiseasesincorporatinganoptimalpvaluethresholdalgorithminnuclearfamilies
AT sungpeiyuan amultisnpassociationtestforcomplexdiseasesincorporatinganoptimalpvaluethresholdalgorithminnuclearfamilies
AT linpenglin amultisnpassociationtestforcomplexdiseasesincorporatinganoptimalpvaluethresholdalgorithminnuclearfamilies
AT yuyawen amultisnpassociationtestforcomplexdiseasesincorporatinganoptimalpvaluethresholdalgorithminnuclearfamilies
AT chungrenhua amultisnpassociationtestforcomplexdiseasesincorporatinganoptimalpvaluethresholdalgorithminnuclearfamilies
AT wangyiting multisnpassociationtestforcomplexdiseasesincorporatinganoptimalpvaluethresholdalgorithminnuclearfamilies
AT sungpeiyuan multisnpassociationtestforcomplexdiseasesincorporatinganoptimalpvaluethresholdalgorithminnuclearfamilies
AT linpenglin multisnpassociationtestforcomplexdiseasesincorporatinganoptimalpvaluethresholdalgorithminnuclearfamilies
AT yuyawen multisnpassociationtestforcomplexdiseasesincorporatinganoptimalpvaluethresholdalgorithminnuclearfamilies
AT chungrenhua multisnpassociationtestforcomplexdiseasesincorporatinganoptimalpvaluethresholdalgorithminnuclearfamilies