Cargando…
A gene based combination test using GWAS summary data
BACKGROUND: Gene-based association tests provide a useful alternative and complement to the usual single marker association tests, especially in genome-wide association studies (GWAS). The way of weighting for variants in a gene plays an important role in boosting the power of a gene-based associati...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9811798/ https://www.ncbi.nlm.nih.gov/pubmed/36597047 http://dx.doi.org/10.1186/s12859-022-05114-x |
_version_ | 1784863601187618816 |
---|---|
author | Zhang, Jianjun Liang, Xiaoyu Gonzales, Samantha Liu, Jianguo Gao, Xiaoyi Raymond Wang, Xuexia |
author_facet | Zhang, Jianjun Liang, Xiaoyu Gonzales, Samantha Liu, Jianguo Gao, Xiaoyi Raymond Wang, Xuexia |
author_sort | Zhang, Jianjun |
collection | PubMed |
description | BACKGROUND: Gene-based association tests provide a useful alternative and complement to the usual single marker association tests, especially in genome-wide association studies (GWAS). The way of weighting for variants in a gene plays an important role in boosting the power of a gene-based association test. Appropriate weights can boost statistical power, especially when detecting genetic variants with weak effects on a trait. One major limitation of existing gene-based association tests lies in using weights that are predetermined biologically or empirically. This limitation often attenuates the power of a test. On another hand, effect sizes or directions of causal genetic variants in real data are usually unknown, driving a need for a flexible yet robust methodology of gene based association tests. Furthermore, access to individual-level data is often limited, while thousands of GWAS summary data are publicly and freely available. RESULTS: To resolve these limitations, we propose a combination test named as OWC which is based on summary statistics from GWAS data. Several traditional methods including burden test, weighted sum of squared score test [SSU], weighted sum statistic [WSS], SNP-set Kernel Association Test [SKAT], and the score test are special cases of OWC. To evaluate the performance of OWC, we perform extensive simulation studies. Results of simulation studies demonstrate that OWC outperforms several existing popular methods. We further show that OWC outperforms comparison methods in real-world data analyses using schizophrenia GWAS summary data and a fasting glucose GWAS meta-analysis data. The proposed method is implemented in an R package available at https://github.com/Xuexia-Wang/OWC-R-package CONCLUSIONS: We propose a novel gene-based association test that incorporates four different weighting schemes (two constant weights and two weights proportional to normal statistic Z) and includes several popular methods as its special cases. Results of the simulation studies and real data analyses illustrate that the proposed test, OWC, outperforms comparable methods in most scenarios. These results demonstrate that OWC is a useful tool that adapts to the underlying biological model for a disease by weighting appropriately genetic variants and combination of well-known gene-based tests. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-05114-x. |
format | Online Article Text |
id | pubmed-9811798 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-98117982023-01-05 A gene based combination test using GWAS summary data Zhang, Jianjun Liang, Xiaoyu Gonzales, Samantha Liu, Jianguo Gao, Xiaoyi Raymond Wang, Xuexia BMC Bioinformatics Research BACKGROUND: Gene-based association tests provide a useful alternative and complement to the usual single marker association tests, especially in genome-wide association studies (GWAS). The way of weighting for variants in a gene plays an important role in boosting the power of a gene-based association test. Appropriate weights can boost statistical power, especially when detecting genetic variants with weak effects on a trait. One major limitation of existing gene-based association tests lies in using weights that are predetermined biologically or empirically. This limitation often attenuates the power of a test. On another hand, effect sizes or directions of causal genetic variants in real data are usually unknown, driving a need for a flexible yet robust methodology of gene based association tests. Furthermore, access to individual-level data is often limited, while thousands of GWAS summary data are publicly and freely available. RESULTS: To resolve these limitations, we propose a combination test named as OWC which is based on summary statistics from GWAS data. Several traditional methods including burden test, weighted sum of squared score test [SSU], weighted sum statistic [WSS], SNP-set Kernel Association Test [SKAT], and the score test are special cases of OWC. To evaluate the performance of OWC, we perform extensive simulation studies. Results of simulation studies demonstrate that OWC outperforms several existing popular methods. We further show that OWC outperforms comparison methods in real-world data analyses using schizophrenia GWAS summary data and a fasting glucose GWAS meta-analysis data. The proposed method is implemented in an R package available at https://github.com/Xuexia-Wang/OWC-R-package CONCLUSIONS: We propose a novel gene-based association test that incorporates four different weighting schemes (two constant weights and two weights proportional to normal statistic Z) and includes several popular methods as its special cases. Results of the simulation studies and real data analyses illustrate that the proposed test, OWC, outperforms comparable methods in most scenarios. These results demonstrate that OWC is a useful tool that adapts to the underlying biological model for a disease by weighting appropriately genetic variants and combination of well-known gene-based tests. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-05114-x. BioMed Central 2023-01-03 /pmc/articles/PMC9811798/ /pubmed/36597047 http://dx.doi.org/10.1186/s12859-022-05114-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Zhang, Jianjun Liang, Xiaoyu Gonzales, Samantha Liu, Jianguo Gao, Xiaoyi Raymond Wang, Xuexia A gene based combination test using GWAS summary data |
title | A gene based combination test using GWAS summary data |
title_full | A gene based combination test using GWAS summary data |
title_fullStr | A gene based combination test using GWAS summary data |
title_full_unstemmed | A gene based combination test using GWAS summary data |
title_short | A gene based combination test using GWAS summary data |
title_sort | gene based combination test using gwas summary data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9811798/ https://www.ncbi.nlm.nih.gov/pubmed/36597047 http://dx.doi.org/10.1186/s12859-022-05114-x |
work_keys_str_mv | AT zhangjianjun agenebasedcombinationtestusinggwassummarydata AT liangxiaoyu agenebasedcombinationtestusinggwassummarydata AT gonzalessamantha agenebasedcombinationtestusinggwassummarydata AT liujianguo agenebasedcombinationtestusinggwassummarydata AT gaoxiaoyiraymond agenebasedcombinationtestusinggwassummarydata AT wangxuexia agenebasedcombinationtestusinggwassummarydata AT zhangjianjun genebasedcombinationtestusinggwassummarydata AT liangxiaoyu genebasedcombinationtestusinggwassummarydata AT gonzalessamantha genebasedcombinationtestusinggwassummarydata AT liujianguo genebasedcombinationtestusinggwassummarydata AT gaoxiaoyiraymond genebasedcombinationtestusinggwassummarydata AT wangxuexia genebasedcombinationtestusinggwassummarydata |