Cargando…
Learning the kernel for rare variant genetic association test
Introduction: Compared to Genome-Wide Association Studies (GWAS) for common variants, single-marker association analysis for rare variants is underpowered. Set-based association analyses for rare variants are powerful tools that capture some of the missing heritability in trait association studies....
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10598548/ https://www.ncbi.nlm.nih.gov/pubmed/37886683 http://dx.doi.org/10.3389/fgene.2023.1245238 |
_version_ | 1785125578195599360 |
---|---|
author | Falk, Isak Zhao, Millie Nait Saada, Juba Guo, Qi |
author_facet | Falk, Isak Zhao, Millie Nait Saada, Juba Guo, Qi |
author_sort | Falk, Isak |
collection | PubMed |
description | Introduction: Compared to Genome-Wide Association Studies (GWAS) for common variants, single-marker association analysis for rare variants is underpowered. Set-based association analyses for rare variants are powerful tools that capture some of the missing heritability in trait association studies. Methods: We extend the convex-optimized SKAT (cSKAT) test set procedure which learns from data the optimal convex combination of kernels, to the full Generalised Linear Model (GLM) setting with arbitrary non-genetic covariates. We call this extended cSKAT (ecSKAT) and show that the resulting optimization problem is a quadratic programming problem that can be solved with no additional cost compared to cSKAT. Results: We show that a modified objective is related to an upper bound for the p-value through a decreasing exponential term in the objective function, indicating that optimizing this objective function is a principled way of learning the combination of kernels. We evaluate the performance of the proposed method on continuous and binary traits using simulation studies and illustrate its application using UK Biobank Whole Exome Sequencing data on hand grip strength and systemic lupus erythematosus rare variant association analysis. Discussion: Our proposed ecSKAT method enables correcting for important confounders in association studies such as age, sex or population structure for both quantitative and binary traits. Simulation studies showed that ecSKAT can recover sensible weights and achieve higher power across different sample sizes and misspecification settings. Compared to the burden test and SKAT method, ecSKAT gives a lower p-value for the genes tested in both quantitative and binary traits in the UKBiobank cohort. |
format | Online Article Text |
id | pubmed-10598548 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-105985482023-10-26 Learning the kernel for rare variant genetic association test Falk, Isak Zhao, Millie Nait Saada, Juba Guo, Qi Front Genet Genetics Introduction: Compared to Genome-Wide Association Studies (GWAS) for common variants, single-marker association analysis for rare variants is underpowered. Set-based association analyses for rare variants are powerful tools that capture some of the missing heritability in trait association studies. Methods: We extend the convex-optimized SKAT (cSKAT) test set procedure which learns from data the optimal convex combination of kernels, to the full Generalised Linear Model (GLM) setting with arbitrary non-genetic covariates. We call this extended cSKAT (ecSKAT) and show that the resulting optimization problem is a quadratic programming problem that can be solved with no additional cost compared to cSKAT. Results: We show that a modified objective is related to an upper bound for the p-value through a decreasing exponential term in the objective function, indicating that optimizing this objective function is a principled way of learning the combination of kernels. We evaluate the performance of the proposed method on continuous and binary traits using simulation studies and illustrate its application using UK Biobank Whole Exome Sequencing data on hand grip strength and systemic lupus erythematosus rare variant association analysis. Discussion: Our proposed ecSKAT method enables correcting for important confounders in association studies such as age, sex or population structure for both quantitative and binary traits. Simulation studies showed that ecSKAT can recover sensible weights and achieve higher power across different sample sizes and misspecification settings. Compared to the burden test and SKAT method, ecSKAT gives a lower p-value for the genes tested in both quantitative and binary traits in the UKBiobank cohort. Frontiers Media S.A. 2023-10-09 /pmc/articles/PMC10598548/ /pubmed/37886683 http://dx.doi.org/10.3389/fgene.2023.1245238 Text en Copyright © 2023 Falk, Zhao, Nait Saada and Guo. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Falk, Isak Zhao, Millie Nait Saada, Juba Guo, Qi Learning the kernel for rare variant genetic association test |
title | Learning the kernel for rare variant genetic association test |
title_full | Learning the kernel for rare variant genetic association test |
title_fullStr | Learning the kernel for rare variant genetic association test |
title_full_unstemmed | Learning the kernel for rare variant genetic association test |
title_short | Learning the kernel for rare variant genetic association test |
title_sort | learning the kernel for rare variant genetic association test |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10598548/ https://www.ncbi.nlm.nih.gov/pubmed/37886683 http://dx.doi.org/10.3389/fgene.2023.1245238 |
work_keys_str_mv | AT falkisak learningthekernelforrarevariantgeneticassociationtest AT zhaomillie learningthekernelforrarevariantgeneticassociationtest AT naitsaadajuba learningthekernelforrarevariantgeneticassociationtest AT guoqi learningthekernelforrarevariantgeneticassociationtest |