Cargando…

Learning the kernel for rare variant genetic association test

Introduction: Compared to Genome-Wide Association Studies (GWAS) for common variants, single-marker association analysis for rare variants is underpowered. Set-based association analyses for rare variants are powerful tools that capture some of the missing heritability in trait association studies....

Descripción completa

Detalles Bibliográficos
Autores principales: Falk, Isak, Zhao, Millie, Nait Saada, Juba, Guo, Qi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10598548/
https://www.ncbi.nlm.nih.gov/pubmed/37886683
http://dx.doi.org/10.3389/fgene.2023.1245238
_version_ 1785125578195599360
author Falk, Isak
Zhao, Millie
Nait Saada, Juba
Guo, Qi
author_facet Falk, Isak
Zhao, Millie
Nait Saada, Juba
Guo, Qi
author_sort Falk, Isak
collection PubMed
description Introduction: Compared to Genome-Wide Association Studies (GWAS) for common variants, single-marker association analysis for rare variants is underpowered. Set-based association analyses for rare variants are powerful tools that capture some of the missing heritability in trait association studies. Methods: We extend the convex-optimized SKAT (cSKAT) test set procedure which learns from data the optimal convex combination of kernels, to the full Generalised Linear Model (GLM) setting with arbitrary non-genetic covariates. We call this extended cSKAT (ecSKAT) and show that the resulting optimization problem is a quadratic programming problem that can be solved with no additional cost compared to cSKAT. Results: We show that a modified objective is related to an upper bound for the p-value through a decreasing exponential term in the objective function, indicating that optimizing this objective function is a principled way of learning the combination of kernels. We evaluate the performance of the proposed method on continuous and binary traits using simulation studies and illustrate its application using UK Biobank Whole Exome Sequencing data on hand grip strength and systemic lupus erythematosus rare variant association analysis. Discussion: Our proposed ecSKAT method enables correcting for important confounders in association studies such as age, sex or population structure for both quantitative and binary traits. Simulation studies showed that ecSKAT can recover sensible weights and achieve higher power across different sample sizes and misspecification settings. Compared to the burden test and SKAT method, ecSKAT gives a lower p-value for the genes tested in both quantitative and binary traits in the UKBiobank cohort.
format Online
Article
Text
id pubmed-10598548
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-105985482023-10-26 Learning the kernel for rare variant genetic association test Falk, Isak Zhao, Millie Nait Saada, Juba Guo, Qi Front Genet Genetics Introduction: Compared to Genome-Wide Association Studies (GWAS) for common variants, single-marker association analysis for rare variants is underpowered. Set-based association analyses for rare variants are powerful tools that capture some of the missing heritability in trait association studies. Methods: We extend the convex-optimized SKAT (cSKAT) test set procedure which learns from data the optimal convex combination of kernels, to the full Generalised Linear Model (GLM) setting with arbitrary non-genetic covariates. We call this extended cSKAT (ecSKAT) and show that the resulting optimization problem is a quadratic programming problem that can be solved with no additional cost compared to cSKAT. Results: We show that a modified objective is related to an upper bound for the p-value through a decreasing exponential term in the objective function, indicating that optimizing this objective function is a principled way of learning the combination of kernels. We evaluate the performance of the proposed method on continuous and binary traits using simulation studies and illustrate its application using UK Biobank Whole Exome Sequencing data on hand grip strength and systemic lupus erythematosus rare variant association analysis. Discussion: Our proposed ecSKAT method enables correcting for important confounders in association studies such as age, sex or population structure for both quantitative and binary traits. Simulation studies showed that ecSKAT can recover sensible weights and achieve higher power across different sample sizes and misspecification settings. Compared to the burden test and SKAT method, ecSKAT gives a lower p-value for the genes tested in both quantitative and binary traits in the UKBiobank cohort. Frontiers Media S.A. 2023-10-09 /pmc/articles/PMC10598548/ /pubmed/37886683 http://dx.doi.org/10.3389/fgene.2023.1245238 Text en Copyright © 2023 Falk, Zhao, Nait Saada and Guo. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Falk, Isak
Zhao, Millie
Nait Saada, Juba
Guo, Qi
Learning the kernel for rare variant genetic association test
title Learning the kernel for rare variant genetic association test
title_full Learning the kernel for rare variant genetic association test
title_fullStr Learning the kernel for rare variant genetic association test
title_full_unstemmed Learning the kernel for rare variant genetic association test
title_short Learning the kernel for rare variant genetic association test
title_sort learning the kernel for rare variant genetic association test
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10598548/
https://www.ncbi.nlm.nih.gov/pubmed/37886683
http://dx.doi.org/10.3389/fgene.2023.1245238
work_keys_str_mv AT falkisak learningthekernelforrarevariantgeneticassociationtest
AT zhaomillie learningthekernelforrarevariantgeneticassociationtest
AT naitsaadajuba learningthekernelforrarevariantgeneticassociationtest
AT guoqi learningthekernelforrarevariantgeneticassociationtest