Cargando…

Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits

Genome-Wide association studies (GWAS), based on testing one single nucleotide polymorphism (SNP) at a time, have revolutionized our understanding of the genetics of complex traits. In GWAS, there is a need to consider confounding effects such as due to population structure, and take groups of SNPs...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Yingjie, Wu, Chenxi, Guo, Maozu, Zou, Quan, Liu, Xiaoyan, Keinan, Alon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6469383/
https://www.ncbi.nlm.nih.gov/pubmed/31024614
http://dx.doi.org/10.3389/fgene.2019.00271
_version_ 1783411633323769856
author Guo, Yingjie
Wu, Chenxi
Guo, Maozu
Zou, Quan
Liu, Xiaoyan
Keinan, Alon
author_facet Guo, Yingjie
Wu, Chenxi
Guo, Maozu
Zou, Quan
Liu, Xiaoyan
Keinan, Alon
author_sort Guo, Yingjie
collection PubMed
description Genome-Wide association studies (GWAS), based on testing one single nucleotide polymorphism (SNP) at a time, have revolutionized our understanding of the genetics of complex traits. In GWAS, there is a need to consider confounding effects such as due to population structure, and take groups of SNPs into account simultaneously due to the “polygenic” attribute of complex quantitative traits. In this paper, we propose a new approach SGL-LMM that puts together sparse group lasso (SGL) and linear mixed model (LMM) for multivariate associations of quantitative traits. LMM, as has been often used in GWAS, controls for confounders, while SGL maintains sparsity of the underlying multivariate regression model. SGL-LMM first sets a fixed zero effect to learn the parameters of random effects using LMM, and then estimates fixed effects using SGL regularization. We present efficient algorithms for hyperparameter tuning and feature selection using stability selection. While controlling for confounders and constraining for sparse solutions, SGL-LMM also provides a natural framework for incorporating prior biological information into the group structure underlying the model. Results based on both simulated and real data show SGL-LMM outperforms previous approaches in terms of power to detect associations and accuracy of quantitative trait prediction.
format Online
Article
Text
id pubmed-6469383
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-64693832019-04-25 Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits Guo, Yingjie Wu, Chenxi Guo, Maozu Zou, Quan Liu, Xiaoyan Keinan, Alon Front Genet Genetics Genome-Wide association studies (GWAS), based on testing one single nucleotide polymorphism (SNP) at a time, have revolutionized our understanding of the genetics of complex traits. In GWAS, there is a need to consider confounding effects such as due to population structure, and take groups of SNPs into account simultaneously due to the “polygenic” attribute of complex quantitative traits. In this paper, we propose a new approach SGL-LMM that puts together sparse group lasso (SGL) and linear mixed model (LMM) for multivariate associations of quantitative traits. LMM, as has been often used in GWAS, controls for confounders, while SGL maintains sparsity of the underlying multivariate regression model. SGL-LMM first sets a fixed zero effect to learn the parameters of random effects using LMM, and then estimates fixed effects using SGL regularization. We present efficient algorithms for hyperparameter tuning and feature selection using stability selection. While controlling for confounders and constraining for sparse solutions, SGL-LMM also provides a natural framework for incorporating prior biological information into the group structure underlying the model. Results based on both simulated and real data show SGL-LMM outperforms previous approaches in terms of power to detect associations and accuracy of quantitative trait prediction. Frontiers Media S.A. 2019-04-10 /pmc/articles/PMC6469383/ /pubmed/31024614 http://dx.doi.org/10.3389/fgene.2019.00271 Text en Copyright © 2019 Guo, Wu, Guo, Zou, Liu and Keinan. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Guo, Yingjie
Wu, Chenxi
Guo, Maozu
Zou, Quan
Liu, Xiaoyan
Keinan, Alon
Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits
title Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits
title_full Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits
title_fullStr Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits
title_full_unstemmed Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits
title_short Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits
title_sort combining sparse group lasso and linear mixed model improves power to detect genetic variants underlying quantitative traits
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6469383/
https://www.ncbi.nlm.nih.gov/pubmed/31024614
http://dx.doi.org/10.3389/fgene.2019.00271
work_keys_str_mv AT guoyingjie combiningsparsegrouplassoandlinearmixedmodelimprovespowertodetectgeneticvariantsunderlyingquantitativetraits
AT wuchenxi combiningsparsegrouplassoandlinearmixedmodelimprovespowertodetectgeneticvariantsunderlyingquantitativetraits
AT guomaozu combiningsparsegrouplassoandlinearmixedmodelimprovespowertodetectgeneticvariantsunderlyingquantitativetraits
AT zouquan combiningsparsegrouplassoandlinearmixedmodelimprovespowertodetectgeneticvariantsunderlyingquantitativetraits
AT liuxiaoyan combiningsparsegrouplassoandlinearmixedmodelimprovespowertodetectgeneticvariantsunderlyingquantitativetraits
AT keinanalon combiningsparsegrouplassoandlinearmixedmodelimprovespowertodetectgeneticvariantsunderlyingquantitativetraits