Cargando…
Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits
Genome-Wide association studies (GWAS), based on testing one single nucleotide polymorphism (SNP) at a time, have revolutionized our understanding of the genetics of complex traits. In GWAS, there is a need to consider confounding effects such as due to population structure, and take groups of SNPs...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6469383/ https://www.ncbi.nlm.nih.gov/pubmed/31024614 http://dx.doi.org/10.3389/fgene.2019.00271 |
_version_ | 1783411633323769856 |
---|---|
author | Guo, Yingjie Wu, Chenxi Guo, Maozu Zou, Quan Liu, Xiaoyan Keinan, Alon |
author_facet | Guo, Yingjie Wu, Chenxi Guo, Maozu Zou, Quan Liu, Xiaoyan Keinan, Alon |
author_sort | Guo, Yingjie |
collection | PubMed |
description | Genome-Wide association studies (GWAS), based on testing one single nucleotide polymorphism (SNP) at a time, have revolutionized our understanding of the genetics of complex traits. In GWAS, there is a need to consider confounding effects such as due to population structure, and take groups of SNPs into account simultaneously due to the “polygenic” attribute of complex quantitative traits. In this paper, we propose a new approach SGL-LMM that puts together sparse group lasso (SGL) and linear mixed model (LMM) for multivariate associations of quantitative traits. LMM, as has been often used in GWAS, controls for confounders, while SGL maintains sparsity of the underlying multivariate regression model. SGL-LMM first sets a fixed zero effect to learn the parameters of random effects using LMM, and then estimates fixed effects using SGL regularization. We present efficient algorithms for hyperparameter tuning and feature selection using stability selection. While controlling for confounders and constraining for sparse solutions, SGL-LMM also provides a natural framework for incorporating prior biological information into the group structure underlying the model. Results based on both simulated and real data show SGL-LMM outperforms previous approaches in terms of power to detect associations and accuracy of quantitative trait prediction. |
format | Online Article Text |
id | pubmed-6469383 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-64693832019-04-25 Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits Guo, Yingjie Wu, Chenxi Guo, Maozu Zou, Quan Liu, Xiaoyan Keinan, Alon Front Genet Genetics Genome-Wide association studies (GWAS), based on testing one single nucleotide polymorphism (SNP) at a time, have revolutionized our understanding of the genetics of complex traits. In GWAS, there is a need to consider confounding effects such as due to population structure, and take groups of SNPs into account simultaneously due to the “polygenic” attribute of complex quantitative traits. In this paper, we propose a new approach SGL-LMM that puts together sparse group lasso (SGL) and linear mixed model (LMM) for multivariate associations of quantitative traits. LMM, as has been often used in GWAS, controls for confounders, while SGL maintains sparsity of the underlying multivariate regression model. SGL-LMM first sets a fixed zero effect to learn the parameters of random effects using LMM, and then estimates fixed effects using SGL regularization. We present efficient algorithms for hyperparameter tuning and feature selection using stability selection. While controlling for confounders and constraining for sparse solutions, SGL-LMM also provides a natural framework for incorporating prior biological information into the group structure underlying the model. Results based on both simulated and real data show SGL-LMM outperforms previous approaches in terms of power to detect associations and accuracy of quantitative trait prediction. Frontiers Media S.A. 2019-04-10 /pmc/articles/PMC6469383/ /pubmed/31024614 http://dx.doi.org/10.3389/fgene.2019.00271 Text en Copyright © 2019 Guo, Wu, Guo, Zou, Liu and Keinan. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Guo, Yingjie Wu, Chenxi Guo, Maozu Zou, Quan Liu, Xiaoyan Keinan, Alon Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits |
title | Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits |
title_full | Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits |
title_fullStr | Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits |
title_full_unstemmed | Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits |
title_short | Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits |
title_sort | combining sparse group lasso and linear mixed model improves power to detect genetic variants underlying quantitative traits |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6469383/ https://www.ncbi.nlm.nih.gov/pubmed/31024614 http://dx.doi.org/10.3389/fgene.2019.00271 |
work_keys_str_mv | AT guoyingjie combiningsparsegrouplassoandlinearmixedmodelimprovespowertodetectgeneticvariantsunderlyingquantitativetraits AT wuchenxi combiningsparsegrouplassoandlinearmixedmodelimprovespowertodetectgeneticvariantsunderlyingquantitativetraits AT guomaozu combiningsparsegrouplassoandlinearmixedmodelimprovespowertodetectgeneticvariantsunderlyingquantitativetraits AT zouquan combiningsparsegrouplassoandlinearmixedmodelimprovespowertodetectgeneticvariantsunderlyingquantitativetraits AT liuxiaoyan combiningsparsegrouplassoandlinearmixedmodelimprovespowertodetectgeneticvariantsunderlyingquantitativetraits AT keinanalon combiningsparsegrouplassoandlinearmixedmodelimprovespowertodetectgeneticvariantsunderlyingquantitativetraits |