Cargando…

Resampling-based tests for Lasso in genome-wide association studies

BACKGROUND: Genome-wide association studies involve detecting association between millions of genetic variants and a trait, which typically use univariate regression to test association between each single variant and the phenotype. Alternatively, Lasso penalized regression allows one to jointly mod...

Descripción completa

Detalles Bibliográficos
Autores principales: Arbet, Jaron, McGue, Matt, Chatterjee, Snigdhansu, Basu, Saonli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5525347/
https://www.ncbi.nlm.nih.gov/pubmed/28738830
http://dx.doi.org/10.1186/s12863-017-0533-3
_version_ 1783252627775029248
author Arbet, Jaron
McGue, Matt
Chatterjee, Snigdhansu
Basu, Saonli
author_facet Arbet, Jaron
McGue, Matt
Chatterjee, Snigdhansu
Basu, Saonli
author_sort Arbet, Jaron
collection PubMed
description BACKGROUND: Genome-wide association studies involve detecting association between millions of genetic variants and a trait, which typically use univariate regression to test association between each single variant and the phenotype. Alternatively, Lasso penalized regression allows one to jointly model the relationship between all genetic variants and the phenotype. However, it is unclear how to best conduct inference on the individual Lasso coefficients, especially in high-dimensional settings. METHODS: We consider six methods for testing the Lasso coefficients: two permutation (Lasso-Ayers, Lasso-PL) and one analytic approach (Lasso-AL) to select the penalty parameter for type-1-error control, residual bootstrap (Lasso-RB), modified residual bootstrap (Lasso-MRB), and a permutation test (Lasso-PT). Methods are compared via simulations and application to the Minnesota Center for Twins and Family Study. RESULTS: We show that for finite sample sizes with increasing number of null predictors, Lasso-RB, Lasso-MRB, and Lasso-PT fail to be viable methods of inference. However, Lasso-PL and Lasso-AL remain fast and powerful tools for conducting inference with the Lasso, even in high-dimensions. CONCLUSION: Our results suggest that the proposed permutation selection procedure (Lasso-PL) and the analytic selection method (Lasso-AL) are fast and powerful alternatives to the standard univariate analysis in genome-wide association studies.
format Online
Article
Text
id pubmed-5525347
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55253472017-07-26 Resampling-based tests for Lasso in genome-wide association studies Arbet, Jaron McGue, Matt Chatterjee, Snigdhansu Basu, Saonli BMC Genet Methodology Article BACKGROUND: Genome-wide association studies involve detecting association between millions of genetic variants and a trait, which typically use univariate regression to test association between each single variant and the phenotype. Alternatively, Lasso penalized regression allows one to jointly model the relationship between all genetic variants and the phenotype. However, it is unclear how to best conduct inference on the individual Lasso coefficients, especially in high-dimensional settings. METHODS: We consider six methods for testing the Lasso coefficients: two permutation (Lasso-Ayers, Lasso-PL) and one analytic approach (Lasso-AL) to select the penalty parameter for type-1-error control, residual bootstrap (Lasso-RB), modified residual bootstrap (Lasso-MRB), and a permutation test (Lasso-PT). Methods are compared via simulations and application to the Minnesota Center for Twins and Family Study. RESULTS: We show that for finite sample sizes with increasing number of null predictors, Lasso-RB, Lasso-MRB, and Lasso-PT fail to be viable methods of inference. However, Lasso-PL and Lasso-AL remain fast and powerful tools for conducting inference with the Lasso, even in high-dimensions. CONCLUSION: Our results suggest that the proposed permutation selection procedure (Lasso-PL) and the analytic selection method (Lasso-AL) are fast and powerful alternatives to the standard univariate analysis in genome-wide association studies. BioMed Central 2017-07-24 /pmc/articles/PMC5525347/ /pubmed/28738830 http://dx.doi.org/10.1186/s12863-017-0533-3 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Arbet, Jaron
McGue, Matt
Chatterjee, Snigdhansu
Basu, Saonli
Resampling-based tests for Lasso in genome-wide association studies
title Resampling-based tests for Lasso in genome-wide association studies
title_full Resampling-based tests for Lasso in genome-wide association studies
title_fullStr Resampling-based tests for Lasso in genome-wide association studies
title_full_unstemmed Resampling-based tests for Lasso in genome-wide association studies
title_short Resampling-based tests for Lasso in genome-wide association studies
title_sort resampling-based tests for lasso in genome-wide association studies
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5525347/
https://www.ncbi.nlm.nih.gov/pubmed/28738830
http://dx.doi.org/10.1186/s12863-017-0533-3
work_keys_str_mv AT arbetjaron resamplingbasedtestsforlassoingenomewideassociationstudies
AT mcguematt resamplingbasedtestsforlassoingenomewideassociationstudies
AT chatterjeesnigdhansu resamplingbasedtestsforlassoingenomewideassociationstudies
AT basusaonli resamplingbasedtestsforlassoingenomewideassociationstudies