Identification of Grouped Rare and Common Variants via Penalized Logistic Regression

In spite of the success of genome-wide association studies in finding many common variants associated with disease, these variants seem to explain only a small proportion of the estimated heritability. Data collection has turned toward exome and whole genome sequencing, but it is well known that sin...

Bibliographic Details
Main authors: Ayers, Kristin L, Cordell, Heather J
Format: Online Article Text
Language: English
Published: Blackwell Publishing Ltd 2013
Subjects: Research Articles
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3842118/
https://www.ncbi.nlm.nih.gov/pubmed/23836590
http://dx.doi.org/10.1002/gepi.21746
author Ayers, Kristin L
Cordell, Heather J
collection PubMed
description In spite of the success of genome-wide association studies in finding many common variants associated with disease, these variants seem to explain only a small proportion of the estimated heritability. Data collection has turned toward exome and whole genome sequencing, but it is well known that single marker methods frequently used for common variants have low power to detect rare variants associated with disease, even with very large sample sizes. In response, a variety of methods have been developed that attempt to cluster rare variants so that they may gather strength from one another under the premise that there may be multiple causal variants within a gene. Most of these methods group variants by gene or proximity, and test one gene or marker window at a time. We propose a penalized regression method (PeRC) that analyzes all genes at once, allowing grouping of all (rare and common) variants within a gene, along with subgrouping of the rare variants, thus borrowing strength from both rare and common variants within the same gene. The method can incorporate either a burden-based weighting of the rare variants or one in which the weights are data driven. In simulations, our method performs favorably when compared to many previously proposed approaches, including its predecessor, the sparse group lasso [Friedman et al., 2010].
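The description names the sparse group lasso as the predecessor of PeRC. As a rough illustration of that kind of penalty (not the authors' PeRC implementation, whose exact form is not given in this record), the sketch below evaluates a logistic loss with a gene-level group term plus a per-variant L1 term. The function name, the argument names, and the square-root-of-group-size weighting are assumptions made for illustration only.

```python
import numpy as np

def sgl_penalized_logistic_loss(X, y, beta, groups, lam_group, lam_l1):
    """Mean logistic negative log-likelihood plus a sparse group lasso penalty.

    X: (n_samples, n_variants) genotype matrix.
    y: (n_samples,) array of 0/1 disease status.
    beta: (n_variants,) coefficient vector.
    groups: (n_variants,) integer gene label per variant (hypothetical encoding).
    """
    z = X @ beta
    # Numerically stable log(1 + exp(z)) - y * z, averaged over samples.
    nll = np.mean(np.logaddexp(0.0, z) - y * z)
    # Group term: sqrt(group size) times the Euclidean norm of each gene's coefficients.
    group_term = sum(
        np.sqrt(np.sum(groups == g)) * np.linalg.norm(beta[groups == g])
        for g in np.unique(groups)
    )
    # L1 term: encourages sparsity among individual variants within a gene.
    l1_term = np.sum(np.abs(beta))
    return nll + lam_group * group_term + lam_l1 * l1_term
```

With lam_group set to zero this reduces to an ordinary lasso-penalized logistic loss, and with lam_l1 set to zero to a group lasso. The combination is what lets whole genes drop out while still selecting individual variants inside the genes that remain.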
format Online
Article
Text
id pubmed-3842118
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Blackwell Publishing Ltd
record_format MEDLINE/PubMed
spelling pubmed-3842118 2013-12-02 Identification of Grouped Rare and Common Variants via Penalized Logistic Regression Ayers, Kristin L Cordell, Heather J Genet Epidemiol Research Articles Blackwell Publishing Ltd 2013-09 2013-07-08 /pmc/articles/PMC3842118/ /pubmed/23836590 http://dx.doi.org/10.1002/gepi.21746 Text en © 2013 WILEY PERIODICALS, INC. http://creativecommons.org/licenses/by-nc-nd/3.0/ This is an open access article under the terms of the Creative Commons License, which permits use and distribution in any medium, provided the original work is properly cited.
title Identification of Grouped Rare and Common Variants via Penalized Logistic Regression
topic Research Articles