Cargando…
Identification of Grouped Rare and Common Variants via Penalized Logistic Regression
In spite of the success of genome-wide association studies in finding many common variants associated with disease, these variants seem to explain only a small proportion of the estimated heritability. Data collection has turned toward exome and whole genome sequencing, but it is well known that sin...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Blackwell Publishing Ltd
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3842118/ https://www.ncbi.nlm.nih.gov/pubmed/23836590 http://dx.doi.org/10.1002/gepi.21746 |
_version_ | 1782292894631067648 |
---|---|
author | Ayers, Kristin L Cordell, Heather J |
author_facet | Ayers, Kristin L Cordell, Heather J |
author_sort | Ayers, Kristin L |
collection | PubMed |
description | In spite of the success of genome-wide association studies in finding many common variants associated with disease, these variants seem to explain only a small proportion of the estimated heritability. Data collection has turned toward exome and whole genome sequencing, but it is well known that single marker methods frequently used for common variants have low power to detect rare variants associated with disease, even with very large sample sizes. In response, a variety of methods have been developed that attempt to cluster rare variants so that they may gather strength from one another under the premise that there may be multiple causal variants within a gene. Most of these methods group variants by gene or proximity, and test one gene or marker window at a time. We propose a penalized regression method (PeRC) that analyzes all genes at once, allowing grouping of all (rare and common) variants within a gene, along with subgrouping of the rare variants, thus borrowing strength from both rare and common variants within the same gene. The method can incorporate either a burden-based weighting of the rare variants or one in which the weights are data driven. In simulations, our method performs favorably when compared to many previously proposed approaches, including its predecessor, the sparse group lasso [Friedman et al., 2010]. |
format | Online Article Text |
id | pubmed-3842118 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Blackwell Publishing Ltd |
record_format | MEDLINE/PubMed |
spelling | pubmed-38421182013-12-02 Identification of Grouped Rare and Common Variants via Penalized Logistic Regression Ayers, Kristin L Cordell, Heather J Genet Epidemiol Research Articles In spite of the success of genome-wide association studies in finding many common variants associated with disease, these variants seem to explain only a small proportion of the estimated heritability. Data collection has turned toward exome and whole genome sequencing, but it is well known that single marker methods frequently used for common variants have low power to detect rare variants associated with disease, even with very large sample sizes. In response, a variety of methods have been developed that attempt to cluster rare variants so that they may gather strength from one another under the premise that there may be multiple causal variants within a gene. Most of these methods group variants by gene or proximity, and test one gene or marker window at a time. We propose a penalized regression method (PeRC) that analyzes all genes at once, allowing grouping of all (rare and common) variants within a gene, along with subgrouping of the rare variants, thus borrowing strength from both rare and common variants within the same gene. The method can incorporate either a burden-based weighting of the rare variants or one in which the weights are data driven. In simulations, our method performs favorably when compared to many previously proposed approaches, including its predecessor, the sparse group lasso [Friedman et al., 2010]. Blackwell Publishing Ltd 2013-09 2013-07-08 /pmc/articles/PMC3842118/ /pubmed/23836590 http://dx.doi.org/10.1002/gepi.21746 Text en © 2013 WILEY PERIODICALS, INC. http://creativecommons.org/licenses/by-nc-nd/3.0/ This is an open access article under the terms of the Creative Commons License, which permits use and distribution in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Ayers, Kristin L Cordell, Heather J Identification of Grouped Rare and Common Variants via Penalized Logistic Regression |
title | Identification of Grouped Rare and Common Variants via Penalized Logistic Regression |
title_full | Identification of Grouped Rare and Common Variants via Penalized Logistic Regression |
title_fullStr | Identification of Grouped Rare and Common Variants via Penalized Logistic Regression |
title_full_unstemmed | Identification of Grouped Rare and Common Variants via Penalized Logistic Regression |
title_short | Identification of Grouped Rare and Common Variants via Penalized Logistic Regression |
title_sort | identification of grouped rare and common variants via penalized logistic regression |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3842118/ https://www.ncbi.nlm.nih.gov/pubmed/23836590 http://dx.doi.org/10.1002/gepi.21746 |
work_keys_str_mv | AT ayerskristinl identificationofgroupedrareandcommonvariantsviapenalizedlogisticregression AT cordellheatherj identificationofgroupedrareandcommonvariantsviapenalizedlogisticregression |