Cargando…
Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data
Testing for association between multiple markers and a phenotype can not only capture untyped causal variants in weak linkage disequilibrium with nearby typed markers but also identify the effect of a combination of markers. We propose a sliding window approach that uses multimarker genotypes as var...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287934/ https://www.ncbi.nlm.nih.gov/pubmed/22373158 http://dx.doi.org/10.1186/1753-6561-5-S9-S92 |
_version_ | 1782224777055830016 |
---|---|
author | Ayers, Kristin L Mamasoula, Chrysovalanto Cordell, Heather J |
author_facet | Ayers, Kristin L Mamasoula, Chrysovalanto Cordell, Heather J |
author_sort | Ayers, Kristin L |
collection | PubMed |
description | Testing for association between multiple markers and a phenotype can not only capture untyped causal variants in weak linkage disequilibrium with nearby typed markers but also identify the effect of a combination of markers. We propose a sliding window approach that uses multimarker genotypes as variables in a penalized regression. We investigate a penalty with three separate components: (1) a group least absolute shrinkage and selection operator (LASSO) that selects multimarker genotypes in a gene to be included in or excluded from the model, (2) an allele-sharing penalty that encourages multimarker genotypes with similar alleles to have similar coefficients, and (3) a penalty that shrinks the size of coefficients while performing model selection. The penalized likelihood is minimized with a cyclic coordinate descent algorithm, allowing quick coefficient estimation for a large number of markers. We compare our method to single-marker analysis and a gene-based sparse group LASSO on the Genetic Analysis Workshop 17 data for quantitative trait Q2. We found that all of the methods were underpowered to detect the simulated rare causal variants at the low false-positive rates desired in association studies. However, the sparse group LASSO on multi-marker genotypes seems to provide some advantage over the sparse group LASSO applied to single SNPs within genes, giving further evidence that there may be an advantage to modeling combinations of rare variant alleles over modeling them individually. |
format | Online Article Text |
id | pubmed-3287934 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32879342012-02-28 Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data Ayers, Kristin L Mamasoula, Chrysovalanto Cordell, Heather J BMC Proc Proceedings Testing for association between multiple markers and a phenotype can not only capture untyped causal variants in weak linkage disequilibrium with nearby typed markers but also identify the effect of a combination of markers. We propose a sliding window approach that uses multimarker genotypes as variables in a penalized regression. We investigate a penalty with three separate components: (1) a group least absolute shrinkage and selection operator (LASSO) that selects multimarker genotypes in a gene to be included in or excluded from the model, (2) an allele-sharing penalty that encourages multimarker genotypes with similar alleles to have similar coefficients, and (3) a penalty that shrinks the size of coefficients while performing model selection. The penalized likelihood is minimized with a cyclic coordinate descent algorithm, allowing quick coefficient estimation for a large number of markers. We compare our method to single-marker analysis and a gene-based sparse group LASSO on the Genetic Analysis Workshop 17 data for quantitative trait Q2. We found that all of the methods were underpowered to detect the simulated rare causal variants at the low false-positive rates desired in association studies. However, the sparse group LASSO on multi-marker genotypes seems to provide some advantage over the sparse group LASSO applied to single SNPs within genes, giving further evidence that there may be an advantage to modeling combinations of rare variant alleles over modeling them individually. BioMed Central 2011-11-29 /pmc/articles/PMC3287934/ /pubmed/22373158 http://dx.doi.org/10.1186/1753-6561-5-S9-S92 Text en Copyright ©2011 Ayers et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Ayers, Kristin L Mamasoula, Chrysovalanto Cordell, Heather J Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data |
title | Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data |
title_full | Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data |
title_fullStr | Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data |
title_full_unstemmed | Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data |
title_short | Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data |
title_sort | penalized-regression-based multimarker genotype analysis of genetic analysis workshop 17 data |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287934/ https://www.ncbi.nlm.nih.gov/pubmed/22373158 http://dx.doi.org/10.1186/1753-6561-5-S9-S92 |
work_keys_str_mv | AT ayerskristinl penalizedregressionbasedmultimarkergenotypeanalysisofgeneticanalysisworkshop17data AT mamasoulachrysovalanto penalizedregressionbasedmultimarkergenotypeanalysisofgeneticanalysisworkshop17data AT cordellheatherj penalizedregressionbasedmultimarkergenotypeanalysisofgeneticanalysisworkshop17data |