Cargando…

Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data

Testing for association between multiple markers and a phenotype can not only capture untyped causal variants in weak linkage disequilibrium with nearby typed markers but also identify the effect of a combination of markers. We propose a sliding window approach that uses multimarker genotypes as var...

Descripción completa

Detalles Bibliográficos
Autores principales: Ayers, Kristin L, Mamasoula, Chrysovalanto, Cordell, Heather J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287934/
https://www.ncbi.nlm.nih.gov/pubmed/22373158
http://dx.doi.org/10.1186/1753-6561-5-S9-S92
_version_ 1782224777055830016
author Ayers, Kristin L
Mamasoula, Chrysovalanto
Cordell, Heather J
author_facet Ayers, Kristin L
Mamasoula, Chrysovalanto
Cordell, Heather J
author_sort Ayers, Kristin L
collection PubMed
description Testing for association between multiple markers and a phenotype can not only capture untyped causal variants in weak linkage disequilibrium with nearby typed markers but also identify the effect of a combination of markers. We propose a sliding window approach that uses multimarker genotypes as variables in a penalized regression. We investigate a penalty with three separate components: (1) a group least absolute shrinkage and selection operator (LASSO) that selects multimarker genotypes in a gene to be included in or excluded from the model, (2) an allele-sharing penalty that encourages multimarker genotypes with similar alleles to have similar coefficients, and (3) a penalty that shrinks the size of coefficients while performing model selection. The penalized likelihood is minimized with a cyclic coordinate descent algorithm, allowing quick coefficient estimation for a large number of markers. We compare our method to single-marker analysis and a gene-based sparse group LASSO on the Genetic Analysis Workshop 17 data for quantitative trait Q2. We found that all of the methods were underpowered to detect the simulated rare causal variants at the low false-positive rates desired in association studies. However, the sparse group LASSO on multi-marker genotypes seems to provide some advantage over the sparse group LASSO applied to single SNPs within genes, giving further evidence that there may be an advantage to modeling combinations of rare variant alleles over modeling them individually.
format Online
Article
Text
id pubmed-3287934
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32879342012-02-28 Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data Ayers, Kristin L Mamasoula, Chrysovalanto Cordell, Heather J BMC Proc Proceedings Testing for association between multiple markers and a phenotype can not only capture untyped causal variants in weak linkage disequilibrium with nearby typed markers but also identify the effect of a combination of markers. We propose a sliding window approach that uses multimarker genotypes as variables in a penalized regression. We investigate a penalty with three separate components: (1) a group least absolute shrinkage and selection operator (LASSO) that selects multimarker genotypes in a gene to be included in or excluded from the model, (2) an allele-sharing penalty that encourages multimarker genotypes with similar alleles to have similar coefficients, and (3) a penalty that shrinks the size of coefficients while performing model selection. The penalized likelihood is minimized with a cyclic coordinate descent algorithm, allowing quick coefficient estimation for a large number of markers. We compare our method to single-marker analysis and a gene-based sparse group LASSO on the Genetic Analysis Workshop 17 data for quantitative trait Q2. We found that all of the methods were underpowered to detect the simulated rare causal variants at the low false-positive rates desired in association studies. However, the sparse group LASSO on multi-marker genotypes seems to provide some advantage over the sparse group LASSO applied to single SNPs within genes, giving further evidence that there may be an advantage to modeling combinations of rare variant alleles over modeling them individually. BioMed Central 2011-11-29 /pmc/articles/PMC3287934/ /pubmed/22373158 http://dx.doi.org/10.1186/1753-6561-5-S9-S92 Text en Copyright ©2011 Ayers et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Ayers, Kristin L
Mamasoula, Chrysovalanto
Cordell, Heather J
Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data
title Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data
title_full Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data
title_fullStr Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data
title_full_unstemmed Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data
title_short Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data
title_sort penalized-regression-based multimarker genotype analysis of genetic analysis workshop 17 data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287934/
https://www.ncbi.nlm.nih.gov/pubmed/22373158
http://dx.doi.org/10.1186/1753-6561-5-S9-S92
work_keys_str_mv AT ayerskristinl penalizedregressionbasedmultimarkergenotypeanalysisofgeneticanalysisworkshop17data
AT mamasoulachrysovalanto penalizedregressionbasedmultimarkergenotypeanalysisofgeneticanalysisworkshop17data
AT cordellheatherj penalizedregressionbasedmultimarkergenotypeanalysisofgeneticanalysisworkshop17data