Cargando…
Identifying rare and common variants with Bayesian variable selection
BACKGROUND: Recent advances in next-generation sequencing technologies have made it possible to generate large amounts of sequence data with rare variants in a cost-effective way. Yet, the statistical aspect of testing disease association of rare variants is quite challenging as the typical assumpti...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5133477/ https://www.ncbi.nlm.nih.gov/pubmed/27980665 http://dx.doi.org/10.1186/s12919-016-0059-0 |
_version_ | 1782471269958025216 |
---|---|
author | Oh, Cheongeun |
author_facet | Oh, Cheongeun |
author_sort | Oh, Cheongeun |
collection | PubMed |
description | BACKGROUND: Recent advances in next-generation sequencing technologies have made it possible to generate large amounts of sequence data with rare variants in a cost-effective way. Yet, the statistical aspect of testing disease association of rare variants is quite challenging as the typical assumptions fail to hold owing to low minor allele frequency (<0.5 or 1 %). METHODS: I present a Bayesian variable selection approach to detect associations with both rare and common genetic variants for quantitative traits simultaneously. In my model, I frame the problem of identifying disease-associated variants as a problem of variable selection in a sparse space, that is, how best to model the relationship between phenotypes and a set of genetic variants. By constructing a risk index score for a group of rare variants, my method can effectively consider all variants in a multivariate model. I also use a within-chain permutation to generate the empirical thresholds to detect true-positive variants. RESULTS: I apply our method to study the association between increases in baseline systolic and diastolic blood pressure (SBP and DBP, respectively) and genetic variants in the data from Genetic Analysis Workshop 19 unrelated samples. I identify several rare and common variants in the gene MAP4 that are potentially associated with SBP and DBP. CONCLUSIONS: The application shows that my method is powerful in identifying disease-associated variants even with the extreme rarity. |
format | Online Article Text |
id | pubmed-5133477 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-51334772016-12-15 Identifying rare and common variants with Bayesian variable selection Oh, Cheongeun BMC Proc Proceedings BACKGROUND: Recent advances in next-generation sequencing technologies have made it possible to generate large amounts of sequence data with rare variants in a cost-effective way. Yet, the statistical aspect of testing disease association of rare variants is quite challenging as the typical assumptions fail to hold owing to low minor allele frequency (<0.5 or 1 %). METHODS: I present a Bayesian variable selection approach to detect associations with both rare and common genetic variants for quantitative traits simultaneously. In my model, I frame the problem of identifying disease-associated variants as a problem of variable selection in a sparse space, that is, how best to model the relationship between phenotypes and a set of genetic variants. By constructing a risk index score for a group of rare variants, my method can effectively consider all variants in a multivariate model. I also use a within-chain permutation to generate the empirical thresholds to detect true-positive variants. RESULTS: I apply our method to study the association between increases in baseline systolic and diastolic blood pressure (SBP and DBP, respectively) and genetic variants in the data from Genetic Analysis Workshop 19 unrelated samples. I identify several rare and common variants in the gene MAP4 that are potentially associated with SBP and DBP. CONCLUSIONS: The application shows that my method is powerful in identifying disease-associated variants even with the extreme rarity. BioMed Central 2016-10-18 /pmc/articles/PMC5133477/ /pubmed/27980665 http://dx.doi.org/10.1186/s12919-016-0059-0 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Oh, Cheongeun Identifying rare and common variants with Bayesian variable selection |
title | Identifying rare and common variants with Bayesian variable selection |
title_full | Identifying rare and common variants with Bayesian variable selection |
title_fullStr | Identifying rare and common variants with Bayesian variable selection |
title_full_unstemmed | Identifying rare and common variants with Bayesian variable selection |
title_short | Identifying rare and common variants with Bayesian variable selection |
title_sort | identifying rare and common variants with bayesian variable selection |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5133477/ https://www.ncbi.nlm.nih.gov/pubmed/27980665 http://dx.doi.org/10.1186/s12919-016-0059-0 |
work_keys_str_mv | AT ohcheongeun identifyingrareandcommonvariantswithbayesianvariableselection |