Cargando…

Identifying rare and common variants with Bayesian variable selection

BACKGROUND: Recent advances in next-generation sequencing technologies have made it possible to generate large amounts of sequence data with rare variants in a cost-effective way. Yet, the statistical aspect of testing disease association of rare variants is quite challenging as the typical assumpti...

Descripción completa

Detalles Bibliográficos
Autor principal: Oh, Cheongeun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5133477/
https://www.ncbi.nlm.nih.gov/pubmed/27980665
http://dx.doi.org/10.1186/s12919-016-0059-0
_version_ 1782471269958025216
author Oh, Cheongeun
author_facet Oh, Cheongeun
author_sort Oh, Cheongeun
collection PubMed
description BACKGROUND: Recent advances in next-generation sequencing technologies have made it possible to generate large amounts of sequence data with rare variants in a cost-effective way. Yet, the statistical aspect of testing disease association of rare variants is quite challenging as the typical assumptions fail to hold owing to low minor allele frequency (<0.5 or 1 %). METHODS: I present a Bayesian variable selection approach to detect associations with both rare and common genetic variants for quantitative traits simultaneously. In my model, I frame the problem of identifying disease-associated variants as a problem of variable selection in a sparse space, that is, how best to model the relationship between phenotypes and a set of genetic variants. By constructing a risk index score for a group of rare variants, my method can effectively consider all variants in a multivariate model. I also use a within-chain permutation to generate the empirical thresholds to detect true-positive variants. RESULTS: I apply our method to study the association between increases in baseline systolic and diastolic blood pressure (SBP and DBP, respectively) and genetic variants in the data from Genetic Analysis Workshop 19 unrelated samples. I identify several rare and common variants in the gene MAP4 that are potentially associated with SBP and DBP. CONCLUSIONS: The application shows that my method is powerful in identifying disease-associated variants even with the extreme rarity.
format Online
Article
Text
id pubmed-5133477
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-51334772016-12-15 Identifying rare and common variants with Bayesian variable selection Oh, Cheongeun BMC Proc Proceedings BACKGROUND: Recent advances in next-generation sequencing technologies have made it possible to generate large amounts of sequence data with rare variants in a cost-effective way. Yet, the statistical aspect of testing disease association of rare variants is quite challenging as the typical assumptions fail to hold owing to low minor allele frequency (<0.5 or 1 %). METHODS: I present a Bayesian variable selection approach to detect associations with both rare and common genetic variants for quantitative traits simultaneously. In my model, I frame the problem of identifying disease-associated variants as a problem of variable selection in a sparse space, that is, how best to model the relationship between phenotypes and a set of genetic variants. By constructing a risk index score for a group of rare variants, my method can effectively consider all variants in a multivariate model. I also use a within-chain permutation to generate the empirical thresholds to detect true-positive variants. RESULTS: I apply our method to study the association between increases in baseline systolic and diastolic blood pressure (SBP and DBP, respectively) and genetic variants in the data from Genetic Analysis Workshop 19 unrelated samples. I identify several rare and common variants in the gene MAP4 that are potentially associated with SBP and DBP. CONCLUSIONS: The application shows that my method is powerful in identifying disease-associated variants even with the extreme rarity. BioMed Central 2016-10-18 /pmc/articles/PMC5133477/ /pubmed/27980665 http://dx.doi.org/10.1186/s12919-016-0059-0 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Oh, Cheongeun
Identifying rare and common variants with Bayesian variable selection
title Identifying rare and common variants with Bayesian variable selection
title_full Identifying rare and common variants with Bayesian variable selection
title_fullStr Identifying rare and common variants with Bayesian variable selection
title_full_unstemmed Identifying rare and common variants with Bayesian variable selection
title_short Identifying rare and common variants with Bayesian variable selection
title_sort identifying rare and common variants with bayesian variable selection
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5133477/
https://www.ncbi.nlm.nih.gov/pubmed/27980665
http://dx.doi.org/10.1186/s12919-016-0059-0
work_keys_str_mv AT ohcheongeun identifyingrareandcommonvariantswithbayesianvariableselection