Cargando…

Filtering genetic variants and placing informative priors based on putative biological function

High-density genetic marker data, especially sequence data, imply an immense multiple testing burden. This can be ameliorated by filtering genetic variants, exploiting or accounting for correlations between variants, jointly testing variants, and by incorporating informative priors. Priors can be ba...

Descripción completa

Detalles Bibliográficos
Autores principales: Friedrichs, Stefanie, Malzahn, Dörthe, Pugh, Elizabeth W., Almeida, Marcio, Liu, Xiao Qing, Bailey, Julia N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895695/
https://www.ncbi.nlm.nih.gov/pubmed/26866982
http://dx.doi.org/10.1186/s12863-015-0313-x
_version_ 1782435903120080896
author Friedrichs, Stefanie
Malzahn, Dörthe
Pugh, Elizabeth W.
Almeida, Marcio
Liu, Xiao Qing
Bailey, Julia N.
author_facet Friedrichs, Stefanie
Malzahn, Dörthe
Pugh, Elizabeth W.
Almeida, Marcio
Liu, Xiao Qing
Bailey, Julia N.
author_sort Friedrichs, Stefanie
collection PubMed
description High-density genetic marker data, especially sequence data, imply an immense multiple testing burden. This can be ameliorated by filtering genetic variants, exploiting or accounting for correlations between variants, jointly testing variants, and by incorporating informative priors. Priors can be based on biological knowledge or predicted variant function, or even be used to integrate gene expression or other omics data. Based on Genetic Analysis Workshop (GAW) 19 data, this article discusses diversity and usefulness of functional variant scores provided, for example, by PolyPhen2, SIFT, or RegulomeDB annotations. Incorporating functional scores into variant filters or weights and adjusting the significance level for correlations between variants yielded significant associations with blood pressure traits in a large family study of Mexican Americans (GAW19 data set). Marker rs218966 in gene PHF14 and rs9836027 in MAP4 significantly associated with hypertension; additionally, rare variants in SNUPN significantly associated with systolic blood pressure. Variant weights strongly influenced the power of kernel methods and burden tests. Apart from variant weights in test statistics, prior weights may also be used when combining test statistics or to informatively weight p values while controlling false discovery rate (FDR). Indeed, power improved when gene expression data for FDR-controlled informative weighting of association test p values of genes was used. Finally, approaches exploiting variant correlations included identity-by-descent mapping and the optimal strategy for joint testing rare and common variants, which was observed to depend on linkage disequilibrium structure.
format Online
Article
Text
id pubmed-4895695
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48956952016-06-10 Filtering genetic variants and placing informative priors based on putative biological function Friedrichs, Stefanie Malzahn, Dörthe Pugh, Elizabeth W. Almeida, Marcio Liu, Xiao Qing Bailey, Julia N. BMC Genet Proceedings High-density genetic marker data, especially sequence data, imply an immense multiple testing burden. This can be ameliorated by filtering genetic variants, exploiting or accounting for correlations between variants, jointly testing variants, and by incorporating informative priors. Priors can be based on biological knowledge or predicted variant function, or even be used to integrate gene expression or other omics data. Based on Genetic Analysis Workshop (GAW) 19 data, this article discusses diversity and usefulness of functional variant scores provided, for example, by PolyPhen2, SIFT, or RegulomeDB annotations. Incorporating functional scores into variant filters or weights and adjusting the significance level for correlations between variants yielded significant associations with blood pressure traits in a large family study of Mexican Americans (GAW19 data set). Marker rs218966 in gene PHF14 and rs9836027 in MAP4 significantly associated with hypertension; additionally, rare variants in SNUPN significantly associated with systolic blood pressure. Variant weights strongly influenced the power of kernel methods and burden tests. Apart from variant weights in test statistics, prior weights may also be used when combining test statistics or to informatively weight p values while controlling false discovery rate (FDR). Indeed, power improved when gene expression data for FDR-controlled informative weighting of association test p values of genes was used. Finally, approaches exploiting variant correlations included identity-by-descent mapping and the optimal strategy for joint testing rare and common variants, which was observed to depend on linkage disequilibrium structure. BioMed Central 2016-02-03 /pmc/articles/PMC4895695/ /pubmed/26866982 http://dx.doi.org/10.1186/s12863-015-0313-x Text en © Friedrichs et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Friedrichs, Stefanie
Malzahn, Dörthe
Pugh, Elizabeth W.
Almeida, Marcio
Liu, Xiao Qing
Bailey, Julia N.
Filtering genetic variants and placing informative priors based on putative biological function
title Filtering genetic variants and placing informative priors based on putative biological function
title_full Filtering genetic variants and placing informative priors based on putative biological function
title_fullStr Filtering genetic variants and placing informative priors based on putative biological function
title_full_unstemmed Filtering genetic variants and placing informative priors based on putative biological function
title_short Filtering genetic variants and placing informative priors based on putative biological function
title_sort filtering genetic variants and placing informative priors based on putative biological function
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895695/
https://www.ncbi.nlm.nih.gov/pubmed/26866982
http://dx.doi.org/10.1186/s12863-015-0313-x
work_keys_str_mv AT friedrichsstefanie filteringgeneticvariantsandplacinginformativepriorsbasedonputativebiologicalfunction
AT malzahndorthe filteringgeneticvariantsandplacinginformativepriorsbasedonputativebiologicalfunction
AT pughelizabethw filteringgeneticvariantsandplacinginformativepriorsbasedonputativebiologicalfunction
AT almeidamarcio filteringgeneticvariantsandplacinginformativepriorsbasedonputativebiologicalfunction
AT liuxiaoqing filteringgeneticvariantsandplacinginformativepriorsbasedonputativebiologicalfunction
AT baileyjulian filteringgeneticvariantsandplacinginformativepriorsbasedonputativebiologicalfunction