Cargando…
Filtering genetic variants and placing informative priors based on putative biological function
High-density genetic marker data, especially sequence data, imply an immense multiple testing burden. This can be ameliorated by filtering genetic variants, exploiting or accounting for correlations between variants, jointly testing variants, and by incorporating informative priors. Priors can be ba...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895695/ https://www.ncbi.nlm.nih.gov/pubmed/26866982 http://dx.doi.org/10.1186/s12863-015-0313-x |
_version_ | 1782435903120080896 |
---|---|
author | Friedrichs, Stefanie Malzahn, Dörthe Pugh, Elizabeth W. Almeida, Marcio Liu, Xiao Qing Bailey, Julia N. |
author_facet | Friedrichs, Stefanie Malzahn, Dörthe Pugh, Elizabeth W. Almeida, Marcio Liu, Xiao Qing Bailey, Julia N. |
author_sort | Friedrichs, Stefanie |
collection | PubMed |
description | High-density genetic marker data, especially sequence data, imply an immense multiple testing burden. This can be ameliorated by filtering genetic variants, exploiting or accounting for correlations between variants, jointly testing variants, and by incorporating informative priors. Priors can be based on biological knowledge or predicted variant function, or even be used to integrate gene expression or other omics data. Based on Genetic Analysis Workshop (GAW) 19 data, this article discusses diversity and usefulness of functional variant scores provided, for example, by PolyPhen2, SIFT, or RegulomeDB annotations. Incorporating functional scores into variant filters or weights and adjusting the significance level for correlations between variants yielded significant associations with blood pressure traits in a large family study of Mexican Americans (GAW19 data set). Marker rs218966 in gene PHF14 and rs9836027 in MAP4 significantly associated with hypertension; additionally, rare variants in SNUPN significantly associated with systolic blood pressure. Variant weights strongly influenced the power of kernel methods and burden tests. Apart from variant weights in test statistics, prior weights may also be used when combining test statistics or to informatively weight p values while controlling false discovery rate (FDR). Indeed, power improved when gene expression data for FDR-controlled informative weighting of association test p values of genes was used. Finally, approaches exploiting variant correlations included identity-by-descent mapping and the optimal strategy for joint testing rare and common variants, which was observed to depend on linkage disequilibrium structure. |
format | Online Article Text |
id | pubmed-4895695 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48956952016-06-10 Filtering genetic variants and placing informative priors based on putative biological function Friedrichs, Stefanie Malzahn, Dörthe Pugh, Elizabeth W. Almeida, Marcio Liu, Xiao Qing Bailey, Julia N. BMC Genet Proceedings High-density genetic marker data, especially sequence data, imply an immense multiple testing burden. This can be ameliorated by filtering genetic variants, exploiting or accounting for correlations between variants, jointly testing variants, and by incorporating informative priors. Priors can be based on biological knowledge or predicted variant function, or even be used to integrate gene expression or other omics data. Based on Genetic Analysis Workshop (GAW) 19 data, this article discusses diversity and usefulness of functional variant scores provided, for example, by PolyPhen2, SIFT, or RegulomeDB annotations. Incorporating functional scores into variant filters or weights and adjusting the significance level for correlations between variants yielded significant associations with blood pressure traits in a large family study of Mexican Americans (GAW19 data set). Marker rs218966 in gene PHF14 and rs9836027 in MAP4 significantly associated with hypertension; additionally, rare variants in SNUPN significantly associated with systolic blood pressure. Variant weights strongly influenced the power of kernel methods and burden tests. Apart from variant weights in test statistics, prior weights may also be used when combining test statistics or to informatively weight p values while controlling false discovery rate (FDR). Indeed, power improved when gene expression data for FDR-controlled informative weighting of association test p values of genes was used. Finally, approaches exploiting variant correlations included identity-by-descent mapping and the optimal strategy for joint testing rare and common variants, which was observed to depend on linkage disequilibrium structure. BioMed Central 2016-02-03 /pmc/articles/PMC4895695/ /pubmed/26866982 http://dx.doi.org/10.1186/s12863-015-0313-x Text en © Friedrichs et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Friedrichs, Stefanie Malzahn, Dörthe Pugh, Elizabeth W. Almeida, Marcio Liu, Xiao Qing Bailey, Julia N. Filtering genetic variants and placing informative priors based on putative biological function |
title | Filtering genetic variants and placing informative priors based on putative biological function |
title_full | Filtering genetic variants and placing informative priors based on putative biological function |
title_fullStr | Filtering genetic variants and placing informative priors based on putative biological function |
title_full_unstemmed | Filtering genetic variants and placing informative priors based on putative biological function |
title_short | Filtering genetic variants and placing informative priors based on putative biological function |
title_sort | filtering genetic variants and placing informative priors based on putative biological function |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895695/ https://www.ncbi.nlm.nih.gov/pubmed/26866982 http://dx.doi.org/10.1186/s12863-015-0313-x |
work_keys_str_mv | AT friedrichsstefanie filteringgeneticvariantsandplacinginformativepriorsbasedonputativebiologicalfunction AT malzahndorthe filteringgeneticvariantsandplacinginformativepriorsbasedonputativebiologicalfunction AT pughelizabethw filteringgeneticvariantsandplacinginformativepriorsbasedonputativebiologicalfunction AT almeidamarcio filteringgeneticvariantsandplacinginformativepriorsbasedonputativebiologicalfunction AT liuxiaoqing filteringgeneticvariantsandplacinginformativepriorsbasedonputativebiologicalfunction AT baileyjulian filteringgeneticvariantsandplacinginformativepriorsbasedonputativebiologicalfunction |