Cargando…

Exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation

When analyzing the data that arises from exome or whole-genome sequencing studies, window-based tests, (i.e., tests that jointly analyze all genetic data in a small genomic region), are very popular. However, power is known to be quite low for finding associations with phenotypes using these tests,...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, ChangJiang, Ciampi, Antonio, Greenwood, Celia M. T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3905218/
https://www.ncbi.nlm.nih.gov/pubmed/24523729
http://dx.doi.org/10.3389/fgene.2014.00011
_version_ 1782301311297912832
author Xu, ChangJiang
Ciampi, Antonio
Greenwood, Celia M. T.
author_facet Xu, ChangJiang
Ciampi, Antonio
Greenwood, Celia M. T.
author_sort Xu, ChangJiang
collection PubMed
description When analyzing the data that arises from exome or whole-genome sequencing studies, window-based tests, (i.e., tests that jointly analyze all genetic data in a small genomic region), are very popular. However, power is known to be quite low for finding associations with phenotypes using these tests, and therefore a variety of analytic strategies may be employed to potentially improve power. Using sequencing data of all of chromosome 3 from an interim release of data on 2432 individuals from the UK10K project, we simulated phenotypes associated with rare genetic variation, and used the results to explore the window-based test power. We asked two specific questions: firstly, whether there could be substantial benefits associated with incorporating information from external annotation on the genetic variants, and secondly whether the false discovery rate (FDRs) would be a useful metric for assessing significance. Although, as expected, there are benefits to using additional information (such as annotation) when it is associated with causality, we confirmed the general pattern of low sensitivity and power for window-based tests. For our chosen example, even when power is high to detect some of the associations, many of the regions containing causal variants are not detectable, despite using lax significance thresholds and optimal analytic methods. Furthermore, our estimated FDR values tended to be much smaller than the true FDRs. Long-range correlations between variants—due to linkage disequilibrium—likely explain some of this bias. A more sophisticated approach to using the annotation information may improve power, however, many causal variants of realistic effect sizes may simply be undetectable, at least with this sample size. Perhaps annotation information could assist in distinguishing windows containing causal variants from windows that are merely correlated with causal variants.
format Online
Article
Text
id pubmed-3905218
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-39052182014-02-12 Exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation Xu, ChangJiang Ciampi, Antonio Greenwood, Celia M. T. Front Genet Genetics When analyzing the data that arises from exome or whole-genome sequencing studies, window-based tests, (i.e., tests that jointly analyze all genetic data in a small genomic region), are very popular. However, power is known to be quite low for finding associations with phenotypes using these tests, and therefore a variety of analytic strategies may be employed to potentially improve power. Using sequencing data of all of chromosome 3 from an interim release of data on 2432 individuals from the UK10K project, we simulated phenotypes associated with rare genetic variation, and used the results to explore the window-based test power. We asked two specific questions: firstly, whether there could be substantial benefits associated with incorporating information from external annotation on the genetic variants, and secondly whether the false discovery rate (FDRs) would be a useful metric for assessing significance. Although, as expected, there are benefits to using additional information (such as annotation) when it is associated with causality, we confirmed the general pattern of low sensitivity and power for window-based tests. For our chosen example, even when power is high to detect some of the associations, many of the regions containing causal variants are not detectable, despite using lax significance thresholds and optimal analytic methods. Furthermore, our estimated FDR values tended to be much smaller than the true FDRs. Long-range correlations between variants—due to linkage disequilibrium—likely explain some of this bias. A more sophisticated approach to using the annotation information may improve power, however, many causal variants of realistic effect sizes may simply be undetectable, at least with this sample size. Perhaps annotation information could assist in distinguishing windows containing causal variants from windows that are merely correlated with causal variants. Frontiers Media S.A. 2014-01-29 /pmc/articles/PMC3905218/ /pubmed/24523729 http://dx.doi.org/10.3389/fgene.2014.00011 Text en Copyright © 2014 Xu, Ciampi and Greenwood. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Xu, ChangJiang
Ciampi, Antonio
Greenwood, Celia M. T.
Exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation
title Exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation
title_full Exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation
title_fullStr Exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation
title_full_unstemmed Exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation
title_short Exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation
title_sort exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3905218/
https://www.ncbi.nlm.nih.gov/pubmed/24523729
http://dx.doi.org/10.3389/fgene.2014.00011
work_keys_str_mv AT xuchangjiang exploringthepotentialbenefitsofstratifiedfalsediscoveryratesforregionbasedtestingofassociationwithraregeneticvariation
AT ciampiantonio exploringthepotentialbenefitsofstratifiedfalsediscoveryratesforregionbasedtestingofassociationwithraregeneticvariation
AT greenwoodceliamt exploringthepotentialbenefitsofstratifiedfalsediscoveryratesforregionbasedtestingofassociationwithraregeneticvariation
AT exploringthepotentialbenefitsofstratifiedfalsediscoveryratesforregionbasedtestingofassociationwithraregeneticvariation