Cargando…
Rare Variant Association Testing by Adaptive Combination of P-values
With the development of next-generation sequencing technology, there is a great demand for powerful statistical methods to detect rare variants (minor allele frequencies (MAFs)<1%) associated with diseases. Testing for each variant site individually is known to be underpowered, and therefore many...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3893264/ https://www.ncbi.nlm.nih.gov/pubmed/24454922 http://dx.doi.org/10.1371/journal.pone.0085728 |
_version_ | 1782299654034030592 |
---|---|
author | Lin, Wan-Yu Lou, Xiang-Yang Gao, Guimin Liu, Nianjun |
author_facet | Lin, Wan-Yu Lou, Xiang-Yang Gao, Guimin Liu, Nianjun |
author_sort | Lin, Wan-Yu |
collection | PubMed |
description | With the development of next-generation sequencing technology, there is a great demand for powerful statistical methods to detect rare variants (minor allele frequencies (MAFs)<1%) associated with diseases. Testing for each variant site individually is known to be underpowered, and therefore many methods have been proposed to test for the association of a group of variants with phenotypes, by pooling signals of the variants in a chromosomal region. However, this pooling strategy inevitably leads to the inclusion of a large proportion of neutral variants, which may compromise the power of association tests. To address this issue, we extend the [Image: see text] -MidP method (Cheung et al., 2012, Genet Epidemiol 36: 675–685) and propose an approach (named ‘adaptive combination of P-values for rare variant association testing’, abbreviated as ‘ADA’) that adaptively combines per-site P-values with the weights based on MAFs. Before combining P-values, we first imposed a truncation threshold upon the per-site P-values, to guard against the noise caused by the inclusion of neutral variants. This ADA method is shown to outperform popular burden tests and non-burden tests under many scenarios. ADA is recommended for next-generation sequencing data analysis where many neutral variants may be included in a functional region. |
format | Online Article Text |
id | pubmed-3893264 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-38932642014-01-21 Rare Variant Association Testing by Adaptive Combination of P-values Lin, Wan-Yu Lou, Xiang-Yang Gao, Guimin Liu, Nianjun PLoS One Research Article With the development of next-generation sequencing technology, there is a great demand for powerful statistical methods to detect rare variants (minor allele frequencies (MAFs)<1%) associated with diseases. Testing for each variant site individually is known to be underpowered, and therefore many methods have been proposed to test for the association of a group of variants with phenotypes, by pooling signals of the variants in a chromosomal region. However, this pooling strategy inevitably leads to the inclusion of a large proportion of neutral variants, which may compromise the power of association tests. To address this issue, we extend the [Image: see text] -MidP method (Cheung et al., 2012, Genet Epidemiol 36: 675–685) and propose an approach (named ‘adaptive combination of P-values for rare variant association testing’, abbreviated as ‘ADA’) that adaptively combines per-site P-values with the weights based on MAFs. Before combining P-values, we first imposed a truncation threshold upon the per-site P-values, to guard against the noise caused by the inclusion of neutral variants. This ADA method is shown to outperform popular burden tests and non-burden tests under many scenarios. ADA is recommended for next-generation sequencing data analysis where many neutral variants may be included in a functional region. Public Library of Science 2014-01-15 /pmc/articles/PMC3893264/ /pubmed/24454922 http://dx.doi.org/10.1371/journal.pone.0085728 Text en © 2014 Lin et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Lin, Wan-Yu Lou, Xiang-Yang Gao, Guimin Liu, Nianjun Rare Variant Association Testing by Adaptive Combination of P-values |
title | Rare Variant Association Testing by Adaptive Combination of P-values |
title_full | Rare Variant Association Testing by Adaptive Combination of P-values |
title_fullStr | Rare Variant Association Testing by Adaptive Combination of P-values |
title_full_unstemmed | Rare Variant Association Testing by Adaptive Combination of P-values |
title_short | Rare Variant Association Testing by Adaptive Combination of P-values |
title_sort | rare variant association testing by adaptive combination of p-values |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3893264/ https://www.ncbi.nlm.nih.gov/pubmed/24454922 http://dx.doi.org/10.1371/journal.pone.0085728 |
work_keys_str_mv | AT linwanyu rarevariantassociationtestingbyadaptivecombinationofpvalues AT louxiangyang rarevariantassociationtestingbyadaptivecombinationofpvalues AT gaoguimin rarevariantassociationtestingbyadaptivecombinationofpvalues AT liunianjun rarevariantassociationtestingbyadaptivecombinationofpvalues |