Cargando…

Rare Variant Association Testing by Adaptive Combination of P-values

With the development of next-generation sequencing technology, there is a great demand for powerful statistical methods to detect rare variants (minor allele frequencies (MAFs)<1%) associated with diseases. Testing for each variant site individually is known to be underpowered, and therefore many...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Wan-Yu, Lou, Xiang-Yang, Gao, Guimin, Liu, Nianjun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3893264/
https://www.ncbi.nlm.nih.gov/pubmed/24454922
http://dx.doi.org/10.1371/journal.pone.0085728
_version_ 1782299654034030592
author Lin, Wan-Yu
Lou, Xiang-Yang
Gao, Guimin
Liu, Nianjun
author_facet Lin, Wan-Yu
Lou, Xiang-Yang
Gao, Guimin
Liu, Nianjun
author_sort Lin, Wan-Yu
collection PubMed
description With the development of next-generation sequencing technology, there is a great demand for powerful statistical methods to detect rare variants (minor allele frequencies (MAFs)<1%) associated with diseases. Testing for each variant site individually is known to be underpowered, and therefore many methods have been proposed to test for the association of a group of variants with phenotypes, by pooling signals of the variants in a chromosomal region. However, this pooling strategy inevitably leads to the inclusion of a large proportion of neutral variants, which may compromise the power of association tests. To address this issue, we extend the [Image: see text] -MidP method (Cheung et al., 2012, Genet Epidemiol 36: 675–685) and propose an approach (named ‘adaptive combination of P-values for rare variant association testing’, abbreviated as ‘ADA’) that adaptively combines per-site P-values with the weights based on MAFs. Before combining P-values, we first imposed a truncation threshold upon the per-site P-values, to guard against the noise caused by the inclusion of neutral variants. This ADA method is shown to outperform popular burden tests and non-burden tests under many scenarios. ADA is recommended for next-generation sequencing data analysis where many neutral variants may be included in a functional region.
format Online
Article
Text
id pubmed-3893264
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38932642014-01-21 Rare Variant Association Testing by Adaptive Combination of P-values Lin, Wan-Yu Lou, Xiang-Yang Gao, Guimin Liu, Nianjun PLoS One Research Article With the development of next-generation sequencing technology, there is a great demand for powerful statistical methods to detect rare variants (minor allele frequencies (MAFs)<1%) associated with diseases. Testing for each variant site individually is known to be underpowered, and therefore many methods have been proposed to test for the association of a group of variants with phenotypes, by pooling signals of the variants in a chromosomal region. However, this pooling strategy inevitably leads to the inclusion of a large proportion of neutral variants, which may compromise the power of association tests. To address this issue, we extend the [Image: see text] -MidP method (Cheung et al., 2012, Genet Epidemiol 36: 675–685) and propose an approach (named ‘adaptive combination of P-values for rare variant association testing’, abbreviated as ‘ADA’) that adaptively combines per-site P-values with the weights based on MAFs. Before combining P-values, we first imposed a truncation threshold upon the per-site P-values, to guard against the noise caused by the inclusion of neutral variants. This ADA method is shown to outperform popular burden tests and non-burden tests under many scenarios. ADA is recommended for next-generation sequencing data analysis where many neutral variants may be included in a functional region. Public Library of Science 2014-01-15 /pmc/articles/PMC3893264/ /pubmed/24454922 http://dx.doi.org/10.1371/journal.pone.0085728 Text en © 2014 Lin et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lin, Wan-Yu
Lou, Xiang-Yang
Gao, Guimin
Liu, Nianjun
Rare Variant Association Testing by Adaptive Combination of P-values
title Rare Variant Association Testing by Adaptive Combination of P-values
title_full Rare Variant Association Testing by Adaptive Combination of P-values
title_fullStr Rare Variant Association Testing by Adaptive Combination of P-values
title_full_unstemmed Rare Variant Association Testing by Adaptive Combination of P-values
title_short Rare Variant Association Testing by Adaptive Combination of P-values
title_sort rare variant association testing by adaptive combination of p-values
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3893264/
https://www.ncbi.nlm.nih.gov/pubmed/24454922
http://dx.doi.org/10.1371/journal.pone.0085728
work_keys_str_mv AT linwanyu rarevariantassociationtestingbyadaptivecombinationofpvalues
AT louxiangyang rarevariantassociationtestingbyadaptivecombinationofpvalues
AT gaoguimin rarevariantassociationtestingbyadaptivecombinationofpvalues
AT liunianjun rarevariantassociationtestingbyadaptivecombinationofpvalues