Cargando…

Weighted pedigree-based statistics for testing the association of rare variants

BACKGROUND: With the advent of next-generation sequencing (NGS) technologies, researchers are now generating a deluge of data on high dimensional genomic variations, whose analysis is likely to reveal rare variants involved in the complex etiology of disease. Standing in the way of such discoveries,...

Descripción completa

Detalles Bibliográficos
Autores principales: Shugart, Yin Yao, Zhu, Yun, Guo, Wei, Xiong, Momiao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3827928/
https://www.ncbi.nlm.nih.gov/pubmed/23176082
http://dx.doi.org/10.1186/1471-2164-13-667
_version_ 1782291166039900160
author Shugart, Yin Yao
Zhu, Yun
Guo, Wei
Xiong, Momiao
author_facet Shugart, Yin Yao
Zhu, Yun
Guo, Wei
Xiong, Momiao
author_sort Shugart, Yin Yao
collection PubMed
description BACKGROUND: With the advent of next-generation sequencing (NGS) technologies, researchers are now generating a deluge of data on high dimensional genomic variations, whose analysis is likely to reveal rare variants involved in the complex etiology of disease. Standing in the way of such discoveries, however, is the fact that statistics for rare variants are currently designed for use with population-based data. In this paper, we introduce a pedigree-based statistic specifically designed to test for rare variants in family-based data. The additional power of pedigree-based statistics stems from the fact that while rare variants related to diseases or traits of interest occur only infrequently in populations, in families with multiple affected individuals, such variants are enriched. Note that while the proposed statistic can be applied with and without statistical weighting, our simulations show that its power increases when weighting (WSS and VT) are applied. RESULTS: Our working hypothesis was that, since rare variants are concentrated in families with multiple affected individuals, pedigree-based statistics should detect rare variants more powerfully than population-based statistics. To evaluate how well our new pedigree-based statistics perform in association studies, we develop a general framework for sequence-based association studies capable of handling data from pedigrees of various types and also from unrelated individuals. In short, we developed a procedure for transforming population-based statistics into tests for family-based associations. Furthermore, we modify two existing tests, the weighted sum-square test and the variable-threshold test, and apply both to our family-based collapsing methods. We demonstrate that the new family-based tests are more powerful than corresponding population-based test and they generate a reasonable type I error rate. To demonstrate feasibility, we apply the newly developed tests to a pedigree-based GWAS data set from the Framingham Heart Study (FHS). FHS-GWAS data contain approximately 5000 uncommon variants with frequencies less than 0.05. Potential association findings in these data demonstrate the feasibility of the software PB-STAR (note, PB-STAR is now freely available to the public). CONCLUSION: Our tests show that when analyzing for rare variants, a pedigree-based design is more powerful than a population-based case–control design. We further demonstrate that a pedigree-based statistic’s power to detect rare variants increases in direct relation to the proportion of affected individuals within the pedigree.
format Online
Article
Text
id pubmed-3827928
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38279282013-11-20 Weighted pedigree-based statistics for testing the association of rare variants Shugart, Yin Yao Zhu, Yun Guo, Wei Xiong, Momiao BMC Genomics Methodology Article BACKGROUND: With the advent of next-generation sequencing (NGS) technologies, researchers are now generating a deluge of data on high dimensional genomic variations, whose analysis is likely to reveal rare variants involved in the complex etiology of disease. Standing in the way of such discoveries, however, is the fact that statistics for rare variants are currently designed for use with population-based data. In this paper, we introduce a pedigree-based statistic specifically designed to test for rare variants in family-based data. The additional power of pedigree-based statistics stems from the fact that while rare variants related to diseases or traits of interest occur only infrequently in populations, in families with multiple affected individuals, such variants are enriched. Note that while the proposed statistic can be applied with and without statistical weighting, our simulations show that its power increases when weighting (WSS and VT) are applied. RESULTS: Our working hypothesis was that, since rare variants are concentrated in families with multiple affected individuals, pedigree-based statistics should detect rare variants more powerfully than population-based statistics. To evaluate how well our new pedigree-based statistics perform in association studies, we develop a general framework for sequence-based association studies capable of handling data from pedigrees of various types and also from unrelated individuals. In short, we developed a procedure for transforming population-based statistics into tests for family-based associations. Furthermore, we modify two existing tests, the weighted sum-square test and the variable-threshold test, and apply both to our family-based collapsing methods. We demonstrate that the new family-based tests are more powerful than corresponding population-based test and they generate a reasonable type I error rate. To demonstrate feasibility, we apply the newly developed tests to a pedigree-based GWAS data set from the Framingham Heart Study (FHS). FHS-GWAS data contain approximately 5000 uncommon variants with frequencies less than 0.05. Potential association findings in these data demonstrate the feasibility of the software PB-STAR (note, PB-STAR is now freely available to the public). CONCLUSION: Our tests show that when analyzing for rare variants, a pedigree-based design is more powerful than a population-based case–control design. We further demonstrate that a pedigree-based statistic’s power to detect rare variants increases in direct relation to the proportion of affected individuals within the pedigree. BioMed Central 2012-11-24 /pmc/articles/PMC3827928/ /pubmed/23176082 http://dx.doi.org/10.1186/1471-2164-13-667 Text en Copyright © 2012 Shugart et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Shugart, Yin Yao
Zhu, Yun
Guo, Wei
Xiong, Momiao
Weighted pedigree-based statistics for testing the association of rare variants
title Weighted pedigree-based statistics for testing the association of rare variants
title_full Weighted pedigree-based statistics for testing the association of rare variants
title_fullStr Weighted pedigree-based statistics for testing the association of rare variants
title_full_unstemmed Weighted pedigree-based statistics for testing the association of rare variants
title_short Weighted pedigree-based statistics for testing the association of rare variants
title_sort weighted pedigree-based statistics for testing the association of rare variants
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3827928/
https://www.ncbi.nlm.nih.gov/pubmed/23176082
http://dx.doi.org/10.1186/1471-2164-13-667
work_keys_str_mv AT shugartyinyao weightedpedigreebasedstatisticsfortestingtheassociationofrarevariants
AT zhuyun weightedpedigreebasedstatisticsfortestingtheassociationofrarevariants
AT guowei weightedpedigreebasedstatisticsfortestingtheassociationofrarevariants
AT xiongmomiao weightedpedigreebasedstatisticsfortestingtheassociationofrarevariants