Cargando…

Performance of statistical methods on CHARGE targeted sequencing data

BACKGROUND: The CHARGE (Cohorts for Heart and Aging Research in Genomic Epidemiology) Sequencing Project is a national, collaborative effort from 3 studies: Framingham Heart Study (FHS), Cardiovascular Health Study (CHS), and Atherosclerosis Risk in Communities (ARIC). It uses a case-cohort design,...

Descripción completa

Detalles Bibliográficos
Autores principales: Xing, Chuanhua, Dupuis, Josée, Cupples, L Adrienne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4197341/
https://www.ncbi.nlm.nih.gov/pubmed/25277365
http://dx.doi.org/10.1186/s12863-014-0104-9
_version_ 1782339609332547584
author Xing, Chuanhua
Dupuis, Josée
Cupples, L Adrienne
author_facet Xing, Chuanhua
Dupuis, Josée
Cupples, L Adrienne
author_sort Xing, Chuanhua
collection PubMed
description BACKGROUND: The CHARGE (Cohorts for Heart and Aging Research in Genomic Epidemiology) Sequencing Project is a national, collaborative effort from 3 studies: Framingham Heart Study (FHS), Cardiovascular Health Study (CHS), and Atherosclerosis Risk in Communities (ARIC). It uses a case-cohort design, whereby a random sample of study participants is enriched with participants in extremes of traits. Although statistical methods are available to investigate the role of rare variants, few have evaluated their performance in a case-cohort design. RESULTS: We evaluate several methods, including the sequence kernel association test (SKAT), Score-Seq, and weighted (Madsen and Browning) and unweighted burden tests. Using genotypes from the CHARGE targeted-sequencing project for FHS (n = 1096), we simulate phenotypes in a large population for 11 correlated traits and then sample individuals to mimic the CHARGE Sequencing study design. We evaluate type I error and power for 77 targeted regions. CONCLUSIONS: We provide some guidelines on the performance of these aggregate-based tests to detect associations with rare variants when applied to case-cohort study designs, using CHARGE targeted sequencing data. Type I error is conservative when we consider variants with minor allele frequency (MAF) < 1%. Power is generally low, although it is relatively larger for Score-Seq. Greater numbers of causal variants and a greater proportion of variance improve the power, but it tends to be lower in the presence of bi-directionality of effects of causal genotypes, especially for Score-Seq. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-014-0104-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4197341
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41973412014-10-23 Performance of statistical methods on CHARGE targeted sequencing data Xing, Chuanhua Dupuis, Josée Cupples, L Adrienne BMC Genet Research Article BACKGROUND: The CHARGE (Cohorts for Heart and Aging Research in Genomic Epidemiology) Sequencing Project is a national, collaborative effort from 3 studies: Framingham Heart Study (FHS), Cardiovascular Health Study (CHS), and Atherosclerosis Risk in Communities (ARIC). It uses a case-cohort design, whereby a random sample of study participants is enriched with participants in extremes of traits. Although statistical methods are available to investigate the role of rare variants, few have evaluated their performance in a case-cohort design. RESULTS: We evaluate several methods, including the sequence kernel association test (SKAT), Score-Seq, and weighted (Madsen and Browning) and unweighted burden tests. Using genotypes from the CHARGE targeted-sequencing project for FHS (n = 1096), we simulate phenotypes in a large population for 11 correlated traits and then sample individuals to mimic the CHARGE Sequencing study design. We evaluate type I error and power for 77 targeted regions. CONCLUSIONS: We provide some guidelines on the performance of these aggregate-based tests to detect associations with rare variants when applied to case-cohort study designs, using CHARGE targeted sequencing data. Type I error is conservative when we consider variants with minor allele frequency (MAF) < 1%. Power is generally low, although it is relatively larger for Score-Seq. Greater numbers of causal variants and a greater proportion of variance improve the power, but it tends to be lower in the presence of bi-directionality of effects of causal genotypes, especially for Score-Seq. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-014-0104-9) contains supplementary material, which is available to authorized users. BioMed Central 2014-10-03 /pmc/articles/PMC4197341/ /pubmed/25277365 http://dx.doi.org/10.1186/s12863-014-0104-9 Text en © Xing et al.; licensee BioMed Central Ltd. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Xing, Chuanhua
Dupuis, Josée
Cupples, L Adrienne
Performance of statistical methods on CHARGE targeted sequencing data
title Performance of statistical methods on CHARGE targeted sequencing data
title_full Performance of statistical methods on CHARGE targeted sequencing data
title_fullStr Performance of statistical methods on CHARGE targeted sequencing data
title_full_unstemmed Performance of statistical methods on CHARGE targeted sequencing data
title_short Performance of statistical methods on CHARGE targeted sequencing data
title_sort performance of statistical methods on charge targeted sequencing data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4197341/
https://www.ncbi.nlm.nih.gov/pubmed/25277365
http://dx.doi.org/10.1186/s12863-014-0104-9
work_keys_str_mv AT xingchuanhua performanceofstatisticalmethodsonchargetargetedsequencingdata
AT dupuisjosee performanceofstatisticalmethodsonchargetargetedsequencingdata
AT cupplesladrienne performanceofstatisticalmethodsonchargetargetedsequencingdata