Cargando…

Collapsing-based and kernel-based single-gene analyses applied to Genetic Analysis Workshop 17 mini-exome data

Recently there has been great interest in identifying rare variants associated with common diseases. We apply several collapsing-based and kernel-based single-gene association tests to Genetic Analysis Workshop 17 (GAW17) rare variant association data with unrelated individuals without knowledge of...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Lun, Zheng, Wei, Lee, Joon Sang, Zhang, Xianghua, Ferguson, John, Yan, Xiting, Zhao, Hongyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287841/
https://www.ncbi.nlm.nih.gov/pubmed/22373309
http://dx.doi.org/10.1186/1753-6561-5-S9-S117
_version_ 1782224755842088960
author Li, Lun
Zheng, Wei
Lee, Joon Sang
Zhang, Xianghua
Ferguson, John
Yan, Xiting
Zhao, Hongyu
author_facet Li, Lun
Zheng, Wei
Lee, Joon Sang
Zhang, Xianghua
Ferguson, John
Yan, Xiting
Zhao, Hongyu
author_sort Li, Lun
collection PubMed
description Recently there has been great interest in identifying rare variants associated with common diseases. We apply several collapsing-based and kernel-based single-gene association tests to Genetic Analysis Workshop 17 (GAW17) rare variant association data with unrelated individuals without knowledge of the simulation model. We also implement modified versions of these methods using additional information, such as minor allele frequency (MAF) and functional annotation. For each of four given traits provided in GAW17, we use the Bayesian mixed-effects model to estimate the phenotypic variance explained by the given environmental and genotypic data and to infer an individual-specific genetic effect to use directly in single-gene association tests. After obtaining information on the GAW17 simulation model, we compare the performance of all methods and examine the top genes identified by those methods. We find that collapsing-based methods with weights based on MAFs are sensitive to the “lower MAF, larger effect size” assumption, whereas kernel-based methods are more robust when this assumption is violated. In addition, many false-positive genes identified by multiple methods often contain variants with exactly the same genotype distribution as the causal variants used in the simulation model. When the sample size is much smaller than the number of rare variants, it is more likely that causal and noncausal variants will share the same or similar genotype distribution. This likely contributes to the low power and large number of false-positive results of all methods in detecting causal variants associated with disease in the GAW17 data set.
format Online
Article
Text
id pubmed-3287841
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32878412012-02-28 Collapsing-based and kernel-based single-gene analyses applied to Genetic Analysis Workshop 17 mini-exome data Li, Lun Zheng, Wei Lee, Joon Sang Zhang, Xianghua Ferguson, John Yan, Xiting Zhao, Hongyu BMC Proc Proceedings Recently there has been great interest in identifying rare variants associated with common diseases. We apply several collapsing-based and kernel-based single-gene association tests to Genetic Analysis Workshop 17 (GAW17) rare variant association data with unrelated individuals without knowledge of the simulation model. We also implement modified versions of these methods using additional information, such as minor allele frequency (MAF) and functional annotation. For each of four given traits provided in GAW17, we use the Bayesian mixed-effects model to estimate the phenotypic variance explained by the given environmental and genotypic data and to infer an individual-specific genetic effect to use directly in single-gene association tests. After obtaining information on the GAW17 simulation model, we compare the performance of all methods and examine the top genes identified by those methods. We find that collapsing-based methods with weights based on MAFs are sensitive to the “lower MAF, larger effect size” assumption, whereas kernel-based methods are more robust when this assumption is violated. In addition, many false-positive genes identified by multiple methods often contain variants with exactly the same genotype distribution as the causal variants used in the simulation model. When the sample size is much smaller than the number of rare variants, it is more likely that causal and noncausal variants will share the same or similar genotype distribution. This likely contributes to the low power and large number of false-positive results of all methods in detecting causal variants associated with disease in the GAW17 data set. BioMed Central 2011-11-29 /pmc/articles/PMC3287841/ /pubmed/22373309 http://dx.doi.org/10.1186/1753-6561-5-S9-S117 Text en Copyright ©2011 Li et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Li, Lun
Zheng, Wei
Lee, Joon Sang
Zhang, Xianghua
Ferguson, John
Yan, Xiting
Zhao, Hongyu
Collapsing-based and kernel-based single-gene analyses applied to Genetic Analysis Workshop 17 mini-exome data
title Collapsing-based and kernel-based single-gene analyses applied to Genetic Analysis Workshop 17 mini-exome data
title_full Collapsing-based and kernel-based single-gene analyses applied to Genetic Analysis Workshop 17 mini-exome data
title_fullStr Collapsing-based and kernel-based single-gene analyses applied to Genetic Analysis Workshop 17 mini-exome data
title_full_unstemmed Collapsing-based and kernel-based single-gene analyses applied to Genetic Analysis Workshop 17 mini-exome data
title_short Collapsing-based and kernel-based single-gene analyses applied to Genetic Analysis Workshop 17 mini-exome data
title_sort collapsing-based and kernel-based single-gene analyses applied to genetic analysis workshop 17 mini-exome data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287841/
https://www.ncbi.nlm.nih.gov/pubmed/22373309
http://dx.doi.org/10.1186/1753-6561-5-S9-S117
work_keys_str_mv AT lilun collapsingbasedandkernelbasedsinglegeneanalysesappliedtogeneticanalysisworkshop17miniexomedata
AT zhengwei collapsingbasedandkernelbasedsinglegeneanalysesappliedtogeneticanalysisworkshop17miniexomedata
AT leejoonsang collapsingbasedandkernelbasedsinglegeneanalysesappliedtogeneticanalysisworkshop17miniexomedata
AT zhangxianghua collapsingbasedandkernelbasedsinglegeneanalysesappliedtogeneticanalysisworkshop17miniexomedata
AT fergusonjohn collapsingbasedandkernelbasedsinglegeneanalysesappliedtogeneticanalysisworkshop17miniexomedata
AT yanxiting collapsingbasedandkernelbasedsinglegeneanalysesappliedtogeneticanalysisworkshop17miniexomedata
AT zhaohongyu collapsingbasedandkernelbasedsinglegeneanalysesappliedtogeneticanalysisworkshop17miniexomedata