Cargando…
Gene genealogies for genetic association mapping, with application to Crohn's disease
A gene genealogy describes relationships among haplotypes sampled from a population. Knowledge of the gene genealogy for a set of haplotypes is useful for estimation of population genetic parameters and it also has potential application in finding disease-predisposing genetic variants. As the true g...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3845011/ https://www.ncbi.nlm.nih.gov/pubmed/24348515 http://dx.doi.org/10.3389/fgene.2013.00260 |
_version_ | 1782293274424246272 |
---|---|
author | Burkett, Kelly M. Greenwood, Celia M. T. McNeney, Brad Graham, Jinko |
author_facet | Burkett, Kelly M. Greenwood, Celia M. T. McNeney, Brad Graham, Jinko |
author_sort | Burkett, Kelly M. |
collection | PubMed |
description | A gene genealogy describes relationships among haplotypes sampled from a population. Knowledge of the gene genealogy for a set of haplotypes is useful for estimation of population genetic parameters and it also has potential application in finding disease-predisposing genetic variants. As the true gene genealogy is unknown, Markov chain Monte Carlo (MCMC) approaches have been used to sample genealogies conditional on data at multiple genetic markers. We previously implemented an MCMC algorithm to sample from an approximation to the distribution of the gene genealogy conditional on haplotype data. Our approach samples ancestral trees, recombination and mutation rates at a genomic focal point. In this work, we describe how our sampler can be used to find disease-predisposing genetic variants in samples of cases and controls. We use a tree-based association statistic that quantifies the degree to which case haplotypes are more closely related to each other around the focal point than control haplotypes, without relying on a disease model. As the ancestral tree is a latent variable, so is the tree-based association statistic. We show how the sampler can be used to estimate the posterior distribution of the latent test statistic and corresponding latent p-values, which together comprise a fuzzy p-value. We illustrate the approach on a publicly-available dataset from a study of Crohn's disease that consists of genotypes at multiple SNP markers in a small genomic region. We estimate the posterior distribution of the tree-based association statistic and the recombination rate at multiple focal points in the region. Reassuringly, the posterior mean recombination rates estimated at the different focal points are consistent with previously published estimates. The tree-based association approach finds multiple sub-regions where the case haplotypes are more genetically related than the control haplotypes, and that there may be one or multiple disease-predisposing loci. |
format | Online Article Text |
id | pubmed-3845011 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-38450112013-12-13 Gene genealogies for genetic association mapping, with application to Crohn's disease Burkett, Kelly M. Greenwood, Celia M. T. McNeney, Brad Graham, Jinko Front Genet Genetics A gene genealogy describes relationships among haplotypes sampled from a population. Knowledge of the gene genealogy for a set of haplotypes is useful for estimation of population genetic parameters and it also has potential application in finding disease-predisposing genetic variants. As the true gene genealogy is unknown, Markov chain Monte Carlo (MCMC) approaches have been used to sample genealogies conditional on data at multiple genetic markers. We previously implemented an MCMC algorithm to sample from an approximation to the distribution of the gene genealogy conditional on haplotype data. Our approach samples ancestral trees, recombination and mutation rates at a genomic focal point. In this work, we describe how our sampler can be used to find disease-predisposing genetic variants in samples of cases and controls. We use a tree-based association statistic that quantifies the degree to which case haplotypes are more closely related to each other around the focal point than control haplotypes, without relying on a disease model. As the ancestral tree is a latent variable, so is the tree-based association statistic. We show how the sampler can be used to estimate the posterior distribution of the latent test statistic and corresponding latent p-values, which together comprise a fuzzy p-value. We illustrate the approach on a publicly-available dataset from a study of Crohn's disease that consists of genotypes at multiple SNP markers in a small genomic region. We estimate the posterior distribution of the tree-based association statistic and the recombination rate at multiple focal points in the region. Reassuringly, the posterior mean recombination rates estimated at the different focal points are consistent with previously published estimates. The tree-based association approach finds multiple sub-regions where the case haplotypes are more genetically related than the control haplotypes, and that there may be one or multiple disease-predisposing loci. Frontiers Media S.A. 2013-12-02 /pmc/articles/PMC3845011/ /pubmed/24348515 http://dx.doi.org/10.3389/fgene.2013.00260 Text en Copyright © 2013 Burkett, Greenwood, McNeney and Graham. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Burkett, Kelly M. Greenwood, Celia M. T. McNeney, Brad Graham, Jinko Gene genealogies for genetic association mapping, with application to Crohn's disease |
title | Gene genealogies for genetic association mapping, with application to Crohn's disease |
title_full | Gene genealogies for genetic association mapping, with application to Crohn's disease |
title_fullStr | Gene genealogies for genetic association mapping, with application to Crohn's disease |
title_full_unstemmed | Gene genealogies for genetic association mapping, with application to Crohn's disease |
title_short | Gene genealogies for genetic association mapping, with application to Crohn's disease |
title_sort | gene genealogies for genetic association mapping, with application to crohn's disease |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3845011/ https://www.ncbi.nlm.nih.gov/pubmed/24348515 http://dx.doi.org/10.3389/fgene.2013.00260 |
work_keys_str_mv | AT burkettkellym genegenealogiesforgeneticassociationmappingwithapplicationtocrohnsdisease AT greenwoodceliamt genegenealogiesforgeneticassociationmappingwithapplicationtocrohnsdisease AT mcneneybrad genegenealogiesforgeneticassociationmappingwithapplicationtocrohnsdisease AT grahamjinko genegenealogiesforgeneticassociationmappingwithapplicationtocrohnsdisease |