Cargando…

Gene genealogies for genetic association mapping, with application to Crohn's disease

A gene genealogy describes relationships among haplotypes sampled from a population. Knowledge of the gene genealogy for a set of haplotypes is useful for estimation of population genetic parameters and it also has potential application in finding disease-predisposing genetic variants. As the true g...

Descripción completa

Detalles Bibliográficos
Autores principales: Burkett, Kelly M., Greenwood, Celia M. T., McNeney, Brad, Graham, Jinko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3845011/
https://www.ncbi.nlm.nih.gov/pubmed/24348515
http://dx.doi.org/10.3389/fgene.2013.00260
_version_ 1782293274424246272
author Burkett, Kelly M.
Greenwood, Celia M. T.
McNeney, Brad
Graham, Jinko
author_facet Burkett, Kelly M.
Greenwood, Celia M. T.
McNeney, Brad
Graham, Jinko
author_sort Burkett, Kelly M.
collection PubMed
description A gene genealogy describes relationships among haplotypes sampled from a population. Knowledge of the gene genealogy for a set of haplotypes is useful for estimation of population genetic parameters and it also has potential application in finding disease-predisposing genetic variants. As the true gene genealogy is unknown, Markov chain Monte Carlo (MCMC) approaches have been used to sample genealogies conditional on data at multiple genetic markers. We previously implemented an MCMC algorithm to sample from an approximation to the distribution of the gene genealogy conditional on haplotype data. Our approach samples ancestral trees, recombination and mutation rates at a genomic focal point. In this work, we describe how our sampler can be used to find disease-predisposing genetic variants in samples of cases and controls. We use a tree-based association statistic that quantifies the degree to which case haplotypes are more closely related to each other around the focal point than control haplotypes, without relying on a disease model. As the ancestral tree is a latent variable, so is the tree-based association statistic. We show how the sampler can be used to estimate the posterior distribution of the latent test statistic and corresponding latent p-values, which together comprise a fuzzy p-value. We illustrate the approach on a publicly-available dataset from a study of Crohn's disease that consists of genotypes at multiple SNP markers in a small genomic region. We estimate the posterior distribution of the tree-based association statistic and the recombination rate at multiple focal points in the region. Reassuringly, the posterior mean recombination rates estimated at the different focal points are consistent with previously published estimates. The tree-based association approach finds multiple sub-regions where the case haplotypes are more genetically related than the control haplotypes, and that there may be one or multiple disease-predisposing loci.
format Online
Article
Text
id pubmed-3845011
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-38450112013-12-13 Gene genealogies for genetic association mapping, with application to Crohn's disease Burkett, Kelly M. Greenwood, Celia M. T. McNeney, Brad Graham, Jinko Front Genet Genetics A gene genealogy describes relationships among haplotypes sampled from a population. Knowledge of the gene genealogy for a set of haplotypes is useful for estimation of population genetic parameters and it also has potential application in finding disease-predisposing genetic variants. As the true gene genealogy is unknown, Markov chain Monte Carlo (MCMC) approaches have been used to sample genealogies conditional on data at multiple genetic markers. We previously implemented an MCMC algorithm to sample from an approximation to the distribution of the gene genealogy conditional on haplotype data. Our approach samples ancestral trees, recombination and mutation rates at a genomic focal point. In this work, we describe how our sampler can be used to find disease-predisposing genetic variants in samples of cases and controls. We use a tree-based association statistic that quantifies the degree to which case haplotypes are more closely related to each other around the focal point than control haplotypes, without relying on a disease model. As the ancestral tree is a latent variable, so is the tree-based association statistic. We show how the sampler can be used to estimate the posterior distribution of the latent test statistic and corresponding latent p-values, which together comprise a fuzzy p-value. We illustrate the approach on a publicly-available dataset from a study of Crohn's disease that consists of genotypes at multiple SNP markers in a small genomic region. We estimate the posterior distribution of the tree-based association statistic and the recombination rate at multiple focal points in the region. Reassuringly, the posterior mean recombination rates estimated at the different focal points are consistent with previously published estimates. The tree-based association approach finds multiple sub-regions where the case haplotypes are more genetically related than the control haplotypes, and that there may be one or multiple disease-predisposing loci. Frontiers Media S.A. 2013-12-02 /pmc/articles/PMC3845011/ /pubmed/24348515 http://dx.doi.org/10.3389/fgene.2013.00260 Text en Copyright © 2013 Burkett, Greenwood, McNeney and Graham. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Burkett, Kelly M.
Greenwood, Celia M. T.
McNeney, Brad
Graham, Jinko
Gene genealogies for genetic association mapping, with application to Crohn's disease
title Gene genealogies for genetic association mapping, with application to Crohn's disease
title_full Gene genealogies for genetic association mapping, with application to Crohn's disease
title_fullStr Gene genealogies for genetic association mapping, with application to Crohn's disease
title_full_unstemmed Gene genealogies for genetic association mapping, with application to Crohn's disease
title_short Gene genealogies for genetic association mapping, with application to Crohn's disease
title_sort gene genealogies for genetic association mapping, with application to crohn's disease
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3845011/
https://www.ncbi.nlm.nih.gov/pubmed/24348515
http://dx.doi.org/10.3389/fgene.2013.00260
work_keys_str_mv AT burkettkellym genegenealogiesforgeneticassociationmappingwithapplicationtocrohnsdisease
AT greenwoodceliamt genegenealogiesforgeneticassociationmappingwithapplicationtocrohnsdisease
AT mcneneybrad genegenealogiesforgeneticassociationmappingwithapplicationtocrohnsdisease
AT grahamjinko genegenealogiesforgeneticassociationmappingwithapplicationtocrohnsdisease