Cargando…

Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'CubeX'

BACKGROUND: The frequency of a haplotype comprising one allele at each of two loci can be expressed as a cubic equation (the 'Hill equation'), the solution of which gives that frequency. Most haplotype and linkage disequilibrium analysis programs use iteration-based algorithms which substi...

Descripción completa

Detalles Bibliográficos
Autores principales: Gaunt, Tom R, Rodríguez, Santiago, Day, Ian NM
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2180187/
https://www.ncbi.nlm.nih.gov/pubmed/17980034
http://dx.doi.org/10.1186/1471-2105-8-428
_version_ 1782145494175186944
author Gaunt, Tom R
Rodríguez, Santiago
Day, Ian NM
author_facet Gaunt, Tom R
Rodríguez, Santiago
Day, Ian NM
author_sort Gaunt, Tom R
collection PubMed
description BACKGROUND: The frequency of a haplotype comprising one allele at each of two loci can be expressed as a cubic equation (the 'Hill equation'), the solution of which gives that frequency. Most haplotype and linkage disequilibrium analysis programs use iteration-based algorithms which substitute an estimate of haplotype frequency into the equation, producing a new estimate which is repeatedly fed back into the equation until the values converge to a maximum likelihood estimate (expectation-maximisation). RESULTS: We present a program, "CubeX", which calculates the biologically possible exact solution(s) and provides estimated haplotype frequencies, D', r(2 )and χ(2 )values for each. CubeX provides a "complete" analysis of haplotype frequencies and linkage disequilibrium for a pair of biallelic markers under situations where sampling variation and genotyping errors distort sample Hardy-Weinberg equilibrium, potentially causing more than one biologically possible solution. We also present an analysis of simulations and real data using the algebraically exact solution, which indicates that under perfect sample Hardy-Weinberg equilibrium there is only one biologically possible solution, but that under other conditions there may be more. CONCLUSION: Our analyses demonstrate that lower allele frequencies, lower sample numbers, population stratification and a possible |D'| value of 1 are particularly susceptible to distortion of sample Hardy-Weinberg equilibrium, which has significant implications for calculation of linkage disequilibrium in small sample sizes (eg HapMap) and rarer alleles (eg paucimorphisms, q < 0.05) that may have particular disease relevance and require improved approaches for meaningful evaluation.
format Text
id pubmed-2180187
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-21801872008-01-09 Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'CubeX' Gaunt, Tom R Rodríguez, Santiago Day, Ian NM BMC Bioinformatics Software BACKGROUND: The frequency of a haplotype comprising one allele at each of two loci can be expressed as a cubic equation (the 'Hill equation'), the solution of which gives that frequency. Most haplotype and linkage disequilibrium analysis programs use iteration-based algorithms which substitute an estimate of haplotype frequency into the equation, producing a new estimate which is repeatedly fed back into the equation until the values converge to a maximum likelihood estimate (expectation-maximisation). RESULTS: We present a program, "CubeX", which calculates the biologically possible exact solution(s) and provides estimated haplotype frequencies, D', r(2 )and χ(2 )values for each. CubeX provides a "complete" analysis of haplotype frequencies and linkage disequilibrium for a pair of biallelic markers under situations where sampling variation and genotyping errors distort sample Hardy-Weinberg equilibrium, potentially causing more than one biologically possible solution. We also present an analysis of simulations and real data using the algebraically exact solution, which indicates that under perfect sample Hardy-Weinberg equilibrium there is only one biologically possible solution, but that under other conditions there may be more. CONCLUSION: Our analyses demonstrate that lower allele frequencies, lower sample numbers, population stratification and a possible |D'| value of 1 are particularly susceptible to distortion of sample Hardy-Weinberg equilibrium, which has significant implications for calculation of linkage disequilibrium in small sample sizes (eg HapMap) and rarer alleles (eg paucimorphisms, q < 0.05) that may have particular disease relevance and require improved approaches for meaningful evaluation. BioMed Central 2007-11-02 /pmc/articles/PMC2180187/ /pubmed/17980034 http://dx.doi.org/10.1186/1471-2105-8-428 Text en Copyright © 2007 Gaunt et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Gaunt, Tom R
Rodríguez, Santiago
Day, Ian NM
Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'CubeX'
title Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'CubeX'
title_full Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'CubeX'
title_fullStr Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'CubeX'
title_full_unstemmed Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'CubeX'
title_short Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'CubeX'
title_sort cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'cubex'
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2180187/
https://www.ncbi.nlm.nih.gov/pubmed/17980034
http://dx.doi.org/10.1186/1471-2105-8-428
work_keys_str_mv AT gaunttomr cubicexactsolutionsfortheestimationofpairwisehaplotypefrequenciesimplicationsforlinkagedisequilibriumanalysesandawebtoolcubex
AT rodriguezsantiago cubicexactsolutionsfortheestimationofpairwisehaplotypefrequenciesimplicationsforlinkagedisequilibriumanalysesandawebtoolcubex
AT dayiannm cubicexactsolutionsfortheestimationofpairwisehaplotypefrequenciesimplicationsforlinkagedisequilibriumanalysesandawebtoolcubex