Cargando…

Global haplotype partitioning for maximal associated SNP pairs

BACKGROUND: Global partitioning based on pairwise associations of SNPs has not previously been used to define haplotype blocks within genomes. Here, we define an association index based on LD between SNP pairs. We use the Fisher's exact test to assess the statistical significance of the LD esti...

Descripción completa

Detalles Bibliográficos
Autores principales: Katanforoush, Ali, Sadeghi, Mehdi, Pezeshk, Hamid, Elahi, Elahe
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2749056/
https://www.ncbi.nlm.nih.gov/pubmed/19712447
http://dx.doi.org/10.1186/1471-2105-10-269
_version_ 1782172161651245056
author Katanforoush, Ali
Sadeghi, Mehdi
Pezeshk, Hamid
Elahi, Elahe
author_facet Katanforoush, Ali
Sadeghi, Mehdi
Pezeshk, Hamid
Elahi, Elahe
author_sort Katanforoush, Ali
collection PubMed
description BACKGROUND: Global partitioning based on pairwise associations of SNPs has not previously been used to define haplotype blocks within genomes. Here, we define an association index based on LD between SNP pairs. We use the Fisher's exact test to assess the statistical significance of the LD estimator. By this test, each SNP pair is characterized as associated, independent, or not-statistically-significant. We set limits on the maximum acceptable proportion of independent pairs within all blocks and search for the partitioning with maximal proportion of associated SNP pairs. Essentially, this model is reduced to a constrained optimization problem, the solution of which is obtained by iterating a dynamic programming algorithm. RESULTS: In comparison with other methods, our algorithm reports blocks of larger average size. Nevertheless, the haplotype diversity within the blocks is captured by a small number of tagSNPs. Resampling HapMap haplotypes under a block-based model of recombination showed that our algorithm is robust in reproducing the same partitioning for recombinant samples. Our algorithm performed better than previously reported models in a case-control association study aimed at mapping a single locus trait, based on simulation results that were evaluated by a block-based statistical test. Compared to methods of haplotype block partitioning, we performed best on detection of recombination hotspots. CONCLUSION: Our proposed method divides chromosomes into the regions within which allelic associations of SNP pairs are maximized. This approach presents a native design for dimension reduction in genome-wide association studies. Our results show that the pairwise allelic association of SNPs can describe various features of genomic variation, in particular recombination hotspots.
format Text
id pubmed-2749056
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27490562009-09-23 Global haplotype partitioning for maximal associated SNP pairs Katanforoush, Ali Sadeghi, Mehdi Pezeshk, Hamid Elahi, Elahe BMC Bioinformatics Methodology Article BACKGROUND: Global partitioning based on pairwise associations of SNPs has not previously been used to define haplotype blocks within genomes. Here, we define an association index based on LD between SNP pairs. We use the Fisher's exact test to assess the statistical significance of the LD estimator. By this test, each SNP pair is characterized as associated, independent, or not-statistically-significant. We set limits on the maximum acceptable proportion of independent pairs within all blocks and search for the partitioning with maximal proportion of associated SNP pairs. Essentially, this model is reduced to a constrained optimization problem, the solution of which is obtained by iterating a dynamic programming algorithm. RESULTS: In comparison with other methods, our algorithm reports blocks of larger average size. Nevertheless, the haplotype diversity within the blocks is captured by a small number of tagSNPs. Resampling HapMap haplotypes under a block-based model of recombination showed that our algorithm is robust in reproducing the same partitioning for recombinant samples. Our algorithm performed better than previously reported models in a case-control association study aimed at mapping a single locus trait, based on simulation results that were evaluated by a block-based statistical test. Compared to methods of haplotype block partitioning, we performed best on detection of recombination hotspots. CONCLUSION: Our proposed method divides chromosomes into the regions within which allelic associations of SNP pairs are maximized. This approach presents a native design for dimension reduction in genome-wide association studies. Our results show that the pairwise allelic association of SNPs can describe various features of genomic variation, in particular recombination hotspots. BioMed Central 2009-08-27 /pmc/articles/PMC2749056/ /pubmed/19712447 http://dx.doi.org/10.1186/1471-2105-10-269 Text en Copyright © 2009 Katanforoush et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Katanforoush, Ali
Sadeghi, Mehdi
Pezeshk, Hamid
Elahi, Elahe
Global haplotype partitioning for maximal associated SNP pairs
title Global haplotype partitioning for maximal associated SNP pairs
title_full Global haplotype partitioning for maximal associated SNP pairs
title_fullStr Global haplotype partitioning for maximal associated SNP pairs
title_full_unstemmed Global haplotype partitioning for maximal associated SNP pairs
title_short Global haplotype partitioning for maximal associated SNP pairs
title_sort global haplotype partitioning for maximal associated snp pairs
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2749056/
https://www.ncbi.nlm.nih.gov/pubmed/19712447
http://dx.doi.org/10.1186/1471-2105-10-269
work_keys_str_mv AT katanforoushali globalhaplotypepartitioningformaximalassociatedsnppairs
AT sadeghimehdi globalhaplotypepartitioningformaximalassociatedsnppairs
AT pezeshkhamid globalhaplotypepartitioningformaximalassociatedsnppairs
AT elahielahe globalhaplotypepartitioningformaximalassociatedsnppairs