Cargando…

Computation of haplotypes on SNPs subsets: advantage of the "global method"

BACKGROUND: Genetic association studies aim at finding correlations between a disease state and genetic variations such as SNPs or combinations of SNPs, termed haplotypes. Some haplotypes have a particular biological meaning such as the ones derived from SNPs located in the promoters, or the ones de...

Descripción completa

Detalles Bibliográficos
Autores principales: Coulonges, Cédric, Delaneau, Olivier, Girard, Manon, Do, Hervé, Adkins, Ronald, Spadoni, Jean-Louis, Zagury, Jean-François
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1636337/
https://www.ncbi.nlm.nih.gov/pubmed/17067372
http://dx.doi.org/10.1186/1471-2156-7-50
_version_ 1782130733623541760
author Coulonges, Cédric
Delaneau, Olivier
Girard, Manon
Do, Hervé
Adkins, Ronald
Spadoni, Jean-Louis
Zagury, Jean-François
author_facet Coulonges, Cédric
Delaneau, Olivier
Girard, Manon
Do, Hervé
Adkins, Ronald
Spadoni, Jean-Louis
Zagury, Jean-François
author_sort Coulonges, Cédric
collection PubMed
description BACKGROUND: Genetic association studies aim at finding correlations between a disease state and genetic variations such as SNPs or combinations of SNPs, termed haplotypes. Some haplotypes have a particular biological meaning such as the ones derived from SNPs located in the promoters, or the ones derived from non synonymous SNPs. All these haplotypes are "subhaplotypes" because they refer only to a part of the SNPs found in the gene. Until now, subhaplotypes were directly computed from the very SNPs chosen to constitute them, without taking into account the rest of the information corresponding to the other SNPs located in the gene. In the present work, we describe an alternative approach, called the "global method", which takes into account all the SNPs known in the region and compare the efficacy of the two "direct" and "global" methods. RESULTS: We used empirical haplotypes data sets from the GH1 promoter and the APOE gene, and 10 simulated datasets, and randomly introduced in them missing information (from 0% up to 20%) to compare the 2 methods. For each method, we used the PHASE haplotyping software since it was described to be the best. We showed that the use of the "global method" for subhaplotyping leads always to a better error rate than the classical direct haplotyping. The advantage provided by this alternative method increases with the percentage of missing genotyping data (diminution of the average error rate from 25% to less than 10%). We applied the global method software on the GRIV cohort for AIDS genetic associations and some associations previously identified through direct subhaplotyping were found to be erroneous. CONCLUSION: The global method for subhaplotyping can reduce, sometimes dramatically, the error rate on patient resolutions and haplotypes frequencies. One should thus use this method in order to minimise the risk of a false interpretation in genetic studies involving subhaplotypes. In practice the global method is always more efficient than the direct method, but a combination method taking into account the level of missing information in each subject appears to be even more interesting when the level of missing information becomes larger (>10%).
format Text
id pubmed-1636337
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-16363372006-11-15 Computation of haplotypes on SNPs subsets: advantage of the "global method" Coulonges, Cédric Delaneau, Olivier Girard, Manon Do, Hervé Adkins, Ronald Spadoni, Jean-Louis Zagury, Jean-François BMC Genet Research Article BACKGROUND: Genetic association studies aim at finding correlations between a disease state and genetic variations such as SNPs or combinations of SNPs, termed haplotypes. Some haplotypes have a particular biological meaning such as the ones derived from SNPs located in the promoters, or the ones derived from non synonymous SNPs. All these haplotypes are "subhaplotypes" because they refer only to a part of the SNPs found in the gene. Until now, subhaplotypes were directly computed from the very SNPs chosen to constitute them, without taking into account the rest of the information corresponding to the other SNPs located in the gene. In the present work, we describe an alternative approach, called the "global method", which takes into account all the SNPs known in the region and compare the efficacy of the two "direct" and "global" methods. RESULTS: We used empirical haplotypes data sets from the GH1 promoter and the APOE gene, and 10 simulated datasets, and randomly introduced in them missing information (from 0% up to 20%) to compare the 2 methods. For each method, we used the PHASE haplotyping software since it was described to be the best. We showed that the use of the "global method" for subhaplotyping leads always to a better error rate than the classical direct haplotyping. The advantage provided by this alternative method increases with the percentage of missing genotyping data (diminution of the average error rate from 25% to less than 10%). We applied the global method software on the GRIV cohort for AIDS genetic associations and some associations previously identified through direct subhaplotyping were found to be erroneous. CONCLUSION: The global method for subhaplotyping can reduce, sometimes dramatically, the error rate on patient resolutions and haplotypes frequencies. One should thus use this method in order to minimise the risk of a false interpretation in genetic studies involving subhaplotypes. In practice the global method is always more efficient than the direct method, but a combination method taking into account the level of missing information in each subject appears to be even more interesting when the level of missing information becomes larger (>10%). BioMed Central 2006-10-26 /pmc/articles/PMC1636337/ /pubmed/17067372 http://dx.doi.org/10.1186/1471-2156-7-50 Text en Copyright © 2006 Coulonges et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Coulonges, Cédric
Delaneau, Olivier
Girard, Manon
Do, Hervé
Adkins, Ronald
Spadoni, Jean-Louis
Zagury, Jean-François
Computation of haplotypes on SNPs subsets: advantage of the "global method"
title Computation of haplotypes on SNPs subsets: advantage of the "global method"
title_full Computation of haplotypes on SNPs subsets: advantage of the "global method"
title_fullStr Computation of haplotypes on SNPs subsets: advantage of the "global method"
title_full_unstemmed Computation of haplotypes on SNPs subsets: advantage of the "global method"
title_short Computation of haplotypes on SNPs subsets: advantage of the "global method"
title_sort computation of haplotypes on snps subsets: advantage of the "global method"
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1636337/
https://www.ncbi.nlm.nih.gov/pubmed/17067372
http://dx.doi.org/10.1186/1471-2156-7-50
work_keys_str_mv AT coulongescedric computationofhaplotypesonsnpssubsetsadvantageoftheglobalmethod
AT delaneauolivier computationofhaplotypesonsnpssubsetsadvantageoftheglobalmethod
AT girardmanon computationofhaplotypesonsnpssubsetsadvantageoftheglobalmethod
AT doherve computationofhaplotypesonsnpssubsetsadvantageoftheglobalmethod
AT adkinsronald computationofhaplotypesonsnpssubsetsadvantageoftheglobalmethod
AT spadonijeanlouis computationofhaplotypesonsnpssubsetsadvantageoftheglobalmethod
AT zaguryjeanfrancois computationofhaplotypesonsnpssubsetsadvantageoftheglobalmethod