Cargando…

Congruence as a measurement of extended haplotype structure across the genome

BACKGROUND: Historically, extended haplotypes have been defined using only a few data points, such as alleles for several HLA genes in the MHC. High-density SNP data, and the increasing affordability of whole genome SNP typing, creates the opportunity to define higher resolution extended haplotypes....

Descripción completa

Detalles Bibliográficos
Autores principales: Baschal, Erin E, Jasinski, Jean M, Boyle, Theresa A, Fain, Pamela R, Eisenbarth, George S, Siebert, Janet C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3310717/
https://www.ncbi.nlm.nih.gov/pubmed/22369243
http://dx.doi.org/10.1186/1479-5876-10-32
_version_ 1782227689378152448
author Baschal, Erin E
Jasinski, Jean M
Boyle, Theresa A
Fain, Pamela R
Eisenbarth, George S
Siebert, Janet C
author_facet Baschal, Erin E
Jasinski, Jean M
Boyle, Theresa A
Fain, Pamela R
Eisenbarth, George S
Siebert, Janet C
author_sort Baschal, Erin E
collection PubMed
description BACKGROUND: Historically, extended haplotypes have been defined using only a few data points, such as alleles for several HLA genes in the MHC. High-density SNP data, and the increasing affordability of whole genome SNP typing, creates the opportunity to define higher resolution extended haplotypes. This drives the need for new tools that support quantification and visualization of extended haplotypes as defined by as many as 2000 SNPs. Confronted with high-density SNP data across the major histocompatibility complex (MHC) for 2,300 complete families, compiled by the Type 1 Diabetes Genetics Consortium (T1DGC), we developed software for studying extended haplotypes. METHODS: The software, called ExHap (Extended Haplotype), uses a similarity measurement we term congruence to identify and quantify long-range allele identity. Using ExHap, we analyzed congruence in both the T1DGC data and family-phased data from the International HapMap Project. RESULTS: Congruent chromosomes from the T1DGC data have between 96.5% and 99.9% allele identity over 1,818 SNPs spanning 2.64 megabases of the MHC (HLA-DRB1 to HLA-A). Thirty-three of 132 DQ-DR-B-A defined haplotype groups have > 50% congruent chromosomes in this region. For example, 92% of chromosomes within the DR3-B8-A1 haplotype are congruent from HLA-DRB1 to HLA-A (99.8% allele identity). We also applied ExHap to all 22 autosomes for both CEU and YRI cohorts from the International HapMap Project, identifying multiple candidate extended haplotypes. CONCLUSIONS: Long-range congruence is not unique to the MHC region. Patterns of allele identity on phased chromosomes provide a simple, straightforward approach to visually and quantitatively inspect complex long-range structural patterns in the genome. Such patterns aid the biologist in appreciating genetic similarities and differences across cohorts, and can lead to hypothesis generation for subsequent studies.
format Online
Article
Text
id pubmed-3310717
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33107172012-03-23 Congruence as a measurement of extended haplotype structure across the genome Baschal, Erin E Jasinski, Jean M Boyle, Theresa A Fain, Pamela R Eisenbarth, George S Siebert, Janet C J Transl Med Methodology BACKGROUND: Historically, extended haplotypes have been defined using only a few data points, such as alleles for several HLA genes in the MHC. High-density SNP data, and the increasing affordability of whole genome SNP typing, creates the opportunity to define higher resolution extended haplotypes. This drives the need for new tools that support quantification and visualization of extended haplotypes as defined by as many as 2000 SNPs. Confronted with high-density SNP data across the major histocompatibility complex (MHC) for 2,300 complete families, compiled by the Type 1 Diabetes Genetics Consortium (T1DGC), we developed software for studying extended haplotypes. METHODS: The software, called ExHap (Extended Haplotype), uses a similarity measurement we term congruence to identify and quantify long-range allele identity. Using ExHap, we analyzed congruence in both the T1DGC data and family-phased data from the International HapMap Project. RESULTS: Congruent chromosomes from the T1DGC data have between 96.5% and 99.9% allele identity over 1,818 SNPs spanning 2.64 megabases of the MHC (HLA-DRB1 to HLA-A). Thirty-three of 132 DQ-DR-B-A defined haplotype groups have > 50% congruent chromosomes in this region. For example, 92% of chromosomes within the DR3-B8-A1 haplotype are congruent from HLA-DRB1 to HLA-A (99.8% allele identity). We also applied ExHap to all 22 autosomes for both CEU and YRI cohorts from the International HapMap Project, identifying multiple candidate extended haplotypes. CONCLUSIONS: Long-range congruence is not unique to the MHC region. Patterns of allele identity on phased chromosomes provide a simple, straightforward approach to visually and quantitatively inspect complex long-range structural patterns in the genome. Such patterns aid the biologist in appreciating genetic similarities and differences across cohorts, and can lead to hypothesis generation for subsequent studies. BioMed Central 2012-02-27 /pmc/articles/PMC3310717/ /pubmed/22369243 http://dx.doi.org/10.1186/1479-5876-10-32 Text en Copyright ©2012 Baschal et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology
Baschal, Erin E
Jasinski, Jean M
Boyle, Theresa A
Fain, Pamela R
Eisenbarth, George S
Siebert, Janet C
Congruence as a measurement of extended haplotype structure across the genome
title Congruence as a measurement of extended haplotype structure across the genome
title_full Congruence as a measurement of extended haplotype structure across the genome
title_fullStr Congruence as a measurement of extended haplotype structure across the genome
title_full_unstemmed Congruence as a measurement of extended haplotype structure across the genome
title_short Congruence as a measurement of extended haplotype structure across the genome
title_sort congruence as a measurement of extended haplotype structure across the genome
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3310717/
https://www.ncbi.nlm.nih.gov/pubmed/22369243
http://dx.doi.org/10.1186/1479-5876-10-32
work_keys_str_mv AT baschalerine congruenceasameasurementofextendedhaplotypestructureacrossthegenome
AT jasinskijeanm congruenceasameasurementofextendedhaplotypestructureacrossthegenome
AT boyletheresaa congruenceasameasurementofextendedhaplotypestructureacrossthegenome
AT fainpamelar congruenceasameasurementofextendedhaplotypestructureacrossthegenome
AT eisenbarthgeorges congruenceasameasurementofextendedhaplotypestructureacrossthegenome
AT siebertjanetc congruenceasameasurementofextendedhaplotypestructureacrossthegenome