Cargando…

Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens

DNA sequence variation within human leukocyte antigen (HLA) genes mediate susceptibility to a wide range of human diseases. The complex genetic structure of the major histocompatibility complex (MHC) makes it difficult, however, to collect genotyping data in large cohorts. Long-range linkage disequi...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Xiaoming, Han, Buhm, Onengut-Gumuscu, Suna, Chen, Wei-Min, Concannon, Patrick J., Rich, Stephen S., Raychaudhuri, Soumya, de Bakker, Paul I.W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3675122/
https://www.ncbi.nlm.nih.gov/pubmed/23762245
http://dx.doi.org/10.1371/journal.pone.0064683
_version_ 1782272480276119552
author Jia, Xiaoming
Han, Buhm
Onengut-Gumuscu, Suna
Chen, Wei-Min
Concannon, Patrick J.
Rich, Stephen S.
Raychaudhuri, Soumya
de Bakker, Paul I.W.
author_facet Jia, Xiaoming
Han, Buhm
Onengut-Gumuscu, Suna
Chen, Wei-Min
Concannon, Patrick J.
Rich, Stephen S.
Raychaudhuri, Soumya
de Bakker, Paul I.W.
author_sort Jia, Xiaoming
collection PubMed
description DNA sequence variation within human leukocyte antigen (HLA) genes mediate susceptibility to a wide range of human diseases. The complex genetic structure of the major histocompatibility complex (MHC) makes it difficult, however, to collect genotyping data in large cohorts. Long-range linkage disequilibrium between HLA loci and SNP markers across the major histocompatibility complex (MHC) region offers an alternative approach through imputation to interrogate HLA variation in existing GWAS data sets. Here we describe a computational strategy, SNP2HLA, to impute classical alleles and amino acid polymorphisms at class I (HLA-A, -B, -C) and class II (-DPA1, -DPB1, -DQA1, -DQB1, and -DRB1) loci. To characterize performance of SNP2HLA, we constructed two European ancestry reference panels, one based on data collected in HapMap-CEPH pedigrees (90 individuals) and another based on data collected by the Type 1 Diabetes Genetics Consortium (T1DGC, 5,225 individuals). We imputed HLA alleles in an independent data set from the British 1958 Birth Cohort (N = 918) with gold standard four-digit HLA types and SNPs genotyped using the Affymetrix GeneChip 500 K and Illumina Immunochip microarrays. We demonstrate that the sample size of the reference panel, rather than SNP density of the genotyping platform, is critical to achieve high imputation accuracy. Using the larger T1DGC reference panel, the average accuracy at four-digit resolution is 94.7% using the low-density Affymetrix GeneChip 500 K, and 96.7% using the high-density Illumina Immunochip. For amino acid polymorphisms within HLA genes, we achieve 98.6% and 99.3% accuracy using the Affymetrix GeneChip 500 K and Illumina Immunochip, respectively. Finally, we demonstrate how imputation and association testing at amino acid resolution can facilitate fine-mapping of primary MHC association signals, giving a specific example from type 1 diabetes.
format Online
Article
Text
id pubmed-3675122
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36751222013-06-12 Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens Jia, Xiaoming Han, Buhm Onengut-Gumuscu, Suna Chen, Wei-Min Concannon, Patrick J. Rich, Stephen S. Raychaudhuri, Soumya de Bakker, Paul I.W. PLoS One Research Article DNA sequence variation within human leukocyte antigen (HLA) genes mediate susceptibility to a wide range of human diseases. The complex genetic structure of the major histocompatibility complex (MHC) makes it difficult, however, to collect genotyping data in large cohorts. Long-range linkage disequilibrium between HLA loci and SNP markers across the major histocompatibility complex (MHC) region offers an alternative approach through imputation to interrogate HLA variation in existing GWAS data sets. Here we describe a computational strategy, SNP2HLA, to impute classical alleles and amino acid polymorphisms at class I (HLA-A, -B, -C) and class II (-DPA1, -DPB1, -DQA1, -DQB1, and -DRB1) loci. To characterize performance of SNP2HLA, we constructed two European ancestry reference panels, one based on data collected in HapMap-CEPH pedigrees (90 individuals) and another based on data collected by the Type 1 Diabetes Genetics Consortium (T1DGC, 5,225 individuals). We imputed HLA alleles in an independent data set from the British 1958 Birth Cohort (N = 918) with gold standard four-digit HLA types and SNPs genotyped using the Affymetrix GeneChip 500 K and Illumina Immunochip microarrays. We demonstrate that the sample size of the reference panel, rather than SNP density of the genotyping platform, is critical to achieve high imputation accuracy. Using the larger T1DGC reference panel, the average accuracy at four-digit resolution is 94.7% using the low-density Affymetrix GeneChip 500 K, and 96.7% using the high-density Illumina Immunochip. For amino acid polymorphisms within HLA genes, we achieve 98.6% and 99.3% accuracy using the Affymetrix GeneChip 500 K and Illumina Immunochip, respectively. Finally, we demonstrate how imputation and association testing at amino acid resolution can facilitate fine-mapping of primary MHC association signals, giving a specific example from type 1 diabetes. Public Library of Science 2013-06-06 /pmc/articles/PMC3675122/ /pubmed/23762245 http://dx.doi.org/10.1371/journal.pone.0064683 Text en © 2013 Jia et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Jia, Xiaoming
Han, Buhm
Onengut-Gumuscu, Suna
Chen, Wei-Min
Concannon, Patrick J.
Rich, Stephen S.
Raychaudhuri, Soumya
de Bakker, Paul I.W.
Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens
title Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens
title_full Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens
title_fullStr Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens
title_full_unstemmed Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens
title_short Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens
title_sort imputing amino acid polymorphisms in human leukocyte antigens
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3675122/
https://www.ncbi.nlm.nih.gov/pubmed/23762245
http://dx.doi.org/10.1371/journal.pone.0064683
work_keys_str_mv AT jiaxiaoming imputingaminoacidpolymorphismsinhumanleukocyteantigens
AT hanbuhm imputingaminoacidpolymorphismsinhumanleukocyteantigens
AT onengutgumuscusuna imputingaminoacidpolymorphismsinhumanleukocyteantigens
AT chenweimin imputingaminoacidpolymorphismsinhumanleukocyteantigens
AT concannonpatrickj imputingaminoacidpolymorphismsinhumanleukocyteantigens
AT richstephens imputingaminoacidpolymorphismsinhumanleukocyteantigens
AT raychaudhurisoumya imputingaminoacidpolymorphismsinhumanleukocyteantigens
AT debakkerpauliw imputingaminoacidpolymorphismsinhumanleukocyteantigens