Cargando…

HIBAG—HLA genotype imputation with attribute bagging

Genotyping of classical human leukocyte antigen (HLA) alleles is an essential tool in the analysis of diseases and adverse drug reactions with associations mapping to the major histocompatibility complex (MHC). However, deriving high-resolution HLA types subsequent to whole-genome single-nucleotide...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, X, Shen, J, Cox, C, Wakefield, J C, Ehm, M G, Nelson, M R, Weir, B S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3772955/
https://www.ncbi.nlm.nih.gov/pubmed/23712092
http://dx.doi.org/10.1038/tpj.2013.18
_version_ 1782284393401810944
author Zheng, X
Shen, J
Cox, C
Wakefield, J C
Ehm, M G
Nelson, M R
Weir, B S
author_facet Zheng, X
Shen, J
Cox, C
Wakefield, J C
Ehm, M G
Nelson, M R
Weir, B S
author_sort Zheng, X
collection PubMed
description Genotyping of classical human leukocyte antigen (HLA) alleles is an essential tool in the analysis of diseases and adverse drug reactions with associations mapping to the major histocompatibility complex (MHC). However, deriving high-resolution HLA types subsequent to whole-genome single-nucleotide polymorphism (SNP) typing or sequencing is often cost prohibitive for large samples. An alternative approach takes advantage of the extended haplotype structure within the MHC to predict HLA alleles using dense SNP genotypes, such as those available from genome-wide SNP panels. Current methods for HLA imputation are difficult to apply or may require the user to have access to large training data sets with SNP and HLA types. We propose HIBAG, HLA Imputation using attribute BAGging, that makes predictions by averaging HLA-type posterior probabilities over an ensemble of classifiers built on bootstrap samples. We assess the performance of HIBAG using our study data (n=2668 subjects of European ancestry) as a training set and HLA data from the British 1958 birth cohort study (n≈1000 subjects) as independent validation samples. Prediction accuracies for HLA-A, B, C, DRB1 and DQB1 range from 92.2% to 98.1% using a set of SNP markers common to the Illumina 1M Duo, OmniQuad, OmniExpress, 660K and 550K platforms. HIBAG performed well compared with the other two leading methods, HLA*IMP and BEAGLE. This method is implemented in a freely available HIBAG R package that includes pre-fit classifiers for European, Asian, Hispanic and African ancestries, providing a readily available imputation approach without the need to have access to large training data sets.
format Online
Article
Text
id pubmed-3772955
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-37729552014-04-24 HIBAG—HLA genotype imputation with attribute bagging Zheng, X Shen, J Cox, C Wakefield, J C Ehm, M G Nelson, M R Weir, B S Pharmacogenomics J Original Article Genotyping of classical human leukocyte antigen (HLA) alleles is an essential tool in the analysis of diseases and adverse drug reactions with associations mapping to the major histocompatibility complex (MHC). However, deriving high-resolution HLA types subsequent to whole-genome single-nucleotide polymorphism (SNP) typing or sequencing is often cost prohibitive for large samples. An alternative approach takes advantage of the extended haplotype structure within the MHC to predict HLA alleles using dense SNP genotypes, such as those available from genome-wide SNP panels. Current methods for HLA imputation are difficult to apply or may require the user to have access to large training data sets with SNP and HLA types. We propose HIBAG, HLA Imputation using attribute BAGging, that makes predictions by averaging HLA-type posterior probabilities over an ensemble of classifiers built on bootstrap samples. We assess the performance of HIBAG using our study data (n=2668 subjects of European ancestry) as a training set and HLA data from the British 1958 birth cohort study (n≈1000 subjects) as independent validation samples. Prediction accuracies for HLA-A, B, C, DRB1 and DQB1 range from 92.2% to 98.1% using a set of SNP markers common to the Illumina 1M Duo, OmniQuad, OmniExpress, 660K and 550K platforms. HIBAG performed well compared with the other two leading methods, HLA*IMP and BEAGLE. This method is implemented in a freely available HIBAG R package that includes pre-fit classifiers for European, Asian, Hispanic and African ancestries, providing a readily available imputation approach without the need to have access to large training data sets. Nature Publishing Group 2014-04 2013-05-28 /pmc/articles/PMC3772955/ /pubmed/23712092 http://dx.doi.org/10.1038/tpj.2013.18 Text en Copyright © 2014 Macmillan Publishers Limited http://creativecommons.org/licenses/by/3.0/ This work is licensed under a Creative Commons Attribution 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by/3.0/
spellingShingle Original Article
Zheng, X
Shen, J
Cox, C
Wakefield, J C
Ehm, M G
Nelson, M R
Weir, B S
HIBAG—HLA genotype imputation with attribute bagging
title HIBAG—HLA genotype imputation with attribute bagging
title_full HIBAG—HLA genotype imputation with attribute bagging
title_fullStr HIBAG—HLA genotype imputation with attribute bagging
title_full_unstemmed HIBAG—HLA genotype imputation with attribute bagging
title_short HIBAG—HLA genotype imputation with attribute bagging
title_sort hibag—hla genotype imputation with attribute bagging
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3772955/
https://www.ncbi.nlm.nih.gov/pubmed/23712092
http://dx.doi.org/10.1038/tpj.2013.18
work_keys_str_mv AT zhengx hibaghlagenotypeimputationwithattributebagging
AT shenj hibaghlagenotypeimputationwithattributebagging
AT coxc hibaghlagenotypeimputationwithattributebagging
AT wakefieldjc hibaghlagenotypeimputationwithattributebagging
AT ehmmg hibaghlagenotypeimputationwithattributebagging
AT nelsonmr hibaghlagenotypeimputationwithattributebagging
AT weirbs hibaghlagenotypeimputationwithattributebagging