Cargando…

High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references

Statistical imputation of classical human leukocyte antigen (HLA) alleles is becoming an indispensable tool for fine-mappings of disease association signals from case–control genome-wide association studies. However, most currently available HLA imputation tools are based on European reference popul...

Descripción completa

Detalles Bibliográficos
Autores principales: Khor, S-S, Yang, W, Kawashima, M, Kamitsuji, S, Zheng, X, Nishida, N, Sawai, H, Toyoda, H, Miyagawa, T, Honda, M, Kamatani, N, Tokunaga, K
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4762906/
https://www.ncbi.nlm.nih.gov/pubmed/25707395
http://dx.doi.org/10.1038/tpj.2015.4
_version_ 1782417161460908032
author Khor, S-S
Yang, W
Kawashima, M
Kamitsuji, S
Zheng, X
Nishida, N
Sawai, H
Toyoda, H
Miyagawa, T
Honda, M
Kamatani, N
Tokunaga, K
author_facet Khor, S-S
Yang, W
Kawashima, M
Kamitsuji, S
Zheng, X
Nishida, N
Sawai, H
Toyoda, H
Miyagawa, T
Honda, M
Kamatani, N
Tokunaga, K
author_sort Khor, S-S
collection PubMed
description Statistical imputation of classical human leukocyte antigen (HLA) alleles is becoming an indispensable tool for fine-mappings of disease association signals from case–control genome-wide association studies. However, most currently available HLA imputation tools are based on European reference populations and are not suitable for direct application to non-European populations. Among the HLA imputation tools, The HIBAG R package is a flexible HLA imputation tool that is equipped with a wide range of population-based classifiers; moreover, HIBAG R enables individual researchers to build custom classifiers. Here, two data sets, each comprising data from healthy Japanese individuals of difference sample sizes, were used to build custom classifiers. HLA imputation accuracy in five HLA classes (HLA-A, HLA-B, HLA-DRB1, HLA-DQB1 and HLA-DPB1) increased from the 82.5–98.8% obtained with the original HIBAG references to 95.2–99.5% with our custom classifiers. A call threshold (CT) of 0.4 is recommended for our Japanese classifiers; in contrast, HIBAG references recommend a CT of 0.5. Finally, our classifiers could be used to identify the risk haplotypes for Japanese narcolepsy with cataplexy, HLA-DRB1*15:01 and HLA-DQB1*06:02, with 100% and 99.7% accuracy, respectively; therefore, these classifiers can be used to supplement the current lack of HLA genotyping data in widely available genome-wide association study data sets.
format Online
Article
Text
id pubmed-4762906
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-47629062016-03-07 High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references Khor, S-S Yang, W Kawashima, M Kamitsuji, S Zheng, X Nishida, N Sawai, H Toyoda, H Miyagawa, T Honda, M Kamatani, N Tokunaga, K Pharmacogenomics J Original Article Statistical imputation of classical human leukocyte antigen (HLA) alleles is becoming an indispensable tool for fine-mappings of disease association signals from case–control genome-wide association studies. However, most currently available HLA imputation tools are based on European reference populations and are not suitable for direct application to non-European populations. Among the HLA imputation tools, The HIBAG R package is a flexible HLA imputation tool that is equipped with a wide range of population-based classifiers; moreover, HIBAG R enables individual researchers to build custom classifiers. Here, two data sets, each comprising data from healthy Japanese individuals of difference sample sizes, were used to build custom classifiers. HLA imputation accuracy in five HLA classes (HLA-A, HLA-B, HLA-DRB1, HLA-DQB1 and HLA-DPB1) increased from the 82.5–98.8% obtained with the original HIBAG references to 95.2–99.5% with our custom classifiers. A call threshold (CT) of 0.4 is recommended for our Japanese classifiers; in contrast, HIBAG references recommend a CT of 0.5. Finally, our classifiers could be used to identify the risk haplotypes for Japanese narcolepsy with cataplexy, HLA-DRB1*15:01 and HLA-DQB1*06:02, with 100% and 99.7% accuracy, respectively; therefore, these classifiers can be used to supplement the current lack of HLA genotyping data in widely available genome-wide association study data sets. Nature Publishing Group 2015-12 2015-02-24 /pmc/articles/PMC4762906/ /pubmed/25707395 http://dx.doi.org/10.1038/tpj.2015.4 Text en Copyright © 2015 Macmillan Publishers Limited http://creativecommons.org/licenses/by-nc-nd/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/
spellingShingle Original Article
Khor, S-S
Yang, W
Kawashima, M
Kamitsuji, S
Zheng, X
Nishida, N
Sawai, H
Toyoda, H
Miyagawa, T
Honda, M
Kamatani, N
Tokunaga, K
High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references
title High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references
title_full High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references
title_fullStr High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references
title_full_unstemmed High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references
title_short High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references
title_sort high-accuracy imputation for hla class i and ii genes based on high-resolution snp data of population-specific references
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4762906/
https://www.ncbi.nlm.nih.gov/pubmed/25707395
http://dx.doi.org/10.1038/tpj.2015.4
work_keys_str_mv AT khorss highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences
AT yangw highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences
AT kawashimam highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences
AT kamitsujis highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences
AT zhengx highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences
AT nishidan highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences
AT sawaih highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences
AT toyodah highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences
AT miyagawat highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences
AT hondam highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences
AT kamatanin highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences
AT tokunagak highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences