Cargando…
High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references
Statistical imputation of classical human leukocyte antigen (HLA) alleles is becoming an indispensable tool for fine-mappings of disease association signals from case–control genome-wide association studies. However, most currently available HLA imputation tools are based on European reference popul...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4762906/ https://www.ncbi.nlm.nih.gov/pubmed/25707395 http://dx.doi.org/10.1038/tpj.2015.4 |
_version_ | 1782417161460908032 |
---|---|
author | Khor, S-S Yang, W Kawashima, M Kamitsuji, S Zheng, X Nishida, N Sawai, H Toyoda, H Miyagawa, T Honda, M Kamatani, N Tokunaga, K |
author_facet | Khor, S-S Yang, W Kawashima, M Kamitsuji, S Zheng, X Nishida, N Sawai, H Toyoda, H Miyagawa, T Honda, M Kamatani, N Tokunaga, K |
author_sort | Khor, S-S |
collection | PubMed |
description | Statistical imputation of classical human leukocyte antigen (HLA) alleles is becoming an indispensable tool for fine-mappings of disease association signals from case–control genome-wide association studies. However, most currently available HLA imputation tools are based on European reference populations and are not suitable for direct application to non-European populations. Among the HLA imputation tools, The HIBAG R package is a flexible HLA imputation tool that is equipped with a wide range of population-based classifiers; moreover, HIBAG R enables individual researchers to build custom classifiers. Here, two data sets, each comprising data from healthy Japanese individuals of difference sample sizes, were used to build custom classifiers. HLA imputation accuracy in five HLA classes (HLA-A, HLA-B, HLA-DRB1, HLA-DQB1 and HLA-DPB1) increased from the 82.5–98.8% obtained with the original HIBAG references to 95.2–99.5% with our custom classifiers. A call threshold (CT) of 0.4 is recommended for our Japanese classifiers; in contrast, HIBAG references recommend a CT of 0.5. Finally, our classifiers could be used to identify the risk haplotypes for Japanese narcolepsy with cataplexy, HLA-DRB1*15:01 and HLA-DQB1*06:02, with 100% and 99.7% accuracy, respectively; therefore, these classifiers can be used to supplement the current lack of HLA genotyping data in widely available genome-wide association study data sets. |
format | Online Article Text |
id | pubmed-4762906 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-47629062016-03-07 High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references Khor, S-S Yang, W Kawashima, M Kamitsuji, S Zheng, X Nishida, N Sawai, H Toyoda, H Miyagawa, T Honda, M Kamatani, N Tokunaga, K Pharmacogenomics J Original Article Statistical imputation of classical human leukocyte antigen (HLA) alleles is becoming an indispensable tool for fine-mappings of disease association signals from case–control genome-wide association studies. However, most currently available HLA imputation tools are based on European reference populations and are not suitable for direct application to non-European populations. Among the HLA imputation tools, The HIBAG R package is a flexible HLA imputation tool that is equipped with a wide range of population-based classifiers; moreover, HIBAG R enables individual researchers to build custom classifiers. Here, two data sets, each comprising data from healthy Japanese individuals of difference sample sizes, were used to build custom classifiers. HLA imputation accuracy in five HLA classes (HLA-A, HLA-B, HLA-DRB1, HLA-DQB1 and HLA-DPB1) increased from the 82.5–98.8% obtained with the original HIBAG references to 95.2–99.5% with our custom classifiers. A call threshold (CT) of 0.4 is recommended for our Japanese classifiers; in contrast, HIBAG references recommend a CT of 0.5. Finally, our classifiers could be used to identify the risk haplotypes for Japanese narcolepsy with cataplexy, HLA-DRB1*15:01 and HLA-DQB1*06:02, with 100% and 99.7% accuracy, respectively; therefore, these classifiers can be used to supplement the current lack of HLA genotyping data in widely available genome-wide association study data sets. Nature Publishing Group 2015-12 2015-02-24 /pmc/articles/PMC4762906/ /pubmed/25707395 http://dx.doi.org/10.1038/tpj.2015.4 Text en Copyright © 2015 Macmillan Publishers Limited http://creativecommons.org/licenses/by-nc-nd/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/ |
spellingShingle | Original Article Khor, S-S Yang, W Kawashima, M Kamitsuji, S Zheng, X Nishida, N Sawai, H Toyoda, H Miyagawa, T Honda, M Kamatani, N Tokunaga, K High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references |
title | High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references |
title_full | High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references |
title_fullStr | High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references |
title_full_unstemmed | High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references |
title_short | High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references |
title_sort | high-accuracy imputation for hla class i and ii genes based on high-resolution snp data of population-specific references |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4762906/ https://www.ncbi.nlm.nih.gov/pubmed/25707395 http://dx.doi.org/10.1038/tpj.2015.4 |
work_keys_str_mv | AT khorss highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences AT yangw highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences AT kawashimam highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences AT kamitsujis highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences AT zhengx highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences AT nishidan highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences AT sawaih highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences AT toyodah highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences AT miyagawat highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences AT hondam highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences AT kamatanin highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences AT tokunagak highaccuracyimputationforhlaclassiandiigenesbasedonhighresolutionsnpdataofpopulationspecificreferences |