Cargando…

HLA-check: evaluating HLA data from SNP information

BACKGROUND: The major histocompatibility complex (MHC) region of the human genome, and specifically the human leukocyte antigen (HLA) genes, play a major role in numerous human diseases. With the recent progress of sequencing methods (eg, Next-Generation Sequencing, NGS), the accurate genotyping of...

Descripción completa

Detalles Bibliográficos
Autores principales: Jeanmougin, Marc, Noirel, Josselin, Coulonges, Cédric, Zagury, Jean-François
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5504728/
https://www.ncbi.nlm.nih.gov/pubmed/28697761
http://dx.doi.org/10.1186/s12859-017-1746-1
_version_ 1783249333296037888
author Jeanmougin, Marc
Noirel, Josselin
Coulonges, Cédric
Zagury, Jean-François
author_facet Jeanmougin, Marc
Noirel, Josselin
Coulonges, Cédric
Zagury, Jean-François
author_sort Jeanmougin, Marc
collection PubMed
description BACKGROUND: The major histocompatibility complex (MHC) region of the human genome, and specifically the human leukocyte antigen (HLA) genes, play a major role in numerous human diseases. With the recent progress of sequencing methods (eg, Next-Generation Sequencing, NGS), the accurate genotyping of this region has become possible but remains relatively costly. In order to obtain the HLA information for the millions of samples already genotyped by chips in the past ten years, efficient bioinformatics tools, such as SNP2HLA or HIBAG, have been developed that infer HLA information from the linkage disequilibrium existing between HLA alleles and SNP markers in the MHC region. RESULTS: In this study, we first used ShapeIT and Impute2 to implement an imputation method akin to SNP2HLA and found a comparable quality of imputation on a European dataset. More importantly, we developed a new tool, HLA-check, that allows for the detection of aberrant HLA allele calling with regard to the SNP genotypes in the region. Adding this tool to the HLA imputation software increases dramatically their accuracy, especially for HLA class I genes. CONCLUSION: Overall, HLA-check was able to identify a limited number of implausible HLA typings (less than 10%) in a population, and these samples can then either be removed or be retyped by NGS for HLA association analysis.
format Online
Article
Text
id pubmed-5504728
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55047282017-07-12 HLA-check: evaluating HLA data from SNP information Jeanmougin, Marc Noirel, Josselin Coulonges, Cédric Zagury, Jean-François BMC Bioinformatics Software BACKGROUND: The major histocompatibility complex (MHC) region of the human genome, and specifically the human leukocyte antigen (HLA) genes, play a major role in numerous human diseases. With the recent progress of sequencing methods (eg, Next-Generation Sequencing, NGS), the accurate genotyping of this region has become possible but remains relatively costly. In order to obtain the HLA information for the millions of samples already genotyped by chips in the past ten years, efficient bioinformatics tools, such as SNP2HLA or HIBAG, have been developed that infer HLA information from the linkage disequilibrium existing between HLA alleles and SNP markers in the MHC region. RESULTS: In this study, we first used ShapeIT and Impute2 to implement an imputation method akin to SNP2HLA and found a comparable quality of imputation on a European dataset. More importantly, we developed a new tool, HLA-check, that allows for the detection of aberrant HLA allele calling with regard to the SNP genotypes in the region. Adding this tool to the HLA imputation software increases dramatically their accuracy, especially for HLA class I genes. CONCLUSION: Overall, HLA-check was able to identify a limited number of implausible HLA typings (less than 10%) in a population, and these samples can then either be removed or be retyped by NGS for HLA association analysis. BioMed Central 2017-07-11 /pmc/articles/PMC5504728/ /pubmed/28697761 http://dx.doi.org/10.1186/s12859-017-1746-1 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Jeanmougin, Marc
Noirel, Josselin
Coulonges, Cédric
Zagury, Jean-François
HLA-check: evaluating HLA data from SNP information
title HLA-check: evaluating HLA data from SNP information
title_full HLA-check: evaluating HLA data from SNP information
title_fullStr HLA-check: evaluating HLA data from SNP information
title_full_unstemmed HLA-check: evaluating HLA data from SNP information
title_short HLA-check: evaluating HLA data from SNP information
title_sort hla-check: evaluating hla data from snp information
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5504728/
https://www.ncbi.nlm.nih.gov/pubmed/28697761
http://dx.doi.org/10.1186/s12859-017-1746-1
work_keys_str_mv AT jeanmouginmarc hlacheckevaluatinghladatafromsnpinformation
AT noireljosselin hlacheckevaluatinghladatafromsnpinformation
AT coulongescedric hlacheckevaluatinghladatafromsnpinformation
AT zaguryjeanfrancois hlacheckevaluatinghladatafromsnpinformation