Cargando…

HLAscan: genotyping of the HLA region using next-generation sequencing data

BACKGROUND: Several recent studies showed that next-generation sequencing (NGS)-based human leukocyte antigen (HLA) typing is a feasible and promising technique for variant calling of highly polymorphic regions. To date, however, no method with sufficient read depth has completely solved the allele...

Descripción completa

Detalles Bibliográficos
Autores principales: Ka, Sojeong, Lee, Sunho, Hong, Jonghee, Cho, Yangrae, Sung, Joohon, Kim, Han-Na, Kim, Hyung-Lae, Jung, Jongsun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5427585/
https://www.ncbi.nlm.nih.gov/pubmed/28499414
http://dx.doi.org/10.1186/s12859-017-1671-3
_version_ 1783235659398381568
author Ka, Sojeong
Lee, Sunho
Hong, Jonghee
Cho, Yangrae
Sung, Joohon
Kim, Han-Na
Kim, Hyung-Lae
Jung, Jongsun
author_facet Ka, Sojeong
Lee, Sunho
Hong, Jonghee
Cho, Yangrae
Sung, Joohon
Kim, Han-Na
Kim, Hyung-Lae
Jung, Jongsun
author_sort Ka, Sojeong
collection PubMed
description BACKGROUND: Several recent studies showed that next-generation sequencing (NGS)-based human leukocyte antigen (HLA) typing is a feasible and promising technique for variant calling of highly polymorphic regions. To date, however, no method with sufficient read depth has completely solved the allele phasing issue. In this study, we developed a new method (HLAscan) for HLA genotyping using NGS data. RESULTS: HLAscan performs alignment of reads to HLA sequences from the international ImMunoGeneTics project/human leukocyte antigen (IMGT/HLA) database. The distribution of aligned reads was used to calculate a score function to determine correctly phased alleles by progressively removing false-positive alleles. Comparative HLA typing tests using public datasets from the 1000 Genomes Project and the International HapMap Project demonstrated that HLAscan could perform HLA typing more accurately than previously reported NGS-based methods such as HLAreporter and PHLAT. In addition, the results of HLA-A, −B, and -DRB1 typing by HLAscan using data generated by NextGen were identical to those obtained using a Sanger sequencing–based method. We also applied HLAscan to a family dataset with various coverage depths generated on the Illumina HiSeq X-TEN platform. HLAscan identified allele types of HLA-A, −B, −C, −DQB1, and -DRB1 with 100% accuracy for sequences at ≥ 90× depth, and the overall accuracy was 96.9%. CONCLUSIONS: HLAscan, an alignment-based program that takes read distribution into account to determine true allele types, outperformed previously developed HLA typing tools. Therefore, HLAscan can be reliably applied for determination of HLA type across the whole-genome, exome, and target sequences. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1671-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5427585
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54275852017-05-15 HLAscan: genotyping of the HLA region using next-generation sequencing data Ka, Sojeong Lee, Sunho Hong, Jonghee Cho, Yangrae Sung, Joohon Kim, Han-Na Kim, Hyung-Lae Jung, Jongsun BMC Bioinformatics Methodology Article BACKGROUND: Several recent studies showed that next-generation sequencing (NGS)-based human leukocyte antigen (HLA) typing is a feasible and promising technique for variant calling of highly polymorphic regions. To date, however, no method with sufficient read depth has completely solved the allele phasing issue. In this study, we developed a new method (HLAscan) for HLA genotyping using NGS data. RESULTS: HLAscan performs alignment of reads to HLA sequences from the international ImMunoGeneTics project/human leukocyte antigen (IMGT/HLA) database. The distribution of aligned reads was used to calculate a score function to determine correctly phased alleles by progressively removing false-positive alleles. Comparative HLA typing tests using public datasets from the 1000 Genomes Project and the International HapMap Project demonstrated that HLAscan could perform HLA typing more accurately than previously reported NGS-based methods such as HLAreporter and PHLAT. In addition, the results of HLA-A, −B, and -DRB1 typing by HLAscan using data generated by NextGen were identical to those obtained using a Sanger sequencing–based method. We also applied HLAscan to a family dataset with various coverage depths generated on the Illumina HiSeq X-TEN platform. HLAscan identified allele types of HLA-A, −B, −C, −DQB1, and -DRB1 with 100% accuracy for sequences at ≥ 90× depth, and the overall accuracy was 96.9%. CONCLUSIONS: HLAscan, an alignment-based program that takes read distribution into account to determine true allele types, outperformed previously developed HLA typing tools. Therefore, HLAscan can be reliably applied for determination of HLA type across the whole-genome, exome, and target sequences. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1671-3) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-12 /pmc/articles/PMC5427585/ /pubmed/28499414 http://dx.doi.org/10.1186/s12859-017-1671-3 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Ka, Sojeong
Lee, Sunho
Hong, Jonghee
Cho, Yangrae
Sung, Joohon
Kim, Han-Na
Kim, Hyung-Lae
Jung, Jongsun
HLAscan: genotyping of the HLA region using next-generation sequencing data
title HLAscan: genotyping of the HLA region using next-generation sequencing data
title_full HLAscan: genotyping of the HLA region using next-generation sequencing data
title_fullStr HLAscan: genotyping of the HLA region using next-generation sequencing data
title_full_unstemmed HLAscan: genotyping of the HLA region using next-generation sequencing data
title_short HLAscan: genotyping of the HLA region using next-generation sequencing data
title_sort hlascan: genotyping of the hla region using next-generation sequencing data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5427585/
https://www.ncbi.nlm.nih.gov/pubmed/28499414
http://dx.doi.org/10.1186/s12859-017-1671-3
work_keys_str_mv AT kasojeong hlascangenotypingofthehlaregionusingnextgenerationsequencingdata
AT leesunho hlascangenotypingofthehlaregionusingnextgenerationsequencingdata
AT hongjonghee hlascangenotypingofthehlaregionusingnextgenerationsequencingdata
AT choyangrae hlascangenotypingofthehlaregionusingnextgenerationsequencingdata
AT sungjoohon hlascangenotypingofthehlaregionusingnextgenerationsequencingdata
AT kimhanna hlascangenotypingofthehlaregionusingnextgenerationsequencingdata
AT kimhyunglae hlascangenotypingofthehlaregionusingnextgenerationsequencingdata
AT jungjongsun hlascangenotypingofthehlaregionusingnextgenerationsequencingdata