Cargando…
HLAscan: genotyping of the HLA region using next-generation sequencing data
BACKGROUND: Several recent studies showed that next-generation sequencing (NGS)-based human leukocyte antigen (HLA) typing is a feasible and promising technique for variant calling of highly polymorphic regions. To date, however, no method with sufficient read depth has completely solved the allele...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5427585/ https://www.ncbi.nlm.nih.gov/pubmed/28499414 http://dx.doi.org/10.1186/s12859-017-1671-3 |
_version_ | 1783235659398381568 |
---|---|
author | Ka, Sojeong Lee, Sunho Hong, Jonghee Cho, Yangrae Sung, Joohon Kim, Han-Na Kim, Hyung-Lae Jung, Jongsun |
author_facet | Ka, Sojeong Lee, Sunho Hong, Jonghee Cho, Yangrae Sung, Joohon Kim, Han-Na Kim, Hyung-Lae Jung, Jongsun |
author_sort | Ka, Sojeong |
collection | PubMed |
description | BACKGROUND: Several recent studies showed that next-generation sequencing (NGS)-based human leukocyte antigen (HLA) typing is a feasible and promising technique for variant calling of highly polymorphic regions. To date, however, no method with sufficient read depth has completely solved the allele phasing issue. In this study, we developed a new method (HLAscan) for HLA genotyping using NGS data. RESULTS: HLAscan performs alignment of reads to HLA sequences from the international ImMunoGeneTics project/human leukocyte antigen (IMGT/HLA) database. The distribution of aligned reads was used to calculate a score function to determine correctly phased alleles by progressively removing false-positive alleles. Comparative HLA typing tests using public datasets from the 1000 Genomes Project and the International HapMap Project demonstrated that HLAscan could perform HLA typing more accurately than previously reported NGS-based methods such as HLAreporter and PHLAT. In addition, the results of HLA-A, −B, and -DRB1 typing by HLAscan using data generated by NextGen were identical to those obtained using a Sanger sequencing–based method. We also applied HLAscan to a family dataset with various coverage depths generated on the Illumina HiSeq X-TEN platform. HLAscan identified allele types of HLA-A, −B, −C, −DQB1, and -DRB1 with 100% accuracy for sequences at ≥ 90× depth, and the overall accuracy was 96.9%. CONCLUSIONS: HLAscan, an alignment-based program that takes read distribution into account to determine true allele types, outperformed previously developed HLA typing tools. Therefore, HLAscan can be reliably applied for determination of HLA type across the whole-genome, exome, and target sequences. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1671-3) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5427585 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-54275852017-05-15 HLAscan: genotyping of the HLA region using next-generation sequencing data Ka, Sojeong Lee, Sunho Hong, Jonghee Cho, Yangrae Sung, Joohon Kim, Han-Na Kim, Hyung-Lae Jung, Jongsun BMC Bioinformatics Methodology Article BACKGROUND: Several recent studies showed that next-generation sequencing (NGS)-based human leukocyte antigen (HLA) typing is a feasible and promising technique for variant calling of highly polymorphic regions. To date, however, no method with sufficient read depth has completely solved the allele phasing issue. In this study, we developed a new method (HLAscan) for HLA genotyping using NGS data. RESULTS: HLAscan performs alignment of reads to HLA sequences from the international ImMunoGeneTics project/human leukocyte antigen (IMGT/HLA) database. The distribution of aligned reads was used to calculate a score function to determine correctly phased alleles by progressively removing false-positive alleles. Comparative HLA typing tests using public datasets from the 1000 Genomes Project and the International HapMap Project demonstrated that HLAscan could perform HLA typing more accurately than previously reported NGS-based methods such as HLAreporter and PHLAT. In addition, the results of HLA-A, −B, and -DRB1 typing by HLAscan using data generated by NextGen were identical to those obtained using a Sanger sequencing–based method. We also applied HLAscan to a family dataset with various coverage depths generated on the Illumina HiSeq X-TEN platform. HLAscan identified allele types of HLA-A, −B, −C, −DQB1, and -DRB1 with 100% accuracy for sequences at ≥ 90× depth, and the overall accuracy was 96.9%. CONCLUSIONS: HLAscan, an alignment-based program that takes read distribution into account to determine true allele types, outperformed previously developed HLA typing tools. Therefore, HLAscan can be reliably applied for determination of HLA type across the whole-genome, exome, and target sequences. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1671-3) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-12 /pmc/articles/PMC5427585/ /pubmed/28499414 http://dx.doi.org/10.1186/s12859-017-1671-3 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Ka, Sojeong Lee, Sunho Hong, Jonghee Cho, Yangrae Sung, Joohon Kim, Han-Na Kim, Hyung-Lae Jung, Jongsun HLAscan: genotyping of the HLA region using next-generation sequencing data |
title | HLAscan: genotyping of the HLA region using next-generation sequencing data |
title_full | HLAscan: genotyping of the HLA region using next-generation sequencing data |
title_fullStr | HLAscan: genotyping of the HLA region using next-generation sequencing data |
title_full_unstemmed | HLAscan: genotyping of the HLA region using next-generation sequencing data |
title_short | HLAscan: genotyping of the HLA region using next-generation sequencing data |
title_sort | hlascan: genotyping of the hla region using next-generation sequencing data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5427585/ https://www.ncbi.nlm.nih.gov/pubmed/28499414 http://dx.doi.org/10.1186/s12859-017-1671-3 |
work_keys_str_mv | AT kasojeong hlascangenotypingofthehlaregionusingnextgenerationsequencingdata AT leesunho hlascangenotypingofthehlaregionusingnextgenerationsequencingdata AT hongjonghee hlascangenotypingofthehlaregionusingnextgenerationsequencingdata AT choyangrae hlascangenotypingofthehlaregionusingnextgenerationsequencingdata AT sungjoohon hlascangenotypingofthehlaregionusingnextgenerationsequencingdata AT kimhanna hlascangenotypingofthehlaregionusingnextgenerationsequencingdata AT kimhyunglae hlascangenotypingofthehlaregionusingnextgenerationsequencingdata AT jungjongsun hlascangenotypingofthehlaregionusingnextgenerationsequencingdata |