Cargando…

Phase-defined complete sequencing of the HLA genes by next-generation sequencing

BACKGROUND: The human leukocyte antigen (HLA) region, the 3.8-Mb segment of the human genome at 6p21, has been associated with more than 100 different diseases, mostly autoimmune diseases. Due to the complex nature of HLA genes, there are difficulties in elucidating complete HLA gene sequences espec...

Descripción completa

Detalles Bibliográficos
Autores principales: Hosomichi, Kazuyoshi, Jinam, Timothy A, Mitsunaga, Shigeki, Nakaoka, Hirofumi, Inoue, Ituro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3671147/
https://www.ncbi.nlm.nih.gov/pubmed/23714642
http://dx.doi.org/10.1186/1471-2164-14-355
_version_ 1782271933355655168
author Hosomichi, Kazuyoshi
Jinam, Timothy A
Mitsunaga, Shigeki
Nakaoka, Hirofumi
Inoue, Ituro
author_facet Hosomichi, Kazuyoshi
Jinam, Timothy A
Mitsunaga, Shigeki
Nakaoka, Hirofumi
Inoue, Ituro
author_sort Hosomichi, Kazuyoshi
collection PubMed
description BACKGROUND: The human leukocyte antigen (HLA) region, the 3.8-Mb segment of the human genome at 6p21, has been associated with more than 100 different diseases, mostly autoimmune diseases. Due to the complex nature of HLA genes, there are difficulties in elucidating complete HLA gene sequences especially HLA gene haplotype structures by the conventional sequencing method. We propose a novel, accurate, and cost-effective method for generating phase-defined complete sequencing of HLA genes by using indexed multiplex next generation sequencing. RESULTS: A total of 33 HLA homozygous samples, 11 HLA heterozygous samples, and 3 parents-child families were subjected to phase-defined HLA gene sequencing. We applied long-range PCR to amplify six HLA genes (HLA-A, -C, -B, DRB1, -DQB1, and –DPB1) followed by transposase-based library construction and multiplex sequencing with the MiSeq sequencer. Paired-end reads (2 × 250 bp) derived from the sequencer were aligned to the six HLA gene segments of UCSC hg19 allowing at most 80 bases mismatch. For HLA homozygous samples, the six amplicons of an individual were pooled and simultaneously sequenced and mapped as an individual-tagging method. The paired-end reads were aligned to corresponding genes of UCSC hg19 and unambiguous, continuous sequences were obtained. For HLA heterozygous samples, each amplicon was separately sequenced and mapped as a gene-tagging method. After alignments, we detected informative paired-end reads harboring SNVs on both forward and reverse reads that are used to separate two chromosomes and to generate two phase-defined sequences in an individual. Consequently, we were able to determine the phase-defined HLA gene sequences from promoter to 3′-UTR and assign up to 8-digit HLA allele numbers, regardless of whether the alleles are rare or novel. Parent–child trio-based sequencing validated our sequencing and phasing methods. CONCLUSIONS: Our protocol generated phased-defined sequences of the entire HLA genes, resulting in high resolution HLA typing and new allele detection.
format Online
Article
Text
id pubmed-3671147
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36711472013-06-05 Phase-defined complete sequencing of the HLA genes by next-generation sequencing Hosomichi, Kazuyoshi Jinam, Timothy A Mitsunaga, Shigeki Nakaoka, Hirofumi Inoue, Ituro BMC Genomics Methodology Article BACKGROUND: The human leukocyte antigen (HLA) region, the 3.8-Mb segment of the human genome at 6p21, has been associated with more than 100 different diseases, mostly autoimmune diseases. Due to the complex nature of HLA genes, there are difficulties in elucidating complete HLA gene sequences especially HLA gene haplotype structures by the conventional sequencing method. We propose a novel, accurate, and cost-effective method for generating phase-defined complete sequencing of HLA genes by using indexed multiplex next generation sequencing. RESULTS: A total of 33 HLA homozygous samples, 11 HLA heterozygous samples, and 3 parents-child families were subjected to phase-defined HLA gene sequencing. We applied long-range PCR to amplify six HLA genes (HLA-A, -C, -B, DRB1, -DQB1, and –DPB1) followed by transposase-based library construction and multiplex sequencing with the MiSeq sequencer. Paired-end reads (2 × 250 bp) derived from the sequencer were aligned to the six HLA gene segments of UCSC hg19 allowing at most 80 bases mismatch. For HLA homozygous samples, the six amplicons of an individual were pooled and simultaneously sequenced and mapped as an individual-tagging method. The paired-end reads were aligned to corresponding genes of UCSC hg19 and unambiguous, continuous sequences were obtained. For HLA heterozygous samples, each amplicon was separately sequenced and mapped as a gene-tagging method. After alignments, we detected informative paired-end reads harboring SNVs on both forward and reverse reads that are used to separate two chromosomes and to generate two phase-defined sequences in an individual. Consequently, we were able to determine the phase-defined HLA gene sequences from promoter to 3′-UTR and assign up to 8-digit HLA allele numbers, regardless of whether the alleles are rare or novel. Parent–child trio-based sequencing validated our sequencing and phasing methods. CONCLUSIONS: Our protocol generated phased-defined sequences of the entire HLA genes, resulting in high resolution HLA typing and new allele detection. BioMed Central 2013-05-28 /pmc/articles/PMC3671147/ /pubmed/23714642 http://dx.doi.org/10.1186/1471-2164-14-355 Text en Copyright © 2013 Hosomichi et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Hosomichi, Kazuyoshi
Jinam, Timothy A
Mitsunaga, Shigeki
Nakaoka, Hirofumi
Inoue, Ituro
Phase-defined complete sequencing of the HLA genes by next-generation sequencing
title Phase-defined complete sequencing of the HLA genes by next-generation sequencing
title_full Phase-defined complete sequencing of the HLA genes by next-generation sequencing
title_fullStr Phase-defined complete sequencing of the HLA genes by next-generation sequencing
title_full_unstemmed Phase-defined complete sequencing of the HLA genes by next-generation sequencing
title_short Phase-defined complete sequencing of the HLA genes by next-generation sequencing
title_sort phase-defined complete sequencing of the hla genes by next-generation sequencing
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3671147/
https://www.ncbi.nlm.nih.gov/pubmed/23714642
http://dx.doi.org/10.1186/1471-2164-14-355
work_keys_str_mv AT hosomichikazuyoshi phasedefinedcompletesequencingofthehlagenesbynextgenerationsequencing
AT jinamtimothya phasedefinedcompletesequencingofthehlagenesbynextgenerationsequencing
AT mitsunagashigeki phasedefinedcompletesequencingofthehlagenesbynextgenerationsequencing
AT nakaokahirofumi phasedefinedcompletesequencingofthehlagenesbynextgenerationsequencing
AT inoueituro phasedefinedcompletesequencingofthehlagenesbynextgenerationsequencing