Cargando…

Genomic structure of the horse major histocompatibility complex class II region resolved using PacBio long-read sequencing technology

The mammalian Major Histocompatibility Complex (MHC) region contains several gene families characterized by highly polymorphic loci with extensive nucleotide diversity, copy number variation of paralogous genes, and long repetitive sequences. This structural complexity has made it difficult to const...

Descripción completa

Detalles Bibliográficos
Autores principales: Viļuma, Agnese, Mikko, Sofia, Hahn, Daniela, Skow, Loren, Andersson, Göran, Bergström, Tomas F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374520/
https://www.ncbi.nlm.nih.gov/pubmed/28361880
http://dx.doi.org/10.1038/srep45518
_version_ 1782518903366221824
author Viļuma, Agnese
Mikko, Sofia
Hahn, Daniela
Skow, Loren
Andersson, Göran
Bergström, Tomas F.
author_facet Viļuma, Agnese
Mikko, Sofia
Hahn, Daniela
Skow, Loren
Andersson, Göran
Bergström, Tomas F.
author_sort Viļuma, Agnese
collection PubMed
description The mammalian Major Histocompatibility Complex (MHC) region contains several gene families characterized by highly polymorphic loci with extensive nucleotide diversity, copy number variation of paralogous genes, and long repetitive sequences. This structural complexity has made it difficult to construct a reliable reference sequence of the horse MHC region. In this study, we used long-read single molecule, real-time (SMRT) sequencing technology from Pacific Biosciences (PacBio) to sequence eight Bacterial Artificial Chromosome (BAC) clones spanning the horse MHC class II region. The final assembly resulted in a 1,165,328 bp continuous gap free sequence with 35 manually curated genomic loci of which 23 were considered to be functional and 12 to be pseudogenes. In comparison to the MHC class II region in other mammals, the corresponding region in horse shows extraordinary copy number variation and different relative location and directionality of the Eqca-DRB, -DQA, -DQB and –DOB loci. This is the first long-read sequence assembly of the horse MHC class II region with rigorous manual gene annotation, and it will serve as an important resource for association studies of immune-mediated equine diseases and for evolutionary analysis of genetic diversity in this region.
format Online
Article
Text
id pubmed-5374520
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-53745202017-04-03 Genomic structure of the horse major histocompatibility complex class II region resolved using PacBio long-read sequencing technology Viļuma, Agnese Mikko, Sofia Hahn, Daniela Skow, Loren Andersson, Göran Bergström, Tomas F. Sci Rep Article The mammalian Major Histocompatibility Complex (MHC) region contains several gene families characterized by highly polymorphic loci with extensive nucleotide diversity, copy number variation of paralogous genes, and long repetitive sequences. This structural complexity has made it difficult to construct a reliable reference sequence of the horse MHC region. In this study, we used long-read single molecule, real-time (SMRT) sequencing technology from Pacific Biosciences (PacBio) to sequence eight Bacterial Artificial Chromosome (BAC) clones spanning the horse MHC class II region. The final assembly resulted in a 1,165,328 bp continuous gap free sequence with 35 manually curated genomic loci of which 23 were considered to be functional and 12 to be pseudogenes. In comparison to the MHC class II region in other mammals, the corresponding region in horse shows extraordinary copy number variation and different relative location and directionality of the Eqca-DRB, -DQA, -DQB and –DOB loci. This is the first long-read sequence assembly of the horse MHC class II region with rigorous manual gene annotation, and it will serve as an important resource for association studies of immune-mediated equine diseases and for evolutionary analysis of genetic diversity in this region. Nature Publishing Group 2017-03-31 /pmc/articles/PMC5374520/ /pubmed/28361880 http://dx.doi.org/10.1038/srep45518 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Viļuma, Agnese
Mikko, Sofia
Hahn, Daniela
Skow, Loren
Andersson, Göran
Bergström, Tomas F.
Genomic structure of the horse major histocompatibility complex class II region resolved using PacBio long-read sequencing technology
title Genomic structure of the horse major histocompatibility complex class II region resolved using PacBio long-read sequencing technology
title_full Genomic structure of the horse major histocompatibility complex class II region resolved using PacBio long-read sequencing technology
title_fullStr Genomic structure of the horse major histocompatibility complex class II region resolved using PacBio long-read sequencing technology
title_full_unstemmed Genomic structure of the horse major histocompatibility complex class II region resolved using PacBio long-read sequencing technology
title_short Genomic structure of the horse major histocompatibility complex class II region resolved using PacBio long-read sequencing technology
title_sort genomic structure of the horse major histocompatibility complex class ii region resolved using pacbio long-read sequencing technology
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374520/
https://www.ncbi.nlm.nih.gov/pubmed/28361880
http://dx.doi.org/10.1038/srep45518
work_keys_str_mv AT vilumaagnese genomicstructureofthehorsemajorhistocompatibilitycomplexclassiiregionresolvedusingpacbiolongreadsequencingtechnology
AT mikkosofia genomicstructureofthehorsemajorhistocompatibilitycomplexclassiiregionresolvedusingpacbiolongreadsequencingtechnology
AT hahndaniela genomicstructureofthehorsemajorhistocompatibilitycomplexclassiiregionresolvedusingpacbiolongreadsequencingtechnology
AT skowloren genomicstructureofthehorsemajorhistocompatibilitycomplexclassiiregionresolvedusingpacbiolongreadsequencingtechnology
AT anderssongoran genomicstructureofthehorsemajorhistocompatibilitycomplexclassiiregionresolvedusingpacbiolongreadsequencingtechnology
AT bergstromtomasf genomicstructureofthehorsemajorhistocompatibilitycomplexclassiiregionresolvedusingpacbiolongreadsequencingtechnology