Cargando…

Efficient Sequencing, Assembly, and Annotation of Human KIR Haplotypes

The homology, recombination, variation, and repetitive elements in the natural killer-cell immunoglobulin-like receptor (KIR) region has made full haplotype DNA interpretation impossible in a high-throughput workflow. Here, we present a new approach using long-read sequencing to efficiently capture,...

Descripción completa

Detalles Bibliográficos
Autores principales: Roe, David, Williams, Jonathan, Ivery, Keyton, Brouckaert, Jenny, Downey, Nick, Locklear, Chad, Kuang, Rui, Maiers, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581912/
https://www.ncbi.nlm.nih.gov/pubmed/33162997
http://dx.doi.org/10.3389/fimmu.2020.582927
_version_ 1783599075735633920
author Roe, David
Williams, Jonathan
Ivery, Keyton
Brouckaert, Jenny
Downey, Nick
Locklear, Chad
Kuang, Rui
Maiers, Martin
author_facet Roe, David
Williams, Jonathan
Ivery, Keyton
Brouckaert, Jenny
Downey, Nick
Locklear, Chad
Kuang, Rui
Maiers, Martin
author_sort Roe, David
collection PubMed
description The homology, recombination, variation, and repetitive elements in the natural killer-cell immunoglobulin-like receptor (KIR) region has made full haplotype DNA interpretation impossible in a high-throughput workflow. Here, we present a new approach using long-read sequencing to efficiently capture, sequence, and assemble diploid human KIR haplotypes. Probes were designed to capture KIR fragments efficiently by leveraging the repeating homology of the region. IDT xGen(®) Lockdown probes were used to capture 2–8 kb of sheared DNA fragments followed by sequencing on a PacBio Sequel. The sequences were error corrected, binned, and then assembled using the Canu assembler. The location of genes and their exon/intron boundaries are included in the workflow. The assembly and annotation was evaluated on 16 individuals (8 African American and 8 Europeans) from whom ground truth was known via long-range sequencing with fosmid library preparation. Using only 18 capture probes, the results show that the assemblies cover 97% of the GenBank reference, are 99.97% concordant, and it takes only 1.8 haplotigs to cover 75% of the reference. We also report the first assembly of diploid KIR haplotypes from long-read WGS. Our targeted hybridization probe capture and sequencing approach is the first of its kind to fully sequence and phase all diploid human KIR haplotypes, and it is efficient enough for population-scale studies and clinical use. The open and free software is available at https://github.com/droeatumn/kass and supported by a environment at https://hub.docker.com/repository/docker/droeatumn/kass.
format Online
Article
Text
id pubmed-7581912
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-75819122020-11-05 Efficient Sequencing, Assembly, and Annotation of Human KIR Haplotypes Roe, David Williams, Jonathan Ivery, Keyton Brouckaert, Jenny Downey, Nick Locklear, Chad Kuang, Rui Maiers, Martin Front Immunol Immunology The homology, recombination, variation, and repetitive elements in the natural killer-cell immunoglobulin-like receptor (KIR) region has made full haplotype DNA interpretation impossible in a high-throughput workflow. Here, we present a new approach using long-read sequencing to efficiently capture, sequence, and assemble diploid human KIR haplotypes. Probes were designed to capture KIR fragments efficiently by leveraging the repeating homology of the region. IDT xGen(®) Lockdown probes were used to capture 2–8 kb of sheared DNA fragments followed by sequencing on a PacBio Sequel. The sequences were error corrected, binned, and then assembled using the Canu assembler. The location of genes and their exon/intron boundaries are included in the workflow. The assembly and annotation was evaluated on 16 individuals (8 African American and 8 Europeans) from whom ground truth was known via long-range sequencing with fosmid library preparation. Using only 18 capture probes, the results show that the assemblies cover 97% of the GenBank reference, are 99.97% concordant, and it takes only 1.8 haplotigs to cover 75% of the reference. We also report the first assembly of diploid KIR haplotypes from long-read WGS. Our targeted hybridization probe capture and sequencing approach is the first of its kind to fully sequence and phase all diploid human KIR haplotypes, and it is efficient enough for population-scale studies and clinical use. The open and free software is available at https://github.com/droeatumn/kass and supported by a environment at https://hub.docker.com/repository/docker/droeatumn/kass. Frontiers Media S.A. 2020-10-09 /pmc/articles/PMC7581912/ /pubmed/33162997 http://dx.doi.org/10.3389/fimmu.2020.582927 Text en Copyright © 2020 Roe, Williams, Ivery, Brouckaert, Downey, Locklear, Kuang and Maiers http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Immunology
Roe, David
Williams, Jonathan
Ivery, Keyton
Brouckaert, Jenny
Downey, Nick
Locklear, Chad
Kuang, Rui
Maiers, Martin
Efficient Sequencing, Assembly, and Annotation of Human KIR Haplotypes
title Efficient Sequencing, Assembly, and Annotation of Human KIR Haplotypes
title_full Efficient Sequencing, Assembly, and Annotation of Human KIR Haplotypes
title_fullStr Efficient Sequencing, Assembly, and Annotation of Human KIR Haplotypes
title_full_unstemmed Efficient Sequencing, Assembly, and Annotation of Human KIR Haplotypes
title_short Efficient Sequencing, Assembly, and Annotation of Human KIR Haplotypes
title_sort efficient sequencing, assembly, and annotation of human kir haplotypes
topic Immunology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581912/
https://www.ncbi.nlm.nih.gov/pubmed/33162997
http://dx.doi.org/10.3389/fimmu.2020.582927
work_keys_str_mv AT roedavid efficientsequencingassemblyandannotationofhumankirhaplotypes
AT williamsjonathan efficientsequencingassemblyandannotationofhumankirhaplotypes
AT iverykeyton efficientsequencingassemblyandannotationofhumankirhaplotypes
AT brouckaertjenny efficientsequencingassemblyandannotationofhumankirhaplotypes
AT downeynick efficientsequencingassemblyandannotationofhumankirhaplotypes
AT locklearchad efficientsequencingassemblyandannotationofhumankirhaplotypes
AT kuangrui efficientsequencingassemblyandannotationofhumankirhaplotypes
AT maiersmartin efficientsequencingassemblyandannotationofhumankirhaplotypes