Cargando…

High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING

The killer-cell immunoglobulin-like receptor (KIR) complex on chromosome 19 encodes receptors that modulate the activity of natural killer cells, and variation in these genes has been linked to infectious and autoimmune disease, as well as having bearing on pregnancy and transplant outcomes. The med...

Descripción completa

Detalles Bibliográficos
Autores principales: Marin, Wesley M., Dandekar, Ravi, Augusto, Danillo G., Yusufali, Tasneem, Heyn, Bianca, Hofmann, Jan, Lange, Vinzenz, Sauter, Jürgen, Norman, Paul J., Hollenbach, Jill A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8360517/
https://www.ncbi.nlm.nih.gov/pubmed/34339413
http://dx.doi.org/10.1371/journal.pcbi.1008904
_version_ 1783737758603280384
author Marin, Wesley M.
Dandekar, Ravi
Augusto, Danillo G.
Yusufali, Tasneem
Heyn, Bianca
Hofmann, Jan
Lange, Vinzenz
Sauter, Jürgen
Norman, Paul J.
Hollenbach, Jill A.
author_facet Marin, Wesley M.
Dandekar, Ravi
Augusto, Danillo G.
Yusufali, Tasneem
Heyn, Bianca
Hofmann, Jan
Lange, Vinzenz
Sauter, Jürgen
Norman, Paul J.
Hollenbach, Jill A.
author_sort Marin, Wesley M.
collection PubMed
description The killer-cell immunoglobulin-like receptor (KIR) complex on chromosome 19 encodes receptors that modulate the activity of natural killer cells, and variation in these genes has been linked to infectious and autoimmune disease, as well as having bearing on pregnancy and transplant outcomes. The medical relevance and high variability of KIR genes makes short-read sequencing an attractive technology for interrogating the region, providing a high-throughput, high-fidelity sequencing method that is cost-effective. However, because this gene complex is characterized by extensive nucleotide polymorphism, structural variation including gene fusions and deletions, and a high level of homology between genes, its interrogation at high resolution has been thwarted by bioinformatic challenges, with most studies limited to examining presence or absence of specific genes. Here, we present the PING (Pushing Immunogenetics to the Next Generation) pipeline, which incorporates empirical data, novel alignment strategies and a custom alignment processing workflow to enable high-throughput KIR sequence analysis from short-read data. PING provides KIR gene copy number classification functionality for all KIR genes through use of a comprehensive alignment reference. The gene copy number determined per individual enables an innovative genotype determination workflow using genotype-matched references. Together, these methods address the challenges imposed by the structural complexity and overall homology of the KIR complex. To determine copy number and genotype determination accuracy, we applied PING to European and African validation cohorts and a synthetic dataset. PING demonstrated exceptional copy number determination performance across all datasets and robust genotype determination performance. Finally, an investigation into discordant genotypes for the synthetic dataset provides insight into misaligned reads, advancing our understanding in interpretation of short-read sequencing data in complex genomic regions. PING promises to support a new era of studies of KIR polymorphism, delivering high-resolution KIR genotypes that are highly accurate, enabling high-quality, high-throughput KIR genotyping for disease and population studies.
format Online
Article
Text
id pubmed-8360517
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83605172021-08-13 High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING Marin, Wesley M. Dandekar, Ravi Augusto, Danillo G. Yusufali, Tasneem Heyn, Bianca Hofmann, Jan Lange, Vinzenz Sauter, Jürgen Norman, Paul J. Hollenbach, Jill A. PLoS Comput Biol Research Article The killer-cell immunoglobulin-like receptor (KIR) complex on chromosome 19 encodes receptors that modulate the activity of natural killer cells, and variation in these genes has been linked to infectious and autoimmune disease, as well as having bearing on pregnancy and transplant outcomes. The medical relevance and high variability of KIR genes makes short-read sequencing an attractive technology for interrogating the region, providing a high-throughput, high-fidelity sequencing method that is cost-effective. However, because this gene complex is characterized by extensive nucleotide polymorphism, structural variation including gene fusions and deletions, and a high level of homology between genes, its interrogation at high resolution has been thwarted by bioinformatic challenges, with most studies limited to examining presence or absence of specific genes. Here, we present the PING (Pushing Immunogenetics to the Next Generation) pipeline, which incorporates empirical data, novel alignment strategies and a custom alignment processing workflow to enable high-throughput KIR sequence analysis from short-read data. PING provides KIR gene copy number classification functionality for all KIR genes through use of a comprehensive alignment reference. The gene copy number determined per individual enables an innovative genotype determination workflow using genotype-matched references. Together, these methods address the challenges imposed by the structural complexity and overall homology of the KIR complex. To determine copy number and genotype determination accuracy, we applied PING to European and African validation cohorts and a synthetic dataset. PING demonstrated exceptional copy number determination performance across all datasets and robust genotype determination performance. Finally, an investigation into discordant genotypes for the synthetic dataset provides insight into misaligned reads, advancing our understanding in interpretation of short-read sequencing data in complex genomic regions. PING promises to support a new era of studies of KIR polymorphism, delivering high-resolution KIR genotypes that are highly accurate, enabling high-quality, high-throughput KIR genotyping for disease and population studies. Public Library of Science 2021-08-02 /pmc/articles/PMC8360517/ /pubmed/34339413 http://dx.doi.org/10.1371/journal.pcbi.1008904 Text en © 2021 Marin et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Marin, Wesley M.
Dandekar, Ravi
Augusto, Danillo G.
Yusufali, Tasneem
Heyn, Bianca
Hofmann, Jan
Lange, Vinzenz
Sauter, Jürgen
Norman, Paul J.
Hollenbach, Jill A.
High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING
title High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING
title_full High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING
title_fullStr High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING
title_full_unstemmed High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING
title_short High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING
title_sort high-throughput interpretation of killer-cell immunoglobulin-like receptor short-read sequencing data with ping
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8360517/
https://www.ncbi.nlm.nih.gov/pubmed/34339413
http://dx.doi.org/10.1371/journal.pcbi.1008904
work_keys_str_mv AT marinwesleym highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT dandekarravi highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT augustodanillog highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT yusufalitasneem highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT heynbianca highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT hofmannjan highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT langevinzenz highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT sauterjurgen highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT normanpaulj highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT hollenbachjilla highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping