Cargando…
High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING
The killer-cell immunoglobulin-like receptor (KIR) complex on chromosome 19 encodes receptors that modulate the activity of natural killer cells, and variation in these genes has been linked to infectious and autoimmune disease, as well as having bearing on pregnancy and transplant outcomes. The med...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8360517/ https://www.ncbi.nlm.nih.gov/pubmed/34339413 http://dx.doi.org/10.1371/journal.pcbi.1008904 |
_version_ | 1783737758603280384 |
---|---|
author | Marin, Wesley M. Dandekar, Ravi Augusto, Danillo G. Yusufali, Tasneem Heyn, Bianca Hofmann, Jan Lange, Vinzenz Sauter, Jürgen Norman, Paul J. Hollenbach, Jill A. |
author_facet | Marin, Wesley M. Dandekar, Ravi Augusto, Danillo G. Yusufali, Tasneem Heyn, Bianca Hofmann, Jan Lange, Vinzenz Sauter, Jürgen Norman, Paul J. Hollenbach, Jill A. |
author_sort | Marin, Wesley M. |
collection | PubMed |
description | The killer-cell immunoglobulin-like receptor (KIR) complex on chromosome 19 encodes receptors that modulate the activity of natural killer cells, and variation in these genes has been linked to infectious and autoimmune disease, as well as having bearing on pregnancy and transplant outcomes. The medical relevance and high variability of KIR genes makes short-read sequencing an attractive technology for interrogating the region, providing a high-throughput, high-fidelity sequencing method that is cost-effective. However, because this gene complex is characterized by extensive nucleotide polymorphism, structural variation including gene fusions and deletions, and a high level of homology between genes, its interrogation at high resolution has been thwarted by bioinformatic challenges, with most studies limited to examining presence or absence of specific genes. Here, we present the PING (Pushing Immunogenetics to the Next Generation) pipeline, which incorporates empirical data, novel alignment strategies and a custom alignment processing workflow to enable high-throughput KIR sequence analysis from short-read data. PING provides KIR gene copy number classification functionality for all KIR genes through use of a comprehensive alignment reference. The gene copy number determined per individual enables an innovative genotype determination workflow using genotype-matched references. Together, these methods address the challenges imposed by the structural complexity and overall homology of the KIR complex. To determine copy number and genotype determination accuracy, we applied PING to European and African validation cohorts and a synthetic dataset. PING demonstrated exceptional copy number determination performance across all datasets and robust genotype determination performance. Finally, an investigation into discordant genotypes for the synthetic dataset provides insight into misaligned reads, advancing our understanding in interpretation of short-read sequencing data in complex genomic regions. PING promises to support a new era of studies of KIR polymorphism, delivering high-resolution KIR genotypes that are highly accurate, enabling high-quality, high-throughput KIR genotyping for disease and population studies. |
format | Online Article Text |
id | pubmed-8360517 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-83605172021-08-13 High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING Marin, Wesley M. Dandekar, Ravi Augusto, Danillo G. Yusufali, Tasneem Heyn, Bianca Hofmann, Jan Lange, Vinzenz Sauter, Jürgen Norman, Paul J. Hollenbach, Jill A. PLoS Comput Biol Research Article The killer-cell immunoglobulin-like receptor (KIR) complex on chromosome 19 encodes receptors that modulate the activity of natural killer cells, and variation in these genes has been linked to infectious and autoimmune disease, as well as having bearing on pregnancy and transplant outcomes. The medical relevance and high variability of KIR genes makes short-read sequencing an attractive technology for interrogating the region, providing a high-throughput, high-fidelity sequencing method that is cost-effective. However, because this gene complex is characterized by extensive nucleotide polymorphism, structural variation including gene fusions and deletions, and a high level of homology between genes, its interrogation at high resolution has been thwarted by bioinformatic challenges, with most studies limited to examining presence or absence of specific genes. Here, we present the PING (Pushing Immunogenetics to the Next Generation) pipeline, which incorporates empirical data, novel alignment strategies and a custom alignment processing workflow to enable high-throughput KIR sequence analysis from short-read data. PING provides KIR gene copy number classification functionality for all KIR genes through use of a comprehensive alignment reference. The gene copy number determined per individual enables an innovative genotype determination workflow using genotype-matched references. Together, these methods address the challenges imposed by the structural complexity and overall homology of the KIR complex. To determine copy number and genotype determination accuracy, we applied PING to European and African validation cohorts and a synthetic dataset. PING demonstrated exceptional copy number determination performance across all datasets and robust genotype determination performance. Finally, an investigation into discordant genotypes for the synthetic dataset provides insight into misaligned reads, advancing our understanding in interpretation of short-read sequencing data in complex genomic regions. PING promises to support a new era of studies of KIR polymorphism, delivering high-resolution KIR genotypes that are highly accurate, enabling high-quality, high-throughput KIR genotyping for disease and population studies. Public Library of Science 2021-08-02 /pmc/articles/PMC8360517/ /pubmed/34339413 http://dx.doi.org/10.1371/journal.pcbi.1008904 Text en © 2021 Marin et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Marin, Wesley M. Dandekar, Ravi Augusto, Danillo G. Yusufali, Tasneem Heyn, Bianca Hofmann, Jan Lange, Vinzenz Sauter, Jürgen Norman, Paul J. Hollenbach, Jill A. High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING |
title | High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING |
title_full | High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING |
title_fullStr | High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING |
title_full_unstemmed | High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING |
title_short | High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING |
title_sort | high-throughput interpretation of killer-cell immunoglobulin-like receptor short-read sequencing data with ping |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8360517/ https://www.ncbi.nlm.nih.gov/pubmed/34339413 http://dx.doi.org/10.1371/journal.pcbi.1008904 |
work_keys_str_mv | AT marinwesleym highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping AT dandekarravi highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping AT augustodanillog highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping AT yusufalitasneem highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping AT heynbianca highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping AT hofmannjan highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping AT langevinzenz highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping AT sauterjurgen highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping AT normanpaulj highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping AT hollenbachjilla highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping |