Automated analysis of immunosequencing datasets reveals novel immunoglobulin D genes across diverse species

Immunoglobulin genes are formed through V(D)J recombination, which joins the variable (V), diversity (D), and joining (J) germline genes. Since variations in germline genes have been linked to various diseases, personalized immunogenomics focuses on finding alleles of germline genes across various p...

Bibliographic Details
Main authors: Bhardwaj, Vinnu; Franceschetti, Massimo; Rao, Ramesh; Pevzner, Pavel A.; Safonova, Yana
Format: Online Article Text
Language: English
Published: Public Library of Science 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7295240/
https://www.ncbi.nlm.nih.gov/pubmed/32339161
http://dx.doi.org/10.1371/journal.pcbi.1007837
_version_ 1783546615965941760
author Bhardwaj, Vinnu
Franceschetti, Massimo
Rao, Ramesh
Pevzner, Pavel A.
Safonova, Yana
author_facet Bhardwaj, Vinnu
Franceschetti, Massimo
Rao, Ramesh
Pevzner, Pavel A.
Safonova, Yana
author_sort Bhardwaj, Vinnu
collection PubMed
description Immunoglobulin genes are formed through V(D)J recombination, which joins the variable (V), diversity (D), and joining (J) germline genes. Since variations in germline genes have been linked to various diseases, personalized immunogenomics focuses on finding alleles of germline genes across various patients. Although reconstruction of V and J genes is a well-studied problem, the more challenging task of reconstructing D genes remained open until the IgScout algorithm was developed in 2019. In this work, we address limitations of IgScout by developing a probabilistic MINING-D algorithm for D gene reconstruction, apply it to hundreds of immunosequencing datasets from multiple species, and validate the newly inferred D genes by analyzing diverse whole genome sequencing datasets and haplotyping heterozygous V genes.
format Online
Article
Text
id pubmed-7295240
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-72952402020-06-19 Automated analysis of immunosequencing datasets reveals novel immunoglobulin D genes across diverse species Bhardwaj, Vinnu Franceschetti, Massimo Rao, Ramesh Pevzner, Pavel A. Safonova, Yana PLoS Comput Biol Research Article Immunoglobulin genes are formed through V(D)J recombination, which joins the variable (V), diversity (D), and joining (J) germline genes. Since variations in germline genes have been linked to various diseases, personalized immunogenomics focuses on finding alleles of germline genes across various patients. Although reconstruction of V and J genes is a well-studied problem, the more challenging task of reconstructing D genes remained open until the IgScout algorithm was developed in 2019. In this work, we address limitations of IgScout by developing a probabilistic MINING-D algorithm for D gene reconstruction, apply it to hundreds of immunosequencing datasets from multiple species, and validate the newly inferred D genes by analyzing diverse whole genome sequencing datasets and haplotyping heterozygous V genes. Public Library of Science 2020-04-27 /pmc/articles/PMC7295240/ /pubmed/32339161 http://dx.doi.org/10.1371/journal.pcbi.1007837 Text en © 2020 Bhardwaj et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Bhardwaj, Vinnu
Franceschetti, Massimo
Rao, Ramesh
Pevzner, Pavel A.
Safonova, Yana
Automated analysis of immunosequencing datasets reveals novel immunoglobulin D genes across diverse species
title Automated analysis of immunosequencing datasets reveals novel immunoglobulin D genes across diverse species
title_sort automated analysis of immunosequencing datasets reveals novel immunoglobulin d genes across diverse species
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7295240/
https://www.ncbi.nlm.nih.gov/pubmed/32339161
http://dx.doi.org/10.1371/journal.pcbi.1007837