Cargando…

CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data

MOTIVATION: B cells display remarkable diversity in producing B-cell receptors through recombination of immunoglobulin (Ig) V-D-J genes. Somatic hypermutation (SHM) of immunoglobulin heavy chain variable (IGHV) genes are used as a prognostic marker in B-cell malignancies. Clinically, IGHV mutation s...

Descripción completa

Detalles Bibliográficos
Autores principales: Islam, Rashedul, Bilenky, Misha, Weng, Andrew P, Connors, Joseph M, Hirst, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8600631/
https://www.ncbi.nlm.nih.gov/pubmed/34806017
http://dx.doi.org/10.1093/bioadv/vbab021
_version_ 1784601194165960704
author Islam, Rashedul
Bilenky, Misha
Weng, Andrew P
Connors, Joseph M
Hirst, Martin
author_facet Islam, Rashedul
Bilenky, Misha
Weng, Andrew P
Connors, Joseph M
Hirst, Martin
author_sort Islam, Rashedul
collection PubMed
description MOTIVATION: B cells display remarkable diversity in producing B-cell receptors through recombination of immunoglobulin (Ig) V-D-J genes. Somatic hypermutation (SHM) of immunoglobulin heavy chain variable (IGHV) genes are used as a prognostic marker in B-cell malignancies. Clinically, IGHV mutation status is determined by targeted Sanger sequencing which is a resource-intensive and low-throughput procedure. Here, we describe a bioinformatic pipeline, CRIS (Complete Reconstruction of Immunoglobulin IGHV-D-J Sequences) that uses RNA sequencing (RNA-seq) datasets to reconstruct IGHV-D-J sequences and determine IGHV SHM status. RESULTS: CRIS extracts RNA-seq reads aligned to Ig gene loci, performs assembly of Ig transcripts and aligns the resulting contigs to reference Ig sequences to enumerate and classify SHMs in the IGHV gene sequence. CRIS improves on existing tools that infer the B-cell receptor repertoire from RNA-seq data using a portion IGHV gene segment by de novo assembly. We show that the SHM status identified by CRIS using the entire IGHV gene segment is highly concordant with clinical classification in three independent chronic lymphocytic leukemia patient cohorts. AVAILABILITY AND IMPLEMENTATION: The CRIS pipeline is available under the MIT License from https://github.com/Rashedul/CRIS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.
format Online
Article
Text
id pubmed-8600631
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86006312021-11-18 CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data Islam, Rashedul Bilenky, Misha Weng, Andrew P Connors, Joseph M Hirst, Martin Bioinform Adv Original Article MOTIVATION: B cells display remarkable diversity in producing B-cell receptors through recombination of immunoglobulin (Ig) V-D-J genes. Somatic hypermutation (SHM) of immunoglobulin heavy chain variable (IGHV) genes are used as a prognostic marker in B-cell malignancies. Clinically, IGHV mutation status is determined by targeted Sanger sequencing which is a resource-intensive and low-throughput procedure. Here, we describe a bioinformatic pipeline, CRIS (Complete Reconstruction of Immunoglobulin IGHV-D-J Sequences) that uses RNA sequencing (RNA-seq) datasets to reconstruct IGHV-D-J sequences and determine IGHV SHM status. RESULTS: CRIS extracts RNA-seq reads aligned to Ig gene loci, performs assembly of Ig transcripts and aligns the resulting contigs to reference Ig sequences to enumerate and classify SHMs in the IGHV gene sequence. CRIS improves on existing tools that infer the B-cell receptor repertoire from RNA-seq data using a portion IGHV gene segment by de novo assembly. We show that the SHM status identified by CRIS using the entire IGHV gene segment is highly concordant with clinical classification in three independent chronic lymphocytic leukemia patient cohorts. AVAILABILITY AND IMPLEMENTATION: The CRIS pipeline is available under the MIT License from https://github.com/Rashedul/CRIS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2021-09-09 /pmc/articles/PMC8600631/ /pubmed/34806017 http://dx.doi.org/10.1093/bioadv/vbab021 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Islam, Rashedul
Bilenky, Misha
Weng, Andrew P
Connors, Joseph M
Hirst, Martin
CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data
title CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data
title_full CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data
title_fullStr CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data
title_full_unstemmed CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data
title_short CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data
title_sort cris: complete reconstruction of immunoglobulin v-d-j sequences from rna-seq data
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8600631/
https://www.ncbi.nlm.nih.gov/pubmed/34806017
http://dx.doi.org/10.1093/bioadv/vbab021
work_keys_str_mv AT islamrashedul criscompletereconstructionofimmunoglobulinvdjsequencesfromrnaseqdata
AT bilenkymisha criscompletereconstructionofimmunoglobulinvdjsequencesfromrnaseqdata
AT wengandrewp criscompletereconstructionofimmunoglobulinvdjsequencesfromrnaseqdata
AT connorsjosephm criscompletereconstructionofimmunoglobulinvdjsequencesfromrnaseqdata
AT hirstmartin criscompletereconstructionofimmunoglobulinvdjsequencesfromrnaseqdata