Cargando…
CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data
MOTIVATION: B cells display remarkable diversity in producing B-cell receptors through recombination of immunoglobulin (Ig) V-D-J genes. Somatic hypermutation (SHM) of immunoglobulin heavy chain variable (IGHV) genes are used as a prognostic marker in B-cell malignancies. Clinically, IGHV mutation s...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8600631/ https://www.ncbi.nlm.nih.gov/pubmed/34806017 http://dx.doi.org/10.1093/bioadv/vbab021 |
_version_ | 1784601194165960704 |
---|---|
author | Islam, Rashedul Bilenky, Misha Weng, Andrew P Connors, Joseph M Hirst, Martin |
author_facet | Islam, Rashedul Bilenky, Misha Weng, Andrew P Connors, Joseph M Hirst, Martin |
author_sort | Islam, Rashedul |
collection | PubMed |
description | MOTIVATION: B cells display remarkable diversity in producing B-cell receptors through recombination of immunoglobulin (Ig) V-D-J genes. Somatic hypermutation (SHM) of immunoglobulin heavy chain variable (IGHV) genes are used as a prognostic marker in B-cell malignancies. Clinically, IGHV mutation status is determined by targeted Sanger sequencing which is a resource-intensive and low-throughput procedure. Here, we describe a bioinformatic pipeline, CRIS (Complete Reconstruction of Immunoglobulin IGHV-D-J Sequences) that uses RNA sequencing (RNA-seq) datasets to reconstruct IGHV-D-J sequences and determine IGHV SHM status. RESULTS: CRIS extracts RNA-seq reads aligned to Ig gene loci, performs assembly of Ig transcripts and aligns the resulting contigs to reference Ig sequences to enumerate and classify SHMs in the IGHV gene sequence. CRIS improves on existing tools that infer the B-cell receptor repertoire from RNA-seq data using a portion IGHV gene segment by de novo assembly. We show that the SHM status identified by CRIS using the entire IGHV gene segment is highly concordant with clinical classification in three independent chronic lymphocytic leukemia patient cohorts. AVAILABILITY AND IMPLEMENTATION: The CRIS pipeline is available under the MIT License from https://github.com/Rashedul/CRIS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. |
format | Online Article Text |
id | pubmed-8600631 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-86006312021-11-18 CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data Islam, Rashedul Bilenky, Misha Weng, Andrew P Connors, Joseph M Hirst, Martin Bioinform Adv Original Article MOTIVATION: B cells display remarkable diversity in producing B-cell receptors through recombination of immunoglobulin (Ig) V-D-J genes. Somatic hypermutation (SHM) of immunoglobulin heavy chain variable (IGHV) genes are used as a prognostic marker in B-cell malignancies. Clinically, IGHV mutation status is determined by targeted Sanger sequencing which is a resource-intensive and low-throughput procedure. Here, we describe a bioinformatic pipeline, CRIS (Complete Reconstruction of Immunoglobulin IGHV-D-J Sequences) that uses RNA sequencing (RNA-seq) datasets to reconstruct IGHV-D-J sequences and determine IGHV SHM status. RESULTS: CRIS extracts RNA-seq reads aligned to Ig gene loci, performs assembly of Ig transcripts and aligns the resulting contigs to reference Ig sequences to enumerate and classify SHMs in the IGHV gene sequence. CRIS improves on existing tools that infer the B-cell receptor repertoire from RNA-seq data using a portion IGHV gene segment by de novo assembly. We show that the SHM status identified by CRIS using the entire IGHV gene segment is highly concordant with clinical classification in three independent chronic lymphocytic leukemia patient cohorts. AVAILABILITY AND IMPLEMENTATION: The CRIS pipeline is available under the MIT License from https://github.com/Rashedul/CRIS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2021-09-09 /pmc/articles/PMC8600631/ /pubmed/34806017 http://dx.doi.org/10.1093/bioadv/vbab021 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Islam, Rashedul Bilenky, Misha Weng, Andrew P Connors, Joseph M Hirst, Martin CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data |
title | CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data |
title_full | CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data |
title_fullStr | CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data |
title_full_unstemmed | CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data |
title_short | CRIS: complete reconstruction of immunoglobulin V-D-J sequences from RNA-seq data |
title_sort | cris: complete reconstruction of immunoglobulin v-d-j sequences from rna-seq data |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8600631/ https://www.ncbi.nlm.nih.gov/pubmed/34806017 http://dx.doi.org/10.1093/bioadv/vbab021 |
work_keys_str_mv | AT islamrashedul criscompletereconstructionofimmunoglobulinvdjsequencesfromrnaseqdata AT bilenkymisha criscompletereconstructionofimmunoglobulinvdjsequencesfromrnaseqdata AT wengandrewp criscompletereconstructionofimmunoglobulinvdjsequencesfromrnaseqdata AT connorsjosephm criscompletereconstructionofimmunoglobulinvdjsequencesfromrnaseqdata AT hirstmartin criscompletereconstructionofimmunoglobulinvdjsequencesfromrnaseqdata |