Cargando…

Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data

INTRODUCTION: Monoclonal antibody light chain proteins secreted by clonal plasma cells cause tissue damage due to amyloid deposition and other mechanisms. The unique protein sequence associated with each case contributes to the diversity of clinical features observed in patients. Extensive work has...

Descripción completa

Detalles Bibliográficos
Autores principales: Nau, Allison, Shen, Yun, Sanchorawala, Vaishali, Prokaeva, Tatiana, Morgan, Gareth J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10151772/
https://www.ncbi.nlm.nih.gov/pubmed/37143670
http://dx.doi.org/10.3389/fimmu.2023.1167235
_version_ 1785035612380725248
author Nau, Allison
Shen, Yun
Sanchorawala, Vaishali
Prokaeva, Tatiana
Morgan, Gareth J.
author_facet Nau, Allison
Shen, Yun
Sanchorawala, Vaishali
Prokaeva, Tatiana
Morgan, Gareth J.
author_sort Nau, Allison
collection PubMed
description INTRODUCTION: Monoclonal antibody light chain proteins secreted by clonal plasma cells cause tissue damage due to amyloid deposition and other mechanisms. The unique protein sequence associated with each case contributes to the diversity of clinical features observed in patients. Extensive work has characterized many light chains associated with multiple myeloma, light chain amyloidosis and other disorders, which we have collected in the publicly accessible database, AL-Base. However, light chain sequence diversity makes it difficult to determine the contribution of specific amino acid changes to pathology. Sequences of light chains associated with multiple myeloma provide a useful comparison to study mechanisms of light chain aggregation, but relatively few monoclonal sequences have been determined. Therefore, we sought to identify complete light chain sequences from existing high throughput sequencing data. METHODS: We developed a computational approach using the MiXCR suite of tools to extract complete rearranged IGV(L)-IGJ(L) sequences from untargeted RNA sequencing data. This method was applied to whole-transcriptome RNA sequencing data from 766 newly diagnosed patients in the Multiple Myeloma Research Foundation CoMMpass study. RESULTS: Monoclonal IGV(L)-IGJ(L) sequences were defined as those where >50% of assigned IGK or IGL reads from each sample mapped to a unique sequence. Clonal light chain sequences were identified in 705/766 samples from the CoMMpass study. Of these, 685 sequences covered the complete IGV(L)-IGJ(L) region. The identity of the assigned sequences is consistent with their associated clinical data and with partial sequences previously determined from the same cohort of samples. Sequences have been deposited in AL-Base. DISCUSSION: Our method allows routine identification of clonal antibody sequences from RNA sequencing data collected for gene expression studies. The sequences identified represent, to our knowledge, the largest collection of multiple myeloma-associated light chains reported to date. This work substantially increases the number of monoclonal light chains known to be associated with non-amyloid plasma cell disorders and will facilitate studies of light chain pathology.
format Online
Article
Text
id pubmed-10151772
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-101517722023-05-03 Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data Nau, Allison Shen, Yun Sanchorawala, Vaishali Prokaeva, Tatiana Morgan, Gareth J. Front Immunol Immunology INTRODUCTION: Monoclonal antibody light chain proteins secreted by clonal plasma cells cause tissue damage due to amyloid deposition and other mechanisms. The unique protein sequence associated with each case contributes to the diversity of clinical features observed in patients. Extensive work has characterized many light chains associated with multiple myeloma, light chain amyloidosis and other disorders, which we have collected in the publicly accessible database, AL-Base. However, light chain sequence diversity makes it difficult to determine the contribution of specific amino acid changes to pathology. Sequences of light chains associated with multiple myeloma provide a useful comparison to study mechanisms of light chain aggregation, but relatively few monoclonal sequences have been determined. Therefore, we sought to identify complete light chain sequences from existing high throughput sequencing data. METHODS: We developed a computational approach using the MiXCR suite of tools to extract complete rearranged IGV(L)-IGJ(L) sequences from untargeted RNA sequencing data. This method was applied to whole-transcriptome RNA sequencing data from 766 newly diagnosed patients in the Multiple Myeloma Research Foundation CoMMpass study. RESULTS: Monoclonal IGV(L)-IGJ(L) sequences were defined as those where >50% of assigned IGK or IGL reads from each sample mapped to a unique sequence. Clonal light chain sequences were identified in 705/766 samples from the CoMMpass study. Of these, 685 sequences covered the complete IGV(L)-IGJ(L) region. The identity of the assigned sequences is consistent with their associated clinical data and with partial sequences previously determined from the same cohort of samples. Sequences have been deposited in AL-Base. DISCUSSION: Our method allows routine identification of clonal antibody sequences from RNA sequencing data collected for gene expression studies. The sequences identified represent, to our knowledge, the largest collection of multiple myeloma-associated light chains reported to date. This work substantially increases the number of monoclonal light chains known to be associated with non-amyloid plasma cell disorders and will facilitate studies of light chain pathology. Frontiers Media S.A. 2023-04-18 /pmc/articles/PMC10151772/ /pubmed/37143670 http://dx.doi.org/10.3389/fimmu.2023.1167235 Text en Copyright © 2023 Nau, Shen, Sanchorawala, Prokaeva and Morgan https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Immunology
Nau, Allison
Shen, Yun
Sanchorawala, Vaishali
Prokaeva, Tatiana
Morgan, Gareth J.
Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data
title Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data
title_full Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data
title_fullStr Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data
title_full_unstemmed Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data
title_short Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data
title_sort complete variable domain sequences of monoclonal antibody light chains identified from untargeted rna sequencing data
topic Immunology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10151772/
https://www.ncbi.nlm.nih.gov/pubmed/37143670
http://dx.doi.org/10.3389/fimmu.2023.1167235
work_keys_str_mv AT nauallison completevariabledomainsequencesofmonoclonalantibodylightchainsidentifiedfromuntargetedrnasequencingdata
AT shenyun completevariabledomainsequencesofmonoclonalantibodylightchainsidentifiedfromuntargetedrnasequencingdata
AT sanchorawalavaishali completevariabledomainsequencesofmonoclonalantibodylightchainsidentifiedfromuntargetedrnasequencingdata
AT prokaevatatiana completevariabledomainsequencesofmonoclonalantibodylightchainsidentifiedfromuntargetedrnasequencingdata
AT morgangarethj completevariabledomainsequencesofmonoclonalantibodylightchainsidentifiedfromuntargetedrnasequencingdata