Cargando…
Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data
INTRODUCTION: Monoclonal antibody light chain proteins secreted by clonal plasma cells cause tissue damage due to amyloid deposition and other mechanisms. The unique protein sequence associated with each case contributes to the diversity of clinical features observed in patients. Extensive work has...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10151772/ https://www.ncbi.nlm.nih.gov/pubmed/37143670 http://dx.doi.org/10.3389/fimmu.2023.1167235 |
_version_ | 1785035612380725248 |
---|---|
author | Nau, Allison Shen, Yun Sanchorawala, Vaishali Prokaeva, Tatiana Morgan, Gareth J. |
author_facet | Nau, Allison Shen, Yun Sanchorawala, Vaishali Prokaeva, Tatiana Morgan, Gareth J. |
author_sort | Nau, Allison |
collection | PubMed |
description | INTRODUCTION: Monoclonal antibody light chain proteins secreted by clonal plasma cells cause tissue damage due to amyloid deposition and other mechanisms. The unique protein sequence associated with each case contributes to the diversity of clinical features observed in patients. Extensive work has characterized many light chains associated with multiple myeloma, light chain amyloidosis and other disorders, which we have collected in the publicly accessible database, AL-Base. However, light chain sequence diversity makes it difficult to determine the contribution of specific amino acid changes to pathology. Sequences of light chains associated with multiple myeloma provide a useful comparison to study mechanisms of light chain aggregation, but relatively few monoclonal sequences have been determined. Therefore, we sought to identify complete light chain sequences from existing high throughput sequencing data. METHODS: We developed a computational approach using the MiXCR suite of tools to extract complete rearranged IGV(L)-IGJ(L) sequences from untargeted RNA sequencing data. This method was applied to whole-transcriptome RNA sequencing data from 766 newly diagnosed patients in the Multiple Myeloma Research Foundation CoMMpass study. RESULTS: Monoclonal IGV(L)-IGJ(L) sequences were defined as those where >50% of assigned IGK or IGL reads from each sample mapped to a unique sequence. Clonal light chain sequences were identified in 705/766 samples from the CoMMpass study. Of these, 685 sequences covered the complete IGV(L)-IGJ(L) region. The identity of the assigned sequences is consistent with their associated clinical data and with partial sequences previously determined from the same cohort of samples. Sequences have been deposited in AL-Base. DISCUSSION: Our method allows routine identification of clonal antibody sequences from RNA sequencing data collected for gene expression studies. The sequences identified represent, to our knowledge, the largest collection of multiple myeloma-associated light chains reported to date. This work substantially increases the number of monoclonal light chains known to be associated with non-amyloid plasma cell disorders and will facilitate studies of light chain pathology. |
format | Online Article Text |
id | pubmed-10151772 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-101517722023-05-03 Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data Nau, Allison Shen, Yun Sanchorawala, Vaishali Prokaeva, Tatiana Morgan, Gareth J. Front Immunol Immunology INTRODUCTION: Monoclonal antibody light chain proteins secreted by clonal plasma cells cause tissue damage due to amyloid deposition and other mechanisms. The unique protein sequence associated with each case contributes to the diversity of clinical features observed in patients. Extensive work has characterized many light chains associated with multiple myeloma, light chain amyloidosis and other disorders, which we have collected in the publicly accessible database, AL-Base. However, light chain sequence diversity makes it difficult to determine the contribution of specific amino acid changes to pathology. Sequences of light chains associated with multiple myeloma provide a useful comparison to study mechanisms of light chain aggregation, but relatively few monoclonal sequences have been determined. Therefore, we sought to identify complete light chain sequences from existing high throughput sequencing data. METHODS: We developed a computational approach using the MiXCR suite of tools to extract complete rearranged IGV(L)-IGJ(L) sequences from untargeted RNA sequencing data. This method was applied to whole-transcriptome RNA sequencing data from 766 newly diagnosed patients in the Multiple Myeloma Research Foundation CoMMpass study. RESULTS: Monoclonal IGV(L)-IGJ(L) sequences were defined as those where >50% of assigned IGK or IGL reads from each sample mapped to a unique sequence. Clonal light chain sequences were identified in 705/766 samples from the CoMMpass study. Of these, 685 sequences covered the complete IGV(L)-IGJ(L) region. The identity of the assigned sequences is consistent with their associated clinical data and with partial sequences previously determined from the same cohort of samples. Sequences have been deposited in AL-Base. DISCUSSION: Our method allows routine identification of clonal antibody sequences from RNA sequencing data collected for gene expression studies. The sequences identified represent, to our knowledge, the largest collection of multiple myeloma-associated light chains reported to date. This work substantially increases the number of monoclonal light chains known to be associated with non-amyloid plasma cell disorders and will facilitate studies of light chain pathology. Frontiers Media S.A. 2023-04-18 /pmc/articles/PMC10151772/ /pubmed/37143670 http://dx.doi.org/10.3389/fimmu.2023.1167235 Text en Copyright © 2023 Nau, Shen, Sanchorawala, Prokaeva and Morgan https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Immunology Nau, Allison Shen, Yun Sanchorawala, Vaishali Prokaeva, Tatiana Morgan, Gareth J. Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data |
title | Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data |
title_full | Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data |
title_fullStr | Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data |
title_full_unstemmed | Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data |
title_short | Complete variable domain sequences of monoclonal antibody light chains identified from untargeted RNA sequencing data |
title_sort | complete variable domain sequences of monoclonal antibody light chains identified from untargeted rna sequencing data |
topic | Immunology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10151772/ https://www.ncbi.nlm.nih.gov/pubmed/37143670 http://dx.doi.org/10.3389/fimmu.2023.1167235 |
work_keys_str_mv | AT nauallison completevariabledomainsequencesofmonoclonalantibodylightchainsidentifiedfromuntargetedrnasequencingdata AT shenyun completevariabledomainsequencesofmonoclonalantibodylightchainsidentifiedfromuntargetedrnasequencingdata AT sanchorawalavaishali completevariabledomainsequencesofmonoclonalantibodylightchainsidentifiedfromuntargetedrnasequencingdata AT prokaevatatiana completevariabledomainsequencesofmonoclonalantibodylightchainsidentifiedfromuntargetedrnasequencingdata AT morgangarethj completevariabledomainsequencesofmonoclonalantibodylightchainsidentifiedfromuntargetedrnasequencingdata |