Fast clonal family inference from large-scale B cell repertoire sequencing data

Advances in high-throughput sequencing technologies have facilitated the large-scale characterization of B cell receptor (BCR) repertoires. However, the vast amount and high diversity of the BCR sequences pose challenges for efficient and biologically meaningful analysis. Here, we introduce fastBCR,...

Bibliographic Details
Main authors: Wang, Kaixuan; Hu, Xihao; Zhang, Jian
Format: Online Article Text
Language: English
Published: Elsevier 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10626204/
https://www.ncbi.nlm.nih.gov/pubmed/37788671
http://dx.doi.org/10.1016/j.crmeth.2023.100601
author Wang, Kaixuan
Hu, Xihao
Zhang, Jian
collection PubMed
description Advances in high-throughput sequencing technologies have facilitated the large-scale characterization of B cell receptor (BCR) repertoires. However, the vast amount and high diversity of the BCR sequences pose challenges for efficient and biologically meaningful analysis. Here, we introduce fastBCR, an efficient computational approach for inferring B cell clonal families from massive BCR heavy chain sequences. We demonstrate that fastBCR substantially reduces the running time while ensuring high accuracy on simulated datasets with diverse numbers of B cell lineages and varying mutation rates. We apply fastBCR to real BCR sequencing data from peripheral blood samples of COVID-19 patients, showing that the inferred clonal families display disease-associated features, as well as corresponding antigen-binding specificity and affinity. Overall, our results demonstrate the advantages of fastBCR for analyzing BCR repertoire data, which will facilitate the identification of disease-associated antibodies and improve our understanding of the B cell immune response.
format Online
Article
Text
id pubmed-10626204
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-10626204 2023-11-07 Fast clonal family inference from large-scale B cell repertoire sequencing data Wang, Kaixuan Hu, Xihao Zhang, Jian Cell Rep Methods Article Elsevier 2023-10-02 /pmc/articles/PMC10626204/ /pubmed/37788671 http://dx.doi.org/10.1016/j.crmeth.2023.100601 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
title Fast clonal family inference from large-scale B cell repertoire sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10626204/
https://www.ncbi.nlm.nih.gov/pubmed/37788671
http://dx.doi.org/10.1016/j.crmeth.2023.100601