Cargando…

Predicting B cell receptor substitution profiles using public repertoire data

B cells develop high affinity receptors during the course of affinity maturation, a cyclic process of mutation and selection. At the end of affinity maturation, a number of cells sharing the same ancestor (i.e. in the same “clonal family”) are released from the germinal center; their amino acid freq...

Descripción completa

Detalles Bibliográficos
Autores principales: Dhar, Amrit, Davidsen, Kristian, Matsen, Frederick A., Minin, Vladimir N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6205660/
https://www.ncbi.nlm.nih.gov/pubmed/30332400
http://dx.doi.org/10.1371/journal.pcbi.1006388
_version_ 1783366222707949568
author Dhar, Amrit
Davidsen, Kristian
Matsen, Frederick A.
Minin, Vladimir N.
author_facet Dhar, Amrit
Davidsen, Kristian
Matsen, Frederick A.
Minin, Vladimir N.
author_sort Dhar, Amrit
collection PubMed
description B cells develop high affinity receptors during the course of affinity maturation, a cyclic process of mutation and selection. At the end of affinity maturation, a number of cells sharing the same ancestor (i.e. in the same “clonal family”) are released from the germinal center; their amino acid frequency profile reflects the allowed and disallowed substitutions at each position. These clonal-family-specific frequency profiles, called “substitution profiles”, are useful for studying the course of affinity maturation as well as for antibody engineering purposes. However, most often only a single sequence is recovered from each clonal family in a sequencing experiment, making it impossible to construct a clonal-family-specific substitution profile. Given the public release of many high-quality large B cell receptor datasets, one may ask whether it is possible to use such data in a prediction model for clonal-family-specific substitution profiles. In this paper, we present the method “Substitution Profiles Using Related Families” (SPURF), a penalized tensor regression framework that integrates information from a rich assemblage of datasets to predict the clonal-family-specific substitution profile for any single input sequence. Using this framework, we show that substitution profiles from similar clonal families can be leveraged together with simulated substitution profiles and germline gene sequence information to improve prediction. We fit this model on a large public dataset and validate the robustness of our approach on two external datasets. Furthermore, we provide a command-line tool in an open-source software package (https://github.com/krdav/SPURF) implementing these ideas and providing easy prediction using our pre-fit models.
format Online
Article
Text
id pubmed-6205660
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-62056602018-11-19 Predicting B cell receptor substitution profiles using public repertoire data Dhar, Amrit Davidsen, Kristian Matsen, Frederick A. Minin, Vladimir N. PLoS Comput Biol Research Article B cells develop high affinity receptors during the course of affinity maturation, a cyclic process of mutation and selection. At the end of affinity maturation, a number of cells sharing the same ancestor (i.e. in the same “clonal family”) are released from the germinal center; their amino acid frequency profile reflects the allowed and disallowed substitutions at each position. These clonal-family-specific frequency profiles, called “substitution profiles”, are useful for studying the course of affinity maturation as well as for antibody engineering purposes. However, most often only a single sequence is recovered from each clonal family in a sequencing experiment, making it impossible to construct a clonal-family-specific substitution profile. Given the public release of many high-quality large B cell receptor datasets, one may ask whether it is possible to use such data in a prediction model for clonal-family-specific substitution profiles. In this paper, we present the method “Substitution Profiles Using Related Families” (SPURF), a penalized tensor regression framework that integrates information from a rich assemblage of datasets to predict the clonal-family-specific substitution profile for any single input sequence. Using this framework, we show that substitution profiles from similar clonal families can be leveraged together with simulated substitution profiles and germline gene sequence information to improve prediction. We fit this model on a large public dataset and validate the robustness of our approach on two external datasets. Furthermore, we provide a command-line tool in an open-source software package (https://github.com/krdav/SPURF) implementing these ideas and providing easy prediction using our pre-fit models. Public Library of Science 2018-10-17 /pmc/articles/PMC6205660/ /pubmed/30332400 http://dx.doi.org/10.1371/journal.pcbi.1006388 Text en © 2018 Dhar et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Dhar, Amrit
Davidsen, Kristian
Matsen, Frederick A.
Minin, Vladimir N.
Predicting B cell receptor substitution profiles using public repertoire data
title Predicting B cell receptor substitution profiles using public repertoire data
title_full Predicting B cell receptor substitution profiles using public repertoire data
title_fullStr Predicting B cell receptor substitution profiles using public repertoire data
title_full_unstemmed Predicting B cell receptor substitution profiles using public repertoire data
title_short Predicting B cell receptor substitution profiles using public repertoire data
title_sort predicting b cell receptor substitution profiles using public repertoire data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6205660/
https://www.ncbi.nlm.nih.gov/pubmed/30332400
http://dx.doi.org/10.1371/journal.pcbi.1006388
work_keys_str_mv AT dharamrit predictingbcellreceptorsubstitutionprofilesusingpublicrepertoiredata
AT davidsenkristian predictingbcellreceptorsubstitutionprofilesusingpublicrepertoiredata
AT matsenfredericka predictingbcellreceptorsubstitutionprofilesusingpublicrepertoiredata
AT mininvladimirn predictingbcellreceptorsubstitutionprofilesusingpublicrepertoiredata