Cargando…
The human antibody sequence space and structural design of the V, J regions, and CDRH3 with Rosetta
The human adaptive immune response enables the targeting of epitopes on pathogens with high specificity. Infection with a pathogen induces somatic hyper-mutation and B-cell selection processes that govern the shape and diversity of the antibody sequence landscape. To date, even the largest immunome...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Taylor & Francis
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9103704/ https://www.ncbi.nlm.nih.gov/pubmed/35544469 http://dx.doi.org/10.1080/19420862.2022.2068212 |
_version_ | 1784707616823312384 |
---|---|
author | Schmitz, Samuel Schmitz, Emily A. Crowe, James E. Meiler, Jens |
author_facet | Schmitz, Samuel Schmitz, Emily A. Crowe, James E. Meiler, Jens |
author_sort | Schmitz, Samuel |
collection | PubMed |
description | The human adaptive immune response enables the targeting of epitopes on pathogens with high specificity. Infection with a pathogen induces somatic hyper-mutation and B-cell selection processes that govern the shape and diversity of the antibody sequence landscape. To date, even the largest immunome repertoires of adaptive immune receptors acquired by next-generation sequencing cannot fully capture the vast antibody sequence space of a single individual, which is estimated to be at least 10(12) potential sequences. Degeneracy of the genetic code means that the number of possible nucleotide triplets (64) is greater than the number of canonical amino acids (20), resulting in some amino acids being encoded by multiple triplets and different amino acids sharing the same nucleotide in 1 or 2 positions in the triplet. We hypothesize that the degeneracy of the genetic code can be used to statistically model an enlarged space of human antibody amino acid sequences, accommodating for the discrepancy between the observed and the hypothesized antibody sequence space. Facilitated by Bayesian statistics and immunome repertoire clustering, we calculated amino acid probabilities from single nucleotide frequencies to infer a human amino acid sequence space that is used to design human-like antibodies with Rosetta. We show that antibodies designed with our restraints are on average up to 16.6% more human-like in the V and J regions compared to the Rosetta designs produced without constraints. The human-likeness of the heavy-chain CDR3 region (CDRH3) could be increased for 8 of 27 antibodies compared to Rosetta designs with a similar number of mutations and could be successfully applied on Mus musculus antibodies to demonstrate humanization. |
format | Online Article Text |
id | pubmed-9103704 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Taylor & Francis |
record_format | MEDLINE/PubMed |
spelling | pubmed-91037042022-05-14 The human antibody sequence space and structural design of the V, J regions, and CDRH3 with Rosetta Schmitz, Samuel Schmitz, Emily A. Crowe, James E. Meiler, Jens MAbs Report The human adaptive immune response enables the targeting of epitopes on pathogens with high specificity. Infection with a pathogen induces somatic hyper-mutation and B-cell selection processes that govern the shape and diversity of the antibody sequence landscape. To date, even the largest immunome repertoires of adaptive immune receptors acquired by next-generation sequencing cannot fully capture the vast antibody sequence space of a single individual, which is estimated to be at least 10(12) potential sequences. Degeneracy of the genetic code means that the number of possible nucleotide triplets (64) is greater than the number of canonical amino acids (20), resulting in some amino acids being encoded by multiple triplets and different amino acids sharing the same nucleotide in 1 or 2 positions in the triplet. We hypothesize that the degeneracy of the genetic code can be used to statistically model an enlarged space of human antibody amino acid sequences, accommodating for the discrepancy between the observed and the hypothesized antibody sequence space. Facilitated by Bayesian statistics and immunome repertoire clustering, we calculated amino acid probabilities from single nucleotide frequencies to infer a human amino acid sequence space that is used to design human-like antibodies with Rosetta. We show that antibodies designed with our restraints are on average up to 16.6% more human-like in the V and J regions compared to the Rosetta designs produced without constraints. The human-likeness of the heavy-chain CDR3 region (CDRH3) could be increased for 8 of 27 antibodies compared to Rosetta designs with a similar number of mutations and could be successfully applied on Mus musculus antibodies to demonstrate humanization. Taylor & Francis 2022-05-11 /pmc/articles/PMC9103704/ /pubmed/35544469 http://dx.doi.org/10.1080/19420862.2022.2068212 Text en © 2022 The Author(s). Published with license by Taylor & Francis Group, LLC. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Report Schmitz, Samuel Schmitz, Emily A. Crowe, James E. Meiler, Jens The human antibody sequence space and structural design of the V, J regions, and CDRH3 with Rosetta |
title | The human antibody sequence space and structural design of the V, J regions, and CDRH3 with Rosetta |
title_full | The human antibody sequence space and structural design of the V, J regions, and CDRH3 with Rosetta |
title_fullStr | The human antibody sequence space and structural design of the V, J regions, and CDRH3 with Rosetta |
title_full_unstemmed | The human antibody sequence space and structural design of the V, J regions, and CDRH3 with Rosetta |
title_short | The human antibody sequence space and structural design of the V, J regions, and CDRH3 with Rosetta |
title_sort | human antibody sequence space and structural design of the v, j regions, and cdrh3 with rosetta |
topic | Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9103704/ https://www.ncbi.nlm.nih.gov/pubmed/35544469 http://dx.doi.org/10.1080/19420862.2022.2068212 |
work_keys_str_mv | AT schmitzsamuel thehumanantibodysequencespaceandstructuraldesignofthevjregionsandcdrh3withrosetta AT schmitzemilya thehumanantibodysequencespaceandstructuraldesignofthevjregionsandcdrh3withrosetta AT crowejamese thehumanantibodysequencespaceandstructuraldesignofthevjregionsandcdrh3withrosetta AT meilerjens thehumanantibodysequencespaceandstructuraldesignofthevjregionsandcdrh3withrosetta AT schmitzsamuel humanantibodysequencespaceandstructuraldesignofthevjregionsandcdrh3withrosetta AT schmitzemilya humanantibodysequencespaceandstructuraldesignofthevjregionsandcdrh3withrosetta AT crowejamese humanantibodysequencespaceandstructuraldesignofthevjregionsandcdrh3withrosetta AT meilerjens humanantibodysequencespaceandstructuraldesignofthevjregionsandcdrh3withrosetta |