Cargando…

Position-Specific Enrichment Ratio Matrix scores predict antibody variant properties from deep sequencing data

MOTIVATION: Deep sequencing of antibody and related protein libraries after phage or yeast-surface display sorting is widely used to identify variants with increased affinity, specificity, and/or improvements in key biophysical properties. Conventional approaches for identifying optimal variants typ...

Descripción completa

Detalles Bibliográficos
Autores principales: Smith, Matthew D, Case, Marshall A, Makowski, Emily K, Tessier, Peter M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10477941/
https://www.ncbi.nlm.nih.gov/pubmed/37478351
http://dx.doi.org/10.1093/bioinformatics/btad446
_version_ 1785101243895513088
author Smith, Matthew D
Case, Marshall A
Makowski, Emily K
Tessier, Peter M
author_facet Smith, Matthew D
Case, Marshall A
Makowski, Emily K
Tessier, Peter M
author_sort Smith, Matthew D
collection PubMed
description MOTIVATION: Deep sequencing of antibody and related protein libraries after phage or yeast-surface display sorting is widely used to identify variants with increased affinity, specificity, and/or improvements in key biophysical properties. Conventional approaches for identifying optimal variants typically use the frequencies of observation in enriched libraries or the corresponding enrichment ratios. However, these approaches disregard the vast majority of deep sequencing data and often fail to identify the best variants in the libraries. RESULTS: Here, we present a method, Position-Specific Enrichment Ratio Matrix (PSERM) scoring, that uses entire deep sequencing datasets from pre- and post-selections to score each observed protein variant. The PSERM scores are the sum of the site-specific enrichment ratios observed at each mutated position. We find that PSERM scores are much more reproducible and correlate more strongly with experimentally measured properties than frequencies or enrichment ratios, including for multiple antibody properties (affinity and non-specific binding) for a clinical-stage antibody (emibetuzumab). We expect that this method will be broadly applicable to diverse protein engineering campaigns. AVAILABILITY AND IMPLEMENTATION: All deep sequencing datasets and code to perform the analyses presented within are available via https://github.com/Tessier-Lab-UMich/PSERM_paper.
format Online
Article
Text
id pubmed-10477941
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-104779412023-09-06 Position-Specific Enrichment Ratio Matrix scores predict antibody variant properties from deep sequencing data Smith, Matthew D Case, Marshall A Makowski, Emily K Tessier, Peter M Bioinformatics Original Paper MOTIVATION: Deep sequencing of antibody and related protein libraries after phage or yeast-surface display sorting is widely used to identify variants with increased affinity, specificity, and/or improvements in key biophysical properties. Conventional approaches for identifying optimal variants typically use the frequencies of observation in enriched libraries or the corresponding enrichment ratios. However, these approaches disregard the vast majority of deep sequencing data and often fail to identify the best variants in the libraries. RESULTS: Here, we present a method, Position-Specific Enrichment Ratio Matrix (PSERM) scoring, that uses entire deep sequencing datasets from pre- and post-selections to score each observed protein variant. The PSERM scores are the sum of the site-specific enrichment ratios observed at each mutated position. We find that PSERM scores are much more reproducible and correlate more strongly with experimentally measured properties than frequencies or enrichment ratios, including for multiple antibody properties (affinity and non-specific binding) for a clinical-stage antibody (emibetuzumab). We expect that this method will be broadly applicable to diverse protein engineering campaigns. AVAILABILITY AND IMPLEMENTATION: All deep sequencing datasets and code to perform the analyses presented within are available via https://github.com/Tessier-Lab-UMich/PSERM_paper. Oxford University Press 2023-07-21 /pmc/articles/PMC10477941/ /pubmed/37478351 http://dx.doi.org/10.1093/bioinformatics/btad446 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Smith, Matthew D
Case, Marshall A
Makowski, Emily K
Tessier, Peter M
Position-Specific Enrichment Ratio Matrix scores predict antibody variant properties from deep sequencing data
title Position-Specific Enrichment Ratio Matrix scores predict antibody variant properties from deep sequencing data
title_full Position-Specific Enrichment Ratio Matrix scores predict antibody variant properties from deep sequencing data
title_fullStr Position-Specific Enrichment Ratio Matrix scores predict antibody variant properties from deep sequencing data
title_full_unstemmed Position-Specific Enrichment Ratio Matrix scores predict antibody variant properties from deep sequencing data
title_short Position-Specific Enrichment Ratio Matrix scores predict antibody variant properties from deep sequencing data
title_sort position-specific enrichment ratio matrix scores predict antibody variant properties from deep sequencing data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10477941/
https://www.ncbi.nlm.nih.gov/pubmed/37478351
http://dx.doi.org/10.1093/bioinformatics/btad446
work_keys_str_mv AT smithmatthewd positionspecificenrichmentratiomatrixscorespredictantibodyvariantpropertiesfromdeepsequencingdata
AT casemarshalla positionspecificenrichmentratiomatrixscorespredictantibodyvariantpropertiesfromdeepsequencingdata
AT makowskiemilyk positionspecificenrichmentratiomatrixscorespredictantibodyvariantpropertiesfromdeepsequencingdata
AT tessierpeterm positionspecificenrichmentratiomatrixscorespredictantibodyvariantpropertiesfromdeepsequencingdata