Cargando…

GENERATION OF NOVEL ANTIBODY CANDIDATES USING TRANSFORMER AND GAN-BASED DEEP LEARNING ARTIFICIAL INTELLIGENCE

INTRODUCTION: Conventional library-based antibody display can only explore a small fraction of the sequences generated from animal immunization, not even to exhaust the potential sequence diversity that can be turned into antibody therapies. This is because screening for antibody is limited to seque...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Hongyu, Lyu, Xiaochen, Zhao, Qichao, Liu, Bo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10370453/
http://dx.doi.org/10.1093/abt/tbad014.014
_version_ 1785077937748312064
author Zhang, Hongyu
Lyu, Xiaochen
Zhao, Qichao
Liu, Bo
author_facet Zhang, Hongyu
Lyu, Xiaochen
Zhao, Qichao
Liu, Bo
author_sort Zhang, Hongyu
collection PubMed
description INTRODUCTION: Conventional library-based antibody display can only explore a small fraction of the sequences generated from animal immunization, not even to exhaust the potential sequence diversity that can be turned into antibody therapies. This is because screening for antibody is limited to sequences that can be displayed, which only constitute a subset of the entire sequences generated by B cells, whereas screening for antibody directly from single B cells can be costly. Here, we introduce a novel Artificial Intelligence-enabling tool to navigate antibody discovery from a broader range of search space with reduced cost. We trained a transformer-based model from sequences of an immunized library to cluster the clones and a generative adversarial network (GAN)-based model to generate novel sequences that can be potentially developed into antibody therapies. BACKGROUND AND SIGNIFICANCE: One limitation in the early discovery of antibody is the number of functional candidates that can be selected. Our work provides an AI-enabling tool to discover and generate a panel of antibodies of differentiated binding strengths to a broad range of epitopes to ensure functional coverage. METHODS & RESULTS: We extracted 10(4) sequences from the FACS-enriched yeast pool from a fully immunized alpaca (Lama pacos) using Next Generation Sequencing, from which we assembled 10(3) unique sdAb sequences. We fine-tuned a transformer-based deep learning model, which was previously trained from our dataset containing 100,000 antibody sequences, on such pre-processed sdAb sequences giving representation that correlates to the sequence homology for the clustering of clonal types. We postulate such representation also encodes long-range amino acid interactions in the 3D structure, making the accuracy exceeds the performance of bioinformatics-based primary sequence homology analysis. This process is fully automated and optimized to require minima computational resources. We selected 15 candidates from AI-clustered clonal groups and experimentally measured their binding activity. Kd of 12 candidates were of 10(−9) affinity and 1 candidates were of 10(−8) affinity, the rest one candidate was non-binding (hence a hit rate of 87%). The large sequence diversity of the CDR3 show these nanobodies are potentially good binders for a wide range of epitopes. We generated a CDR-diversifying virtual library (10(3)) of each binding candidate by training a GAN-based models using the sequences of the same clonal group of the binder sequences. This method incorporates the probability of amino acid residues on each specific location that provides a more precise mutagenesis route than PCR-based affinity maturation. The generated sequences provided a wider CDR sequence diversity for the selection of antibodies of differentiated affinity and epitopes, which could generate candidates of different functionality. CONCLUSION: Antibody discovery is a central step in early drug development that identification of a wide range of functional candidates could increase the success rate and reduce risks in later developments. We built an AI-enabling tool for the searching and generation of functional antibodies from animal immunization library. We believe this technology would help deliver candidates of fine-tuned affinity and functionality.
format Online
Article
Text
id pubmed-10370453
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103704532023-07-27 GENERATION OF NOVEL ANTIBODY CANDIDATES USING TRANSFORMER AND GAN-BASED DEEP LEARNING ARTIFICIAL INTELLIGENCE Zhang, Hongyu Lyu, Xiaochen Zhao, Qichao Liu, Bo Antib Ther Abstract INTRODUCTION: Conventional library-based antibody display can only explore a small fraction of the sequences generated from animal immunization, not even to exhaust the potential sequence diversity that can be turned into antibody therapies. This is because screening for antibody is limited to sequences that can be displayed, which only constitute a subset of the entire sequences generated by B cells, whereas screening for antibody directly from single B cells can be costly. Here, we introduce a novel Artificial Intelligence-enabling tool to navigate antibody discovery from a broader range of search space with reduced cost. We trained a transformer-based model from sequences of an immunized library to cluster the clones and a generative adversarial network (GAN)-based model to generate novel sequences that can be potentially developed into antibody therapies. BACKGROUND AND SIGNIFICANCE: One limitation in the early discovery of antibody is the number of functional candidates that can be selected. Our work provides an AI-enabling tool to discover and generate a panel of antibodies of differentiated binding strengths to a broad range of epitopes to ensure functional coverage. METHODS & RESULTS: We extracted 10(4) sequences from the FACS-enriched yeast pool from a fully immunized alpaca (Lama pacos) using Next Generation Sequencing, from which we assembled 10(3) unique sdAb sequences. We fine-tuned a transformer-based deep learning model, which was previously trained from our dataset containing 100,000 antibody sequences, on such pre-processed sdAb sequences giving representation that correlates to the sequence homology for the clustering of clonal types. We postulate such representation also encodes long-range amino acid interactions in the 3D structure, making the accuracy exceeds the performance of bioinformatics-based primary sequence homology analysis. This process is fully automated and optimized to require minima computational resources. We selected 15 candidates from AI-clustered clonal groups and experimentally measured their binding activity. Kd of 12 candidates were of 10(−9) affinity and 1 candidates were of 10(−8) affinity, the rest one candidate was non-binding (hence a hit rate of 87%). The large sequence diversity of the CDR3 show these nanobodies are potentially good binders for a wide range of epitopes. We generated a CDR-diversifying virtual library (10(3)) of each binding candidate by training a GAN-based models using the sequences of the same clonal group of the binder sequences. This method incorporates the probability of amino acid residues on each specific location that provides a more precise mutagenesis route than PCR-based affinity maturation. The generated sequences provided a wider CDR sequence diversity for the selection of antibodies of differentiated affinity and epitopes, which could generate candidates of different functionality. CONCLUSION: Antibody discovery is a central step in early drug development that identification of a wide range of functional candidates could increase the success rate and reduce risks in later developments. We built an AI-enabling tool for the searching and generation of functional antibodies from animal immunization library. We believe this technology would help deliver candidates of fine-tuned affinity and functionality. Oxford University Press 2023-07-24 /pmc/articles/PMC10370453/ http://dx.doi.org/10.1093/abt/tbad014.014 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Antibody Therapeutics. All rights reserved. For Permissions, please email: journals.permissions@oup.com https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Abstract
Zhang, Hongyu
Lyu, Xiaochen
Zhao, Qichao
Liu, Bo
GENERATION OF NOVEL ANTIBODY CANDIDATES USING TRANSFORMER AND GAN-BASED DEEP LEARNING ARTIFICIAL INTELLIGENCE
title GENERATION OF NOVEL ANTIBODY CANDIDATES USING TRANSFORMER AND GAN-BASED DEEP LEARNING ARTIFICIAL INTELLIGENCE
title_full GENERATION OF NOVEL ANTIBODY CANDIDATES USING TRANSFORMER AND GAN-BASED DEEP LEARNING ARTIFICIAL INTELLIGENCE
title_fullStr GENERATION OF NOVEL ANTIBODY CANDIDATES USING TRANSFORMER AND GAN-BASED DEEP LEARNING ARTIFICIAL INTELLIGENCE
title_full_unstemmed GENERATION OF NOVEL ANTIBODY CANDIDATES USING TRANSFORMER AND GAN-BASED DEEP LEARNING ARTIFICIAL INTELLIGENCE
title_short GENERATION OF NOVEL ANTIBODY CANDIDATES USING TRANSFORMER AND GAN-BASED DEEP LEARNING ARTIFICIAL INTELLIGENCE
title_sort generation of novel antibody candidates using transformer and gan-based deep learning artificial intelligence
topic Abstract
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10370453/
http://dx.doi.org/10.1093/abt/tbad014.014
work_keys_str_mv AT zhanghongyu generationofnovelantibodycandidatesusingtransformerandganbaseddeeplearningartificialintelligence
AT lyuxiaochen generationofnovelantibodycandidatesusingtransformerandganbaseddeeplearningartificialintelligence
AT zhaoqichao generationofnovelantibodycandidatesusingtransformerandganbaseddeeplearningartificialintelligence
AT liubo generationofnovelantibodycandidatesusingtransformerandganbaseddeeplearningartificialintelligence