Cargando…

In silico proof of principle of machine learning-based antibody design at unconstrained scale

Generative machine learning (ML) has been postulated to become a major driver in the computational design of antigen-specific monoclonal antibodies (mAb). However, efforts to confirm this hypothesis have been hindered by the infeasibility of testing arbitrarily large numbers of antibody sequences fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Akbar, Rahmad, Robert, Philippe A., Weber, Cédric R., Widrich, Michael, Frank, Robert, Pavlović, Milena, Scheffer, Lonneke, Chernigovskaya, Maria, Snapkov, Igor, Slabodkin, Andrei, Mehta, Brij Bhushan, Miho, Enkelejda, Lund-Johansen, Fridtjof, Andersen, Jan Terje, Hochreiter, Sepp, Hobæk Haff, Ingrid, Klambauer, Günter, Sandve, Geir Kjetil, Greiff, Victor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8986205/
https://www.ncbi.nlm.nih.gov/pubmed/35377271
http://dx.doi.org/10.1080/19420862.2022.2031482
_version_ 1784682498550136832
author Akbar, Rahmad
Robert, Philippe A.
Weber, Cédric R.
Widrich, Michael
Frank, Robert
Pavlović, Milena
Scheffer, Lonneke
Chernigovskaya, Maria
Snapkov, Igor
Slabodkin, Andrei
Mehta, Brij Bhushan
Miho, Enkelejda
Lund-Johansen, Fridtjof
Andersen, Jan Terje
Hochreiter, Sepp
Hobæk Haff, Ingrid
Klambauer, Günter
Sandve, Geir Kjetil
Greiff, Victor
author_facet Akbar, Rahmad
Robert, Philippe A.
Weber, Cédric R.
Widrich, Michael
Frank, Robert
Pavlović, Milena
Scheffer, Lonneke
Chernigovskaya, Maria
Snapkov, Igor
Slabodkin, Andrei
Mehta, Brij Bhushan
Miho, Enkelejda
Lund-Johansen, Fridtjof
Andersen, Jan Terje
Hochreiter, Sepp
Hobæk Haff, Ingrid
Klambauer, Günter
Sandve, Geir Kjetil
Greiff, Victor
author_sort Akbar, Rahmad
collection PubMed
description Generative machine learning (ML) has been postulated to become a major driver in the computational design of antigen-specific monoclonal antibodies (mAb). However, efforts to confirm this hypothesis have been hindered by the infeasibility of testing arbitrarily large numbers of antibody sequences for their most critical design parameters: paratope, epitope, affinity, and developability. To address this challenge, we leveraged a lattice-based antibody-antigen binding simulation framework, which incorporates a wide range of physiological antibody-binding parameters. The simulation framework enables the computation of synthetic antibody-antigen 3D-structures, and it functions as an oracle for unrestricted prospective evaluation and benchmarking of antibody design parameters of ML-generated antibody sequences. We found that a deep generative model, trained exclusively on antibody sequence (one dimensional: 1D) data can be used to design conformational (three dimensional: 3D) epitope-specific antibodies, matching, or exceeding the training dataset in affinity and developability parameter value variety. Furthermore, we established a lower threshold of sequence diversity necessary for high-accuracy generative antibody ML and demonstrated that this lower threshold also holds on experimental real-world data. Finally, we show that transfer learning enables the generation of high-affinity antibody sequences from low-N training data. Our work establishes a priori feasibility and the theoretical foundation of high-throughput ML-based mAb design.
format Online
Article
Text
id pubmed-8986205
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-89862052022-04-07 In silico proof of principle of machine learning-based antibody design at unconstrained scale Akbar, Rahmad Robert, Philippe A. Weber, Cédric R. Widrich, Michael Frank, Robert Pavlović, Milena Scheffer, Lonneke Chernigovskaya, Maria Snapkov, Igor Slabodkin, Andrei Mehta, Brij Bhushan Miho, Enkelejda Lund-Johansen, Fridtjof Andersen, Jan Terje Hochreiter, Sepp Hobæk Haff, Ingrid Klambauer, Günter Sandve, Geir Kjetil Greiff, Victor MAbs Report Generative machine learning (ML) has been postulated to become a major driver in the computational design of antigen-specific monoclonal antibodies (mAb). However, efforts to confirm this hypothesis have been hindered by the infeasibility of testing arbitrarily large numbers of antibody sequences for their most critical design parameters: paratope, epitope, affinity, and developability. To address this challenge, we leveraged a lattice-based antibody-antigen binding simulation framework, which incorporates a wide range of physiological antibody-binding parameters. The simulation framework enables the computation of synthetic antibody-antigen 3D-structures, and it functions as an oracle for unrestricted prospective evaluation and benchmarking of antibody design parameters of ML-generated antibody sequences. We found that a deep generative model, trained exclusively on antibody sequence (one dimensional: 1D) data can be used to design conformational (three dimensional: 3D) epitope-specific antibodies, matching, or exceeding the training dataset in affinity and developability parameter value variety. Furthermore, we established a lower threshold of sequence diversity necessary for high-accuracy generative antibody ML and demonstrated that this lower threshold also holds on experimental real-world data. Finally, we show that transfer learning enables the generation of high-affinity antibody sequences from low-N training data. Our work establishes a priori feasibility and the theoretical foundation of high-throughput ML-based mAb design. Taylor & Francis 2022-04-04 /pmc/articles/PMC8986205/ /pubmed/35377271 http://dx.doi.org/10.1080/19420862.2022.2031482 Text en © 2022 The Author(s). Published with license by Taylor & Francis Group, LLC. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Report
Akbar, Rahmad
Robert, Philippe A.
Weber, Cédric R.
Widrich, Michael
Frank, Robert
Pavlović, Milena
Scheffer, Lonneke
Chernigovskaya, Maria
Snapkov, Igor
Slabodkin, Andrei
Mehta, Brij Bhushan
Miho, Enkelejda
Lund-Johansen, Fridtjof
Andersen, Jan Terje
Hochreiter, Sepp
Hobæk Haff, Ingrid
Klambauer, Günter
Sandve, Geir Kjetil
Greiff, Victor
In silico proof of principle of machine learning-based antibody design at unconstrained scale
title In silico proof of principle of machine learning-based antibody design at unconstrained scale
title_full In silico proof of principle of machine learning-based antibody design at unconstrained scale
title_fullStr In silico proof of principle of machine learning-based antibody design at unconstrained scale
title_full_unstemmed In silico proof of principle of machine learning-based antibody design at unconstrained scale
title_short In silico proof of principle of machine learning-based antibody design at unconstrained scale
title_sort in silico proof of principle of machine learning-based antibody design at unconstrained scale
topic Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8986205/
https://www.ncbi.nlm.nih.gov/pubmed/35377271
http://dx.doi.org/10.1080/19420862.2022.2031482
work_keys_str_mv AT akbarrahmad insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT robertphilippea insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT webercedricr insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT widrichmichael insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT frankrobert insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT pavlovicmilena insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT schefferlonneke insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT chernigovskayamaria insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT snapkovigor insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT slabodkinandrei insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT mehtabrijbhushan insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT mihoenkelejda insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT lundjohansenfridtjof insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT andersenjanterje insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT hochreitersepp insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT hobækhaffingrid insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT klambauergunter insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT sandvegeirkjetil insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale
AT greiffvictor insilicoproofofprincipleofmachinelearningbasedantibodydesignatunconstrainedscale