Cargando…

Geometric potentials from deep learning improve prediction of CDR H3 loop structures

MOTIVATION: Antibody structure is largely conserved, except for a complementarity-determining region featuring six variable loops. Five of these loops adopt canonical folds which can typically be predicted with existing methods, while the remaining loop (CDR H3) remains a challenge due to its highly...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruffolo, Jeffrey A, Guerra, Carlos, Mahajan, Sai Pooja, Sulam, Jeremias, Gray, Jeffrey J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355305/
https://www.ncbi.nlm.nih.gov/pubmed/32657412
http://dx.doi.org/10.1093/bioinformatics/btaa457
_version_ 1783558249088286720
author Ruffolo, Jeffrey A
Guerra, Carlos
Mahajan, Sai Pooja
Sulam, Jeremias
Gray, Jeffrey J
author_facet Ruffolo, Jeffrey A
Guerra, Carlos
Mahajan, Sai Pooja
Sulam, Jeremias
Gray, Jeffrey J
author_sort Ruffolo, Jeffrey A
collection PubMed
description MOTIVATION: Antibody structure is largely conserved, except for a complementarity-determining region featuring six variable loops. Five of these loops adopt canonical folds which can typically be predicted with existing methods, while the remaining loop (CDR H3) remains a challenge due to its highly diverse set of observed conformations. In recent years, deep neural networks have proven to be effective at capturing the complex patterns of protein structure. This work proposes DeepH3, a deep residual neural network that learns to predict inter-residue distances and orientations from antibody heavy and light chain sequence. The output of DeepH3 is a set of probability distributions over distances and orientation angles between pairs of residues. These distributions are converted to geometric potentials and used to discriminate between decoy structures produced by RosettaAntibody and predict new CDR H3 loop structures de novo. RESULTS: When evaluated on the Rosetta antibody benchmark dataset of 49 targets, DeepH3-predicted potentials identified better, same and worse structures [measured by root-mean-squared distance (RMSD) from the experimental CDR H3 loop structure] than the standard Rosetta energy function for 33, 6 and 10 targets, respectively, and improved the average RMSD of predictions by 32.1% (1.4 Å). Analysis of individual geometric potentials revealed that inter-residue orientations were more effective than inter-residue distances for discriminating near-native CDR H3 loops. When applied to de novo prediction of CDR H3 loop structures, DeepH3 achieves an average RMSD of 2.2 ± 1.1 Å on the Rosetta antibody benchmark. AVAILABILITY AND IMPLEMENTATION: DeepH3 source code and pre-trained model parameters are freely available at https://github.com/Graylab/deepH3-distances-orientations. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7355305
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73553052020-07-16 Geometric potentials from deep learning improve prediction of CDR H3 loop structures Ruffolo, Jeffrey A Guerra, Carlos Mahajan, Sai Pooja Sulam, Jeremias Gray, Jeffrey J Bioinformatics Macromolecular Sequence, Structure, and Function MOTIVATION: Antibody structure is largely conserved, except for a complementarity-determining region featuring six variable loops. Five of these loops adopt canonical folds which can typically be predicted with existing methods, while the remaining loop (CDR H3) remains a challenge due to its highly diverse set of observed conformations. In recent years, deep neural networks have proven to be effective at capturing the complex patterns of protein structure. This work proposes DeepH3, a deep residual neural network that learns to predict inter-residue distances and orientations from antibody heavy and light chain sequence. The output of DeepH3 is a set of probability distributions over distances and orientation angles between pairs of residues. These distributions are converted to geometric potentials and used to discriminate between decoy structures produced by RosettaAntibody and predict new CDR H3 loop structures de novo. RESULTS: When evaluated on the Rosetta antibody benchmark dataset of 49 targets, DeepH3-predicted potentials identified better, same and worse structures [measured by root-mean-squared distance (RMSD) from the experimental CDR H3 loop structure] than the standard Rosetta energy function for 33, 6 and 10 targets, respectively, and improved the average RMSD of predictions by 32.1% (1.4 Å). Analysis of individual geometric potentials revealed that inter-residue orientations were more effective than inter-residue distances for discriminating near-native CDR H3 loops. When applied to de novo prediction of CDR H3 loop structures, DeepH3 achieves an average RMSD of 2.2 ± 1.1 Å on the Rosetta antibody benchmark. AVAILABILITY AND IMPLEMENTATION: DeepH3 source code and pre-trained model parameters are freely available at https://github.com/Graylab/deepH3-distances-orientations. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07 2020-07-13 /pmc/articles/PMC7355305/ /pubmed/32657412 http://dx.doi.org/10.1093/bioinformatics/btaa457 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Macromolecular Sequence, Structure, and Function
Ruffolo, Jeffrey A
Guerra, Carlos
Mahajan, Sai Pooja
Sulam, Jeremias
Gray, Jeffrey J
Geometric potentials from deep learning improve prediction of CDR H3 loop structures
title Geometric potentials from deep learning improve prediction of CDR H3 loop structures
title_full Geometric potentials from deep learning improve prediction of CDR H3 loop structures
title_fullStr Geometric potentials from deep learning improve prediction of CDR H3 loop structures
title_full_unstemmed Geometric potentials from deep learning improve prediction of CDR H3 loop structures
title_short Geometric potentials from deep learning improve prediction of CDR H3 loop structures
title_sort geometric potentials from deep learning improve prediction of cdr h3 loop structures
topic Macromolecular Sequence, Structure, and Function
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355305/
https://www.ncbi.nlm.nih.gov/pubmed/32657412
http://dx.doi.org/10.1093/bioinformatics/btaa457
work_keys_str_mv AT ruffolojeffreya geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures
AT guerracarlos geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures
AT mahajansaipooja geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures
AT sulamjeremias geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures
AT grayjeffreyj geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures