Cargando…
Geometric potentials from deep learning improve prediction of CDR H3 loop structures
MOTIVATION: Antibody structure is largely conserved, except for a complementarity-determining region featuring six variable loops. Five of these loops adopt canonical folds which can typically be predicted with existing methods, while the remaining loop (CDR H3) remains a challenge due to its highly...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355305/ https://www.ncbi.nlm.nih.gov/pubmed/32657412 http://dx.doi.org/10.1093/bioinformatics/btaa457 |
_version_ | 1783558249088286720 |
---|---|
author | Ruffolo, Jeffrey A Guerra, Carlos Mahajan, Sai Pooja Sulam, Jeremias Gray, Jeffrey J |
author_facet | Ruffolo, Jeffrey A Guerra, Carlos Mahajan, Sai Pooja Sulam, Jeremias Gray, Jeffrey J |
author_sort | Ruffolo, Jeffrey A |
collection | PubMed |
description | MOTIVATION: Antibody structure is largely conserved, except for a complementarity-determining region featuring six variable loops. Five of these loops adopt canonical folds which can typically be predicted with existing methods, while the remaining loop (CDR H3) remains a challenge due to its highly diverse set of observed conformations. In recent years, deep neural networks have proven to be effective at capturing the complex patterns of protein structure. This work proposes DeepH3, a deep residual neural network that learns to predict inter-residue distances and orientations from antibody heavy and light chain sequence. The output of DeepH3 is a set of probability distributions over distances and orientation angles between pairs of residues. These distributions are converted to geometric potentials and used to discriminate between decoy structures produced by RosettaAntibody and predict new CDR H3 loop structures de novo. RESULTS: When evaluated on the Rosetta antibody benchmark dataset of 49 targets, DeepH3-predicted potentials identified better, same and worse structures [measured by root-mean-squared distance (RMSD) from the experimental CDR H3 loop structure] than the standard Rosetta energy function for 33, 6 and 10 targets, respectively, and improved the average RMSD of predictions by 32.1% (1.4 Å). Analysis of individual geometric potentials revealed that inter-residue orientations were more effective than inter-residue distances for discriminating near-native CDR H3 loops. When applied to de novo prediction of CDR H3 loop structures, DeepH3 achieves an average RMSD of 2.2 ± 1.1 Å on the Rosetta antibody benchmark. AVAILABILITY AND IMPLEMENTATION: DeepH3 source code and pre-trained model parameters are freely available at https://github.com/Graylab/deepH3-distances-orientations. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-7355305 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-73553052020-07-16 Geometric potentials from deep learning improve prediction of CDR H3 loop structures Ruffolo, Jeffrey A Guerra, Carlos Mahajan, Sai Pooja Sulam, Jeremias Gray, Jeffrey J Bioinformatics Macromolecular Sequence, Structure, and Function MOTIVATION: Antibody structure is largely conserved, except for a complementarity-determining region featuring six variable loops. Five of these loops adopt canonical folds which can typically be predicted with existing methods, while the remaining loop (CDR H3) remains a challenge due to its highly diverse set of observed conformations. In recent years, deep neural networks have proven to be effective at capturing the complex patterns of protein structure. This work proposes DeepH3, a deep residual neural network that learns to predict inter-residue distances and orientations from antibody heavy and light chain sequence. The output of DeepH3 is a set of probability distributions over distances and orientation angles between pairs of residues. These distributions are converted to geometric potentials and used to discriminate between decoy structures produced by RosettaAntibody and predict new CDR H3 loop structures de novo. RESULTS: When evaluated on the Rosetta antibody benchmark dataset of 49 targets, DeepH3-predicted potentials identified better, same and worse structures [measured by root-mean-squared distance (RMSD) from the experimental CDR H3 loop structure] than the standard Rosetta energy function for 33, 6 and 10 targets, respectively, and improved the average RMSD of predictions by 32.1% (1.4 Å). Analysis of individual geometric potentials revealed that inter-residue orientations were more effective than inter-residue distances for discriminating near-native CDR H3 loops. When applied to de novo prediction of CDR H3 loop structures, DeepH3 achieves an average RMSD of 2.2 ± 1.1 Å on the Rosetta antibody benchmark. AVAILABILITY AND IMPLEMENTATION: DeepH3 source code and pre-trained model parameters are freely available at https://github.com/Graylab/deepH3-distances-orientations. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07 2020-07-13 /pmc/articles/PMC7355305/ /pubmed/32657412 http://dx.doi.org/10.1093/bioinformatics/btaa457 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Macromolecular Sequence, Structure, and Function Ruffolo, Jeffrey A Guerra, Carlos Mahajan, Sai Pooja Sulam, Jeremias Gray, Jeffrey J Geometric potentials from deep learning improve prediction of CDR H3 loop structures |
title | Geometric potentials from deep learning improve prediction of CDR H3 loop structures |
title_full | Geometric potentials from deep learning improve prediction of CDR H3 loop structures |
title_fullStr | Geometric potentials from deep learning improve prediction of CDR H3 loop structures |
title_full_unstemmed | Geometric potentials from deep learning improve prediction of CDR H3 loop structures |
title_short | Geometric potentials from deep learning improve prediction of CDR H3 loop structures |
title_sort | geometric potentials from deep learning improve prediction of cdr h3 loop structures |
topic | Macromolecular Sequence, Structure, and Function |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355305/ https://www.ncbi.nlm.nih.gov/pubmed/32657412 http://dx.doi.org/10.1093/bioinformatics/btaa457 |
work_keys_str_mv | AT ruffolojeffreya geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures AT guerracarlos geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures AT mahajansaipooja geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures AT sulamjeremias geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures AT grayjeffreyj geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures |