Cargando…

Geometric potentials from deep learning improve prediction of CDR H3 loop structures

MOTIVATION: Antibody structure is largely conserved, except for a complementarity-determining region featuring six variable loops. Five of these loops adopt canonical folds which can typically be predicted with existing methods, while the remaining loop (CDR H3) remains a challenge due to its highly...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ruffolo, Jeffrey A, Guerra, Carlos, Mahajan, Sai Pooja, Sulam, Jeremias, Gray, Jeffrey J
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2020
Materias:	Macromolecular Sequence, Structure, and Function
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355305/ https://www.ncbi.nlm.nih.gov/pubmed/32657412 http://dx.doi.org/10.1093/bioinformatics/btaa457

_version_	1783558249088286720
author	Ruffolo, Jeffrey A Guerra, Carlos Mahajan, Sai Pooja Sulam, Jeremias Gray, Jeffrey J
author_facet	Ruffolo, Jeffrey A Guerra, Carlos Mahajan, Sai Pooja Sulam, Jeremias Gray, Jeffrey J
author_sort	Ruffolo, Jeffrey A
collection	PubMed
description	MOTIVATION: Antibody structure is largely conserved, except for a complementarity-determining region featuring six variable loops. Five of these loops adopt canonical folds which can typically be predicted with existing methods, while the remaining loop (CDR H3) remains a challenge due to its highly diverse set of observed conformations. In recent years, deep neural networks have proven to be effective at capturing the complex patterns of protein structure. This work proposes DeepH3, a deep residual neural network that learns to predict inter-residue distances and orientations from antibody heavy and light chain sequence. The output of DeepH3 is a set of probability distributions over distances and orientation angles between pairs of residues. These distributions are converted to geometric potentials and used to discriminate between decoy structures produced by RosettaAntibody and predict new CDR H3 loop structures de novo. RESULTS: When evaluated on the Rosetta antibody benchmark dataset of 49 targets, DeepH3-predicted potentials identified better, same and worse structures [measured by root-mean-squared distance (RMSD) from the experimental CDR H3 loop structure] than the standard Rosetta energy function for 33, 6 and 10 targets, respectively, and improved the average RMSD of predictions by 32.1% (1.4 Å). Analysis of individual geometric potentials revealed that inter-residue orientations were more effective than inter-residue distances for discriminating near-native CDR H3 loops. When applied to de novo prediction of CDR H3 loop structures, DeepH3 achieves an average RMSD of 2.2 ± 1.1 Å on the Rosetta antibody benchmark. AVAILABILITY AND IMPLEMENTATION: DeepH3 source code and pre-trained model parameters are freely available at https://github.com/Graylab/deepH3-distances-orientations. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format	Online Article Text
id	pubmed-7355305
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-73553052020-07-16 Geometric potentials from deep learning improve prediction of CDR H3 loop structures Ruffolo, Jeffrey A Guerra, Carlos Mahajan, Sai Pooja Sulam, Jeremias Gray, Jeffrey J Bioinformatics Macromolecular Sequence, Structure, and Function MOTIVATION: Antibody structure is largely conserved, except for a complementarity-determining region featuring six variable loops. Five of these loops adopt canonical folds which can typically be predicted with existing methods, while the remaining loop (CDR H3) remains a challenge due to its highly diverse set of observed conformations. In recent years, deep neural networks have proven to be effective at capturing the complex patterns of protein structure. This work proposes DeepH3, a deep residual neural network that learns to predict inter-residue distances and orientations from antibody heavy and light chain sequence. The output of DeepH3 is a set of probability distributions over distances and orientation angles between pairs of residues. These distributions are converted to geometric potentials and used to discriminate between decoy structures produced by RosettaAntibody and predict new CDR H3 loop structures de novo. RESULTS: When evaluated on the Rosetta antibody benchmark dataset of 49 targets, DeepH3-predicted potentials identified better, same and worse structures [measured by root-mean-squared distance (RMSD) from the experimental CDR H3 loop structure] than the standard Rosetta energy function for 33, 6 and 10 targets, respectively, and improved the average RMSD of predictions by 32.1% (1.4 Å). Analysis of individual geometric potentials revealed that inter-residue orientations were more effective than inter-residue distances for discriminating near-native CDR H3 loops. When applied to de novo prediction of CDR H3 loop structures, DeepH3 achieves an average RMSD of 2.2 ± 1.1 Å on the Rosetta antibody benchmark. AVAILABILITY AND IMPLEMENTATION: DeepH3 source code and pre-trained model parameters are freely available at https://github.com/Graylab/deepH3-distances-orientations. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07 2020-07-13 /pmc/articles/PMC7355305/ /pubmed/32657412 http://dx.doi.org/10.1093/bioinformatics/btaa457 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle	Macromolecular Sequence, Structure, and Function Ruffolo, Jeffrey A Guerra, Carlos Mahajan, Sai Pooja Sulam, Jeremias Gray, Jeffrey J Geometric potentials from deep learning improve prediction of CDR H3 loop structures
title	Geometric potentials from deep learning improve prediction of CDR H3 loop structures
title_full	Geometric potentials from deep learning improve prediction of CDR H3 loop structures
title_fullStr	Geometric potentials from deep learning improve prediction of CDR H3 loop structures
title_full_unstemmed	Geometric potentials from deep learning improve prediction of CDR H3 loop structures
title_short	Geometric potentials from deep learning improve prediction of CDR H3 loop structures
title_sort	geometric potentials from deep learning improve prediction of cdr h3 loop structures
topic	Macromolecular Sequence, Structure, and Function
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355305/ https://www.ncbi.nlm.nih.gov/pubmed/32657412 http://dx.doi.org/10.1093/bioinformatics/btaa457
work_keys_str_mv	AT ruffolojeffreya geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures AT guerracarlos geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures AT mahajansaipooja geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures AT sulamjeremias geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures AT grayjeffreyj geometricpotentialsfromdeeplearningimprovepredictionofcdrh3loopstructures

Geometric potentials from deep learning improve prediction of CDR H3 loop structures

Ejemplares similares