Cargando…

Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks

Predicting mutation-induced changes in protein thermodynamic stability (ΔΔG) is of great interest in protein engineering, variant interpretation, and protein biophysics. We introduce ThermoNet, a deep, 3D-convolutional neural network (3D-CNN) designed for structure-based prediction of ΔΔGs upon poin...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Bian, Yang, Yucheng T., Capra, John A., Gerstein, Mark B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728386/
https://www.ncbi.nlm.nih.gov/pubmed/33253214
http://dx.doi.org/10.1371/journal.pcbi.1008291
_version_ 1783621265198678016
author Li, Bian
Yang, Yucheng T.
Capra, John A.
Gerstein, Mark B.
author_facet Li, Bian
Yang, Yucheng T.
Capra, John A.
Gerstein, Mark B.
author_sort Li, Bian
collection PubMed
description Predicting mutation-induced changes in protein thermodynamic stability (ΔΔG) is of great interest in protein engineering, variant interpretation, and protein biophysics. We introduce ThermoNet, a deep, 3D-convolutional neural network (3D-CNN) designed for structure-based prediction of ΔΔGs upon point mutation. To leverage the image-processing power inherent in CNNs, we treat protein structures as if they were multi-channel 3D images. In particular, the inputs to ThermoNet are uniformly constructed as multi-channel voxel grids based on biophysical properties derived from raw atom coordinates. We train and evaluate ThermoNet with a curated data set that accounts for protein homology and is balanced with direct and reverse mutations; this provides a framework for addressing biases that have likely influenced many previous ΔΔG prediction methods. ThermoNet demonstrates performance comparable to the best available methods on the widely used S(sym) test set. In addition, ThermoNet accurately predicts the effects of both stabilizing and destabilizing mutations, while most other methods exhibit a strong bias towards predicting destabilization. We further show that homology between S(sym) and widely used training sets like S2648 and VariBench has likely led to overestimated performance in previous studies. Finally, we demonstrate the practical utility of ThermoNet in predicting the ΔΔGs for two clinically relevant proteins, p53 and myoglobin, and for pathogenic and benign missense variants from ClinVar. Overall, our results suggest that 3D-CNNs can model the complex, non-linear interactions perturbed by mutations, directly from biophysical properties of atoms.
format Online
Article
Text
id pubmed-7728386
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-77283862020-12-17 Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks Li, Bian Yang, Yucheng T. Capra, John A. Gerstein, Mark B. PLoS Comput Biol Research Article Predicting mutation-induced changes in protein thermodynamic stability (ΔΔG) is of great interest in protein engineering, variant interpretation, and protein biophysics. We introduce ThermoNet, a deep, 3D-convolutional neural network (3D-CNN) designed for structure-based prediction of ΔΔGs upon point mutation. To leverage the image-processing power inherent in CNNs, we treat protein structures as if they were multi-channel 3D images. In particular, the inputs to ThermoNet are uniformly constructed as multi-channel voxel grids based on biophysical properties derived from raw atom coordinates. We train and evaluate ThermoNet with a curated data set that accounts for protein homology and is balanced with direct and reverse mutations; this provides a framework for addressing biases that have likely influenced many previous ΔΔG prediction methods. ThermoNet demonstrates performance comparable to the best available methods on the widely used S(sym) test set. In addition, ThermoNet accurately predicts the effects of both stabilizing and destabilizing mutations, while most other methods exhibit a strong bias towards predicting destabilization. We further show that homology between S(sym) and widely used training sets like S2648 and VariBench has likely led to overestimated performance in previous studies. Finally, we demonstrate the practical utility of ThermoNet in predicting the ΔΔGs for two clinically relevant proteins, p53 and myoglobin, and for pathogenic and benign missense variants from ClinVar. Overall, our results suggest that 3D-CNNs can model the complex, non-linear interactions perturbed by mutations, directly from biophysical properties of atoms. Public Library of Science 2020-11-30 /pmc/articles/PMC7728386/ /pubmed/33253214 http://dx.doi.org/10.1371/journal.pcbi.1008291 Text en © 2020 Li et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Li, Bian
Yang, Yucheng T.
Capra, John A.
Gerstein, Mark B.
Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks
title Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks
title_full Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks
title_fullStr Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks
title_full_unstemmed Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks
title_short Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks
title_sort predicting changes in protein thermodynamic stability upon point mutation with deep 3d convolutional neural networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728386/
https://www.ncbi.nlm.nih.gov/pubmed/33253214
http://dx.doi.org/10.1371/journal.pcbi.1008291
work_keys_str_mv AT libian predictingchangesinproteinthermodynamicstabilityuponpointmutationwithdeep3dconvolutionalneuralnetworks
AT yangyuchengt predictingchangesinproteinthermodynamicstabilityuponpointmutationwithdeep3dconvolutionalneuralnetworks
AT caprajohna predictingchangesinproteinthermodynamicstabilityuponpointmutationwithdeep3dconvolutionalneuralnetworks
AT gersteinmarkb predictingchangesinproteinthermodynamicstabilityuponpointmutationwithdeep3dconvolutionalneuralnetworks