Cargando…
Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks
Predicting mutation-induced changes in protein thermodynamic stability (ΔΔG) is of great interest in protein engineering, variant interpretation, and protein biophysics. We introduce ThermoNet, a deep, 3D-convolutional neural network (3D-CNN) designed for structure-based prediction of ΔΔGs upon poin...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728386/ https://www.ncbi.nlm.nih.gov/pubmed/33253214 http://dx.doi.org/10.1371/journal.pcbi.1008291 |
_version_ | 1783621265198678016 |
---|---|
author | Li, Bian Yang, Yucheng T. Capra, John A. Gerstein, Mark B. |
author_facet | Li, Bian Yang, Yucheng T. Capra, John A. Gerstein, Mark B. |
author_sort | Li, Bian |
collection | PubMed |
description | Predicting mutation-induced changes in protein thermodynamic stability (ΔΔG) is of great interest in protein engineering, variant interpretation, and protein biophysics. We introduce ThermoNet, a deep, 3D-convolutional neural network (3D-CNN) designed for structure-based prediction of ΔΔGs upon point mutation. To leverage the image-processing power inherent in CNNs, we treat protein structures as if they were multi-channel 3D images. In particular, the inputs to ThermoNet are uniformly constructed as multi-channel voxel grids based on biophysical properties derived from raw atom coordinates. We train and evaluate ThermoNet with a curated data set that accounts for protein homology and is balanced with direct and reverse mutations; this provides a framework for addressing biases that have likely influenced many previous ΔΔG prediction methods. ThermoNet demonstrates performance comparable to the best available methods on the widely used S(sym) test set. In addition, ThermoNet accurately predicts the effects of both stabilizing and destabilizing mutations, while most other methods exhibit a strong bias towards predicting destabilization. We further show that homology between S(sym) and widely used training sets like S2648 and VariBench has likely led to overestimated performance in previous studies. Finally, we demonstrate the practical utility of ThermoNet in predicting the ΔΔGs for two clinically relevant proteins, p53 and myoglobin, and for pathogenic and benign missense variants from ClinVar. Overall, our results suggest that 3D-CNNs can model the complex, non-linear interactions perturbed by mutations, directly from biophysical properties of atoms. |
format | Online Article Text |
id | pubmed-7728386 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-77283862020-12-17 Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks Li, Bian Yang, Yucheng T. Capra, John A. Gerstein, Mark B. PLoS Comput Biol Research Article Predicting mutation-induced changes in protein thermodynamic stability (ΔΔG) is of great interest in protein engineering, variant interpretation, and protein biophysics. We introduce ThermoNet, a deep, 3D-convolutional neural network (3D-CNN) designed for structure-based prediction of ΔΔGs upon point mutation. To leverage the image-processing power inherent in CNNs, we treat protein structures as if they were multi-channel 3D images. In particular, the inputs to ThermoNet are uniformly constructed as multi-channel voxel grids based on biophysical properties derived from raw atom coordinates. We train and evaluate ThermoNet with a curated data set that accounts for protein homology and is balanced with direct and reverse mutations; this provides a framework for addressing biases that have likely influenced many previous ΔΔG prediction methods. ThermoNet demonstrates performance comparable to the best available methods on the widely used S(sym) test set. In addition, ThermoNet accurately predicts the effects of both stabilizing and destabilizing mutations, while most other methods exhibit a strong bias towards predicting destabilization. We further show that homology between S(sym) and widely used training sets like S2648 and VariBench has likely led to overestimated performance in previous studies. Finally, we demonstrate the practical utility of ThermoNet in predicting the ΔΔGs for two clinically relevant proteins, p53 and myoglobin, and for pathogenic and benign missense variants from ClinVar. Overall, our results suggest that 3D-CNNs can model the complex, non-linear interactions perturbed by mutations, directly from biophysical properties of atoms. Public Library of Science 2020-11-30 /pmc/articles/PMC7728386/ /pubmed/33253214 http://dx.doi.org/10.1371/journal.pcbi.1008291 Text en © 2020 Li et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Li, Bian Yang, Yucheng T. Capra, John A. Gerstein, Mark B. Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks |
title | Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks |
title_full | Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks |
title_fullStr | Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks |
title_full_unstemmed | Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks |
title_short | Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks |
title_sort | predicting changes in protein thermodynamic stability upon point mutation with deep 3d convolutional neural networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728386/ https://www.ncbi.nlm.nih.gov/pubmed/33253214 http://dx.doi.org/10.1371/journal.pcbi.1008291 |
work_keys_str_mv | AT libian predictingchangesinproteinthermodynamicstabilityuponpointmutationwithdeep3dconvolutionalneuralnetworks AT yangyuchengt predictingchangesinproteinthermodynamicstabilityuponpointmutationwithdeep3dconvolutionalneuralnetworks AT caprajohna predictingchangesinproteinthermodynamicstabilityuponpointmutationwithdeep3dconvolutionalneuralnetworks AT gersteinmarkb predictingchangesinproteinthermodynamicstabilityuponpointmutationwithdeep3dconvolutionalneuralnetworks |