Cargando…

Computational Protein Design with Deep Learning Neural Networks

Computational protein design has a wide variety of applications. Despite its remarkable success, designing a protein for a given structure and function is still a challenging task. On the other hand, the number of solved protein structures is rapidly increasing while the number of unique protein fol...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jingxue, Cao, Huali, Zhang, John Z. H., Qi, Yifei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5910428/
https://www.ncbi.nlm.nih.gov/pubmed/29679026
http://dx.doi.org/10.1038/s41598-018-24760-x
_version_ 1783316037699108864
author Wang, Jingxue
Cao, Huali
Zhang, John Z. H.
Qi, Yifei
author_facet Wang, Jingxue
Cao, Huali
Zhang, John Z. H.
Qi, Yifei
author_sort Wang, Jingxue
collection PubMed
description Computational protein design has a wide variety of applications. Despite its remarkable success, designing a protein for a given structure and function is still a challenging task. On the other hand, the number of solved protein structures is rapidly increasing while the number of unique protein folds has reached a steady number, suggesting more structural information is being accumulated on each fold. Deep learning neural network is a powerful method to learn such big data set and has shown superior performance in many machine learning fields. In this study, we applied the deep learning neural network approach to computational protein design for predicting the probability of 20 natural amino acids on each residue in a protein. A large set of protein structures was collected and a multi-layer neural network was constructed. A number of structural properties were extracted as input features and the best network achieved an accuracy of 38.3%. Using the network output as residue type restraints improves the average sequence identity in designing three natural proteins using Rosetta. Moreover, the predictions from our network show ~3% higher sequence identity than a previous method. Results from this study may benefit further development of computational protein design methods.
format Online
Article
Text
id pubmed-5910428
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-59104282018-04-30 Computational Protein Design with Deep Learning Neural Networks Wang, Jingxue Cao, Huali Zhang, John Z. H. Qi, Yifei Sci Rep Article Computational protein design has a wide variety of applications. Despite its remarkable success, designing a protein for a given structure and function is still a challenging task. On the other hand, the number of solved protein structures is rapidly increasing while the number of unique protein folds has reached a steady number, suggesting more structural information is being accumulated on each fold. Deep learning neural network is a powerful method to learn such big data set and has shown superior performance in many machine learning fields. In this study, we applied the deep learning neural network approach to computational protein design for predicting the probability of 20 natural amino acids on each residue in a protein. A large set of protein structures was collected and a multi-layer neural network was constructed. A number of structural properties were extracted as input features and the best network achieved an accuracy of 38.3%. Using the network output as residue type restraints improves the average sequence identity in designing three natural proteins using Rosetta. Moreover, the predictions from our network show ~3% higher sequence identity than a previous method. Results from this study may benefit further development of computational protein design methods. Nature Publishing Group UK 2018-04-20 /pmc/articles/PMC5910428/ /pubmed/29679026 http://dx.doi.org/10.1038/s41598-018-24760-x Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Wang, Jingxue
Cao, Huali
Zhang, John Z. H.
Qi, Yifei
Computational Protein Design with Deep Learning Neural Networks
title Computational Protein Design with Deep Learning Neural Networks
title_full Computational Protein Design with Deep Learning Neural Networks
title_fullStr Computational Protein Design with Deep Learning Neural Networks
title_full_unstemmed Computational Protein Design with Deep Learning Neural Networks
title_short Computational Protein Design with Deep Learning Neural Networks
title_sort computational protein design with deep learning neural networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5910428/
https://www.ncbi.nlm.nih.gov/pubmed/29679026
http://dx.doi.org/10.1038/s41598-018-24760-x
work_keys_str_mv AT wangjingxue computationalproteindesignwithdeeplearningneuralnetworks
AT caohuali computationalproteindesignwithdeeplearningneuralnetworks
AT zhangjohnzh computationalproteindesignwithdeeplearningneuralnetworks
AT qiyifei computationalproteindesignwithdeeplearningneuralnetworks