Cargando…
Computational Protein Design with Deep Learning Neural Networks
Computational protein design has a wide variety of applications. Despite its remarkable success, designing a protein for a given structure and function is still a challenging task. On the other hand, the number of solved protein structures is rapidly increasing while the number of unique protein fol...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5910428/ https://www.ncbi.nlm.nih.gov/pubmed/29679026 http://dx.doi.org/10.1038/s41598-018-24760-x |
_version_ | 1783316037699108864 |
---|---|
author | Wang, Jingxue Cao, Huali Zhang, John Z. H. Qi, Yifei |
author_facet | Wang, Jingxue Cao, Huali Zhang, John Z. H. Qi, Yifei |
author_sort | Wang, Jingxue |
collection | PubMed |
description | Computational protein design has a wide variety of applications. Despite its remarkable success, designing a protein for a given structure and function is still a challenging task. On the other hand, the number of solved protein structures is rapidly increasing while the number of unique protein folds has reached a steady number, suggesting more structural information is being accumulated on each fold. Deep learning neural network is a powerful method to learn such big data set and has shown superior performance in many machine learning fields. In this study, we applied the deep learning neural network approach to computational protein design for predicting the probability of 20 natural amino acids on each residue in a protein. A large set of protein structures was collected and a multi-layer neural network was constructed. A number of structural properties were extracted as input features and the best network achieved an accuracy of 38.3%. Using the network output as residue type restraints improves the average sequence identity in designing three natural proteins using Rosetta. Moreover, the predictions from our network show ~3% higher sequence identity than a previous method. Results from this study may benefit further development of computational protein design methods. |
format | Online Article Text |
id | pubmed-5910428 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-59104282018-04-30 Computational Protein Design with Deep Learning Neural Networks Wang, Jingxue Cao, Huali Zhang, John Z. H. Qi, Yifei Sci Rep Article Computational protein design has a wide variety of applications. Despite its remarkable success, designing a protein for a given structure and function is still a challenging task. On the other hand, the number of solved protein structures is rapidly increasing while the number of unique protein folds has reached a steady number, suggesting more structural information is being accumulated on each fold. Deep learning neural network is a powerful method to learn such big data set and has shown superior performance in many machine learning fields. In this study, we applied the deep learning neural network approach to computational protein design for predicting the probability of 20 natural amino acids on each residue in a protein. A large set of protein structures was collected and a multi-layer neural network was constructed. A number of structural properties were extracted as input features and the best network achieved an accuracy of 38.3%. Using the network output as residue type restraints improves the average sequence identity in designing three natural proteins using Rosetta. Moreover, the predictions from our network show ~3% higher sequence identity than a previous method. Results from this study may benefit further development of computational protein design methods. Nature Publishing Group UK 2018-04-20 /pmc/articles/PMC5910428/ /pubmed/29679026 http://dx.doi.org/10.1038/s41598-018-24760-x Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Wang, Jingxue Cao, Huali Zhang, John Z. H. Qi, Yifei Computational Protein Design with Deep Learning Neural Networks |
title | Computational Protein Design with Deep Learning Neural Networks |
title_full | Computational Protein Design with Deep Learning Neural Networks |
title_fullStr | Computational Protein Design with Deep Learning Neural Networks |
title_full_unstemmed | Computational Protein Design with Deep Learning Neural Networks |
title_short | Computational Protein Design with Deep Learning Neural Networks |
title_sort | computational protein design with deep learning neural networks |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5910428/ https://www.ncbi.nlm.nih.gov/pubmed/29679026 http://dx.doi.org/10.1038/s41598-018-24760-x |
work_keys_str_mv | AT wangjingxue computationalproteindesignwithdeeplearningneuralnetworks AT caohuali computationalproteindesignwithdeeplearningneuralnetworks AT zhangjohnzh computationalproteindesignwithdeeplearningneuralnetworks AT qiyifei computationalproteindesignwithdeeplearningneuralnetworks |