Cargando…
Design of metalloproteins and novel protein folds using variational autoencoders
The design of novel proteins has many applications but remains an attritional process with success in isolated cases. Meanwhile, deep learning technologies have exploded in popularity in recent years and are increasingly applicable to biology due to the rise in available data. We attempt to link pro...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6212568/ https://www.ncbi.nlm.nih.gov/pubmed/30385875 http://dx.doi.org/10.1038/s41598-018-34533-1 |
_version_ | 1783367567774056448 |
---|---|
author | Greener, Joe G. Moffat, Lewis Jones, David T |
author_facet | Greener, Joe G. Moffat, Lewis Jones, David T |
author_sort | Greener, Joe G. |
collection | PubMed |
description | The design of novel proteins has many applications but remains an attritional process with success in isolated cases. Meanwhile, deep learning technologies have exploded in popularity in recent years and are increasingly applicable to biology due to the rise in available data. We attempt to link protein design and deep learning by using variational autoencoders to generate protein sequences conditioned on desired properties. Potential copper and calcium binding sites are added to non-metal binding proteins without human intervention and compared to a hidden Markov model. In another use case, a grammar of protein structures is developed and used to produce sequences for a novel protein topology. One candidate structure is found to be stable by molecular dynamics simulation. The ability of our model to confine the vast search space of protein sequences and to scale easily has the potential to assist in a variety of protein design tasks. |
format | Online Article Text |
id | pubmed-6212568 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-62125682018-11-06 Design of metalloproteins and novel protein folds using variational autoencoders Greener, Joe G. Moffat, Lewis Jones, David T Sci Rep Article The design of novel proteins has many applications but remains an attritional process with success in isolated cases. Meanwhile, deep learning technologies have exploded in popularity in recent years and are increasingly applicable to biology due to the rise in available data. We attempt to link protein design and deep learning by using variational autoencoders to generate protein sequences conditioned on desired properties. Potential copper and calcium binding sites are added to non-metal binding proteins without human intervention and compared to a hidden Markov model. In another use case, a grammar of protein structures is developed and used to produce sequences for a novel protein topology. One candidate structure is found to be stable by molecular dynamics simulation. The ability of our model to confine the vast search space of protein sequences and to scale easily has the potential to assist in a variety of protein design tasks. Nature Publishing Group UK 2018-11-01 /pmc/articles/PMC6212568/ /pubmed/30385875 http://dx.doi.org/10.1038/s41598-018-34533-1 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Greener, Joe G. Moffat, Lewis Jones, David T Design of metalloproteins and novel protein folds using variational autoencoders |
title | Design of metalloproteins and novel protein folds using variational autoencoders |
title_full | Design of metalloproteins and novel protein folds using variational autoencoders |
title_fullStr | Design of metalloproteins and novel protein folds using variational autoencoders |
title_full_unstemmed | Design of metalloproteins and novel protein folds using variational autoencoders |
title_short | Design of metalloproteins and novel protein folds using variational autoencoders |
title_sort | design of metalloproteins and novel protein folds using variational autoencoders |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6212568/ https://www.ncbi.nlm.nih.gov/pubmed/30385875 http://dx.doi.org/10.1038/s41598-018-34533-1 |
work_keys_str_mv | AT greenerjoeg designofmetalloproteinsandnovelproteinfoldsusingvariationalautoencoders AT moffatlewis designofmetalloproteinsandnovelproteinfoldsusingvariationalautoencoders AT jonesdavidt designofmetalloproteinsandnovelproteinfoldsusingvariationalautoencoders |