Cargando…

Design of metalloproteins and novel protein folds using variational autoencoders

The design of novel proteins has many applications but remains an attritional process with success in isolated cases. Meanwhile, deep learning technologies have exploded in popularity in recent years and are increasingly applicable to biology due to the rise in available data. We attempt to link pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Greener, Joe G., Moffat, Lewis, Jones, David T
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6212568/
https://www.ncbi.nlm.nih.gov/pubmed/30385875
http://dx.doi.org/10.1038/s41598-018-34533-1
_version_ 1783367567774056448
author Greener, Joe G.
Moffat, Lewis
Jones, David T
author_facet Greener, Joe G.
Moffat, Lewis
Jones, David T
author_sort Greener, Joe G.
collection PubMed
description The design of novel proteins has many applications but remains an attritional process with success in isolated cases. Meanwhile, deep learning technologies have exploded in popularity in recent years and are increasingly applicable to biology due to the rise in available data. We attempt to link protein design and deep learning by using variational autoencoders to generate protein sequences conditioned on desired properties. Potential copper and calcium binding sites are added to non-metal binding proteins without human intervention and compared to a hidden Markov model. In another use case, a grammar of protein structures is developed and used to produce sequences for a novel protein topology. One candidate structure is found to be stable by molecular dynamics simulation. The ability of our model to confine the vast search space of protein sequences and to scale easily has the potential to assist in a variety of protein design tasks.
format Online
Article
Text
id pubmed-6212568
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-62125682018-11-06 Design of metalloproteins and novel protein folds using variational autoencoders Greener, Joe G. Moffat, Lewis Jones, David T Sci Rep Article The design of novel proteins has many applications but remains an attritional process with success in isolated cases. Meanwhile, deep learning technologies have exploded in popularity in recent years and are increasingly applicable to biology due to the rise in available data. We attempt to link protein design and deep learning by using variational autoencoders to generate protein sequences conditioned on desired properties. Potential copper and calcium binding sites are added to non-metal binding proteins without human intervention and compared to a hidden Markov model. In another use case, a grammar of protein structures is developed and used to produce sequences for a novel protein topology. One candidate structure is found to be stable by molecular dynamics simulation. The ability of our model to confine the vast search space of protein sequences and to scale easily has the potential to assist in a variety of protein design tasks. Nature Publishing Group UK 2018-11-01 /pmc/articles/PMC6212568/ /pubmed/30385875 http://dx.doi.org/10.1038/s41598-018-34533-1 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Greener, Joe G.
Moffat, Lewis
Jones, David T
Design of metalloproteins and novel protein folds using variational autoencoders
title Design of metalloproteins and novel protein folds using variational autoencoders
title_full Design of metalloproteins and novel protein folds using variational autoencoders
title_fullStr Design of metalloproteins and novel protein folds using variational autoencoders
title_full_unstemmed Design of metalloproteins and novel protein folds using variational autoencoders
title_short Design of metalloproteins and novel protein folds using variational autoencoders
title_sort design of metalloproteins and novel protein folds using variational autoencoders
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6212568/
https://www.ncbi.nlm.nih.gov/pubmed/30385875
http://dx.doi.org/10.1038/s41598-018-34533-1
work_keys_str_mv AT greenerjoeg designofmetalloproteinsandnovelproteinfoldsusingvariationalautoencoders
AT moffatlewis designofmetalloproteinsandnovelproteinfoldsusingvariationalautoencoders
AT jonesdavidt designofmetalloproteinsandnovelproteinfoldsusingvariationalautoencoders