Cargando…

Improved protein structure prediction by deep learning irrespective of co-evolution information

Predicting the tertiary structure of a protein from its primary sequence has been greatly improved by integrating deep learning and co-evolutionary analysis, as shown in CASP13 and CASP14. We describe our latest study of this idea, analyzing the efficacy of network size and co-evolution data and its...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Jinbo, Mcpartlon, Matthew, Li, Jin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8340610/
https://www.ncbi.nlm.nih.gov/pubmed/34368623
http://dx.doi.org/10.1038/s42256-021-00348-5
_version_ 1783733806465810432
author Xu, Jinbo
Mcpartlon, Matthew
Li, Jin
author_facet Xu, Jinbo
Mcpartlon, Matthew
Li, Jin
author_sort Xu, Jinbo
collection PubMed
description Predicting the tertiary structure of a protein from its primary sequence has been greatly improved by integrating deep learning and co-evolutionary analysis, as shown in CASP13 and CASP14. We describe our latest study of this idea, analyzing the efficacy of network size and co-evolution data and its performance on both natural and designed proteins. We show that a large ResNet (convolutional residual neural networks) can predict structures of correct folds for 26 out of 32 CASP13 free-modeling (FM) targets and L/5 long-range contacts with precision over 80%. When co-evolution is not used ResNet still can predict structures of correct folds for 18 CASP13 FM targets, greatly exceeding previous methods that do not use co-evolution either. Even with only primary sequence ResNet can predict structures of correct folds for all tested human-designed proteins. In addition, ResNet may fare better for the designed proteins when trained without co-evolution than with co-evolution. These results suggest that ResNet does not simply denoise co-evolution signals, but instead may learn important protein sequence-structure relationship. This has important implications on protein design and engineering especially when co-evolutionary data is unavailable.
format Online
Article
Text
id pubmed-8340610
institution National Center for Biotechnology Information
language English
publishDate 2021
record_format MEDLINE/PubMed
spelling pubmed-83406102022-01-01 Improved protein structure prediction by deep learning irrespective of co-evolution information Xu, Jinbo Mcpartlon, Matthew Li, Jin Nat Mach Intell Article Predicting the tertiary structure of a protein from its primary sequence has been greatly improved by integrating deep learning and co-evolutionary analysis, as shown in CASP13 and CASP14. We describe our latest study of this idea, analyzing the efficacy of network size and co-evolution data and its performance on both natural and designed proteins. We show that a large ResNet (convolutional residual neural networks) can predict structures of correct folds for 26 out of 32 CASP13 free-modeling (FM) targets and L/5 long-range contacts with precision over 80%. When co-evolution is not used ResNet still can predict structures of correct folds for 18 CASP13 FM targets, greatly exceeding previous methods that do not use co-evolution either. Even with only primary sequence ResNet can predict structures of correct folds for all tested human-designed proteins. In addition, ResNet may fare better for the designed proteins when trained without co-evolution than with co-evolution. These results suggest that ResNet does not simply denoise co-evolution signals, but instead may learn important protein sequence-structure relationship. This has important implications on protein design and engineering especially when co-evolutionary data is unavailable. 2021-05-20 2021-07 /pmc/articles/PMC8340610/ /pubmed/34368623 http://dx.doi.org/10.1038/s42256-021-00348-5 Text en http://www.nature.com/authors/editorial_policies/license.html#termsUsers may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Xu, Jinbo
Mcpartlon, Matthew
Li, Jin
Improved protein structure prediction by deep learning irrespective of co-evolution information
title Improved protein structure prediction by deep learning irrespective of co-evolution information
title_full Improved protein structure prediction by deep learning irrespective of co-evolution information
title_fullStr Improved protein structure prediction by deep learning irrespective of co-evolution information
title_full_unstemmed Improved protein structure prediction by deep learning irrespective of co-evolution information
title_short Improved protein structure prediction by deep learning irrespective of co-evolution information
title_sort improved protein structure prediction by deep learning irrespective of co-evolution information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8340610/
https://www.ncbi.nlm.nih.gov/pubmed/34368623
http://dx.doi.org/10.1038/s42256-021-00348-5
work_keys_str_mv AT xujinbo improvedproteinstructurepredictionbydeeplearningirrespectiveofcoevolutioninformation
AT mcpartlonmatthew improvedproteinstructurepredictionbydeeplearningirrespectiveofcoevolutioninformation
AT lijin improvedproteinstructurepredictionbydeeplearningirrespectiveofcoevolutioninformation