Cargando…

CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction

Residue co-evolution has become the primary principle for estimating inter-residue distances of a protein, which are crucially important for predicting protein structure. Most existing approaches adopt an indirect strategy, i.e., inferring residue co-evolution based on some hand-crafted features, sa...

Descripción completa

Detalles Bibliográficos
Autores principales: Ju, Fusong, Zhu, Jianwei, Shao, Bin, Kong, Lupeng, Liu, Tie-Yan, Zheng, Wei-Mou, Bu, Dongbo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8100175/
https://www.ncbi.nlm.nih.gov/pubmed/33953201
http://dx.doi.org/10.1038/s41467-021-22869-8
_version_ 1783688725679570944
author Ju, Fusong
Zhu, Jianwei
Shao, Bin
Kong, Lupeng
Liu, Tie-Yan
Zheng, Wei-Mou
Bu, Dongbo
author_facet Ju, Fusong
Zhu, Jianwei
Shao, Bin
Kong, Lupeng
Liu, Tie-Yan
Zheng, Wei-Mou
Bu, Dongbo
author_sort Ju, Fusong
collection PubMed
description Residue co-evolution has become the primary principle for estimating inter-residue distances of a protein, which are crucially important for predicting protein structure. Most existing approaches adopt an indirect strategy, i.e., inferring residue co-evolution based on some hand-crafted features, say, a covariance matrix, calculated from multiple sequence alignment (MSA) of target protein. This indirect strategy, however, cannot fully exploit the information carried by MSA. Here, we report an end-to-end deep neural network, CopulaNet, to estimate residue co-evolution directly from MSA. The key elements of CopulaNet include: (i) an encoder to model context-specific mutation for each residue; (ii) an aggregator to model residue co-evolution, and thereafter estimate inter-residue distances. Using CASP13 (the 13th Critical Assessment of Protein Structure Prediction) target proteins as representatives, we demonstrate that CopulaNet can predict protein structure with improved accuracy and efficiency. This study represents a step toward improved end-to-end prediction of inter-residue distances and protein tertiary structures.
format Online
Article
Text
id pubmed-8100175
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-81001752021-05-11 CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction Ju, Fusong Zhu, Jianwei Shao, Bin Kong, Lupeng Liu, Tie-Yan Zheng, Wei-Mou Bu, Dongbo Nat Commun Article Residue co-evolution has become the primary principle for estimating inter-residue distances of a protein, which are crucially important for predicting protein structure. Most existing approaches adopt an indirect strategy, i.e., inferring residue co-evolution based on some hand-crafted features, say, a covariance matrix, calculated from multiple sequence alignment (MSA) of target protein. This indirect strategy, however, cannot fully exploit the information carried by MSA. Here, we report an end-to-end deep neural network, CopulaNet, to estimate residue co-evolution directly from MSA. The key elements of CopulaNet include: (i) an encoder to model context-specific mutation for each residue; (ii) an aggregator to model residue co-evolution, and thereafter estimate inter-residue distances. Using CASP13 (the 13th Critical Assessment of Protein Structure Prediction) target proteins as representatives, we demonstrate that CopulaNet can predict protein structure with improved accuracy and efficiency. This study represents a step toward improved end-to-end prediction of inter-residue distances and protein tertiary structures. Nature Publishing Group UK 2021-05-05 /pmc/articles/PMC8100175/ /pubmed/33953201 http://dx.doi.org/10.1038/s41467-021-22869-8 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Ju, Fusong
Zhu, Jianwei
Shao, Bin
Kong, Lupeng
Liu, Tie-Yan
Zheng, Wei-Mou
Bu, Dongbo
CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction
title CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction
title_full CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction
title_fullStr CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction
title_full_unstemmed CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction
title_short CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction
title_sort copulanet: learning residue co-evolution directly from multiple sequence alignment for protein structure prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8100175/
https://www.ncbi.nlm.nih.gov/pubmed/33953201
http://dx.doi.org/10.1038/s41467-021-22869-8
work_keys_str_mv AT jufusong copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction
AT zhujianwei copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction
AT shaobin copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction
AT konglupeng copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction
AT liutieyan copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction
AT zhengweimou copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction
AT budongbo copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction