Cargando…
CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction
Residue co-evolution has become the primary principle for estimating inter-residue distances of a protein, which are crucially important for predicting protein structure. Most existing approaches adopt an indirect strategy, i.e., inferring residue co-evolution based on some hand-crafted features, sa...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8100175/ https://www.ncbi.nlm.nih.gov/pubmed/33953201 http://dx.doi.org/10.1038/s41467-021-22869-8 |
_version_ | 1783688725679570944 |
---|---|
author | Ju, Fusong Zhu, Jianwei Shao, Bin Kong, Lupeng Liu, Tie-Yan Zheng, Wei-Mou Bu, Dongbo |
author_facet | Ju, Fusong Zhu, Jianwei Shao, Bin Kong, Lupeng Liu, Tie-Yan Zheng, Wei-Mou Bu, Dongbo |
author_sort | Ju, Fusong |
collection | PubMed |
description | Residue co-evolution has become the primary principle for estimating inter-residue distances of a protein, which are crucially important for predicting protein structure. Most existing approaches adopt an indirect strategy, i.e., inferring residue co-evolution based on some hand-crafted features, say, a covariance matrix, calculated from multiple sequence alignment (MSA) of target protein. This indirect strategy, however, cannot fully exploit the information carried by MSA. Here, we report an end-to-end deep neural network, CopulaNet, to estimate residue co-evolution directly from MSA. The key elements of CopulaNet include: (i) an encoder to model context-specific mutation for each residue; (ii) an aggregator to model residue co-evolution, and thereafter estimate inter-residue distances. Using CASP13 (the 13th Critical Assessment of Protein Structure Prediction) target proteins as representatives, we demonstrate that CopulaNet can predict protein structure with improved accuracy and efficiency. This study represents a step toward improved end-to-end prediction of inter-residue distances and protein tertiary structures. |
format | Online Article Text |
id | pubmed-8100175 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-81001752021-05-11 CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction Ju, Fusong Zhu, Jianwei Shao, Bin Kong, Lupeng Liu, Tie-Yan Zheng, Wei-Mou Bu, Dongbo Nat Commun Article Residue co-evolution has become the primary principle for estimating inter-residue distances of a protein, which are crucially important for predicting protein structure. Most existing approaches adopt an indirect strategy, i.e., inferring residue co-evolution based on some hand-crafted features, say, a covariance matrix, calculated from multiple sequence alignment (MSA) of target protein. This indirect strategy, however, cannot fully exploit the information carried by MSA. Here, we report an end-to-end deep neural network, CopulaNet, to estimate residue co-evolution directly from MSA. The key elements of CopulaNet include: (i) an encoder to model context-specific mutation for each residue; (ii) an aggregator to model residue co-evolution, and thereafter estimate inter-residue distances. Using CASP13 (the 13th Critical Assessment of Protein Structure Prediction) target proteins as representatives, we demonstrate that CopulaNet can predict protein structure with improved accuracy and efficiency. This study represents a step toward improved end-to-end prediction of inter-residue distances and protein tertiary structures. Nature Publishing Group UK 2021-05-05 /pmc/articles/PMC8100175/ /pubmed/33953201 http://dx.doi.org/10.1038/s41467-021-22869-8 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Ju, Fusong Zhu, Jianwei Shao, Bin Kong, Lupeng Liu, Tie-Yan Zheng, Wei-Mou Bu, Dongbo CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction |
title | CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction |
title_full | CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction |
title_fullStr | CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction |
title_full_unstemmed | CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction |
title_short | CopulaNet: Learning residue co-evolution directly from multiple sequence alignment for protein structure prediction |
title_sort | copulanet: learning residue co-evolution directly from multiple sequence alignment for protein structure prediction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8100175/ https://www.ncbi.nlm.nih.gov/pubmed/33953201 http://dx.doi.org/10.1038/s41467-021-22869-8 |
work_keys_str_mv | AT jufusong copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction AT zhujianwei copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction AT shaobin copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction AT konglupeng copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction AT liutieyan copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction AT zhengweimou copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction AT budongbo copulanetlearningresiduecoevolutiondirectlyfrommultiplesequencealignmentforproteinstructureprediction |