Cargando…
Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction
Protein 3D structure prediction has advanced significantly in recent years due to improving contact prediction accuracy. This improvement has been largely due to deep learning approaches that predict inter-residue contacts and, more recently, distances using multiple sequence alignments (MSAs). In t...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8027171/ https://www.ncbi.nlm.nih.gov/pubmed/33828153 http://dx.doi.org/10.1038/s41598-021-87204-z |
_version_ | 1783675764419330048 |
---|---|
author | Jain, Aashish Terashi, Genki Kagaya, Yuki Maddhuri Venkata Subramaniya, Sai Raghavendra Christoffer, Charles Kihara, Daisuke |
author_facet | Jain, Aashish Terashi, Genki Kagaya, Yuki Maddhuri Venkata Subramaniya, Sai Raghavendra Christoffer, Charles Kihara, Daisuke |
author_sort | Jain, Aashish |
collection | PubMed |
description | Protein 3D structure prediction has advanced significantly in recent years due to improving contact prediction accuracy. This improvement has been largely due to deep learning approaches that predict inter-residue contacts and, more recently, distances using multiple sequence alignments (MSAs). In this work we present AttentiveDist, a novel approach that uses different MSAs generated with different E-values in a single model to increase the co-evolutionary information provided to the model. To determine the importance of each MSA’s feature at the inter-residue level, we added an attention layer to the deep neural network. We show that combining four MSAs of different E-value cutoffs improved the model prediction performance as compared to single E-value MSA features. A further improvement was observed when an attention layer was used and even more when additional prediction tasks of bond angle predictions were added. The improvement of distance predictions were successfully transferred to achieve better protein tertiary structure modeling. |
format | Online Article Text |
id | pubmed-8027171 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-80271712021-04-08 Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction Jain, Aashish Terashi, Genki Kagaya, Yuki Maddhuri Venkata Subramaniya, Sai Raghavendra Christoffer, Charles Kihara, Daisuke Sci Rep Article Protein 3D structure prediction has advanced significantly in recent years due to improving contact prediction accuracy. This improvement has been largely due to deep learning approaches that predict inter-residue contacts and, more recently, distances using multiple sequence alignments (MSAs). In this work we present AttentiveDist, a novel approach that uses different MSAs generated with different E-values in a single model to increase the co-evolutionary information provided to the model. To determine the importance of each MSA’s feature at the inter-residue level, we added an attention layer to the deep neural network. We show that combining four MSAs of different E-value cutoffs improved the model prediction performance as compared to single E-value MSA features. A further improvement was observed when an attention layer was used and even more when additional prediction tasks of bond angle predictions were added. The improvement of distance predictions were successfully transferred to achieve better protein tertiary structure modeling. Nature Publishing Group UK 2021-04-07 /pmc/articles/PMC8027171/ /pubmed/33828153 http://dx.doi.org/10.1038/s41598-021-87204-z Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Jain, Aashish Terashi, Genki Kagaya, Yuki Maddhuri Venkata Subramaniya, Sai Raghavendra Christoffer, Charles Kihara, Daisuke Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction |
title | Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction |
title_full | Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction |
title_fullStr | Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction |
title_full_unstemmed | Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction |
title_short | Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction |
title_sort | analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8027171/ https://www.ncbi.nlm.nih.gov/pubmed/33828153 http://dx.doi.org/10.1038/s41598-021-87204-z |
work_keys_str_mv | AT jainaashish analyzingeffectofquadruplemultiplesequencealignmentsondeeplearningbasedproteininterresiduedistanceprediction AT terashigenki analyzingeffectofquadruplemultiplesequencealignmentsondeeplearningbasedproteininterresiduedistanceprediction AT kagayayuki analyzingeffectofquadruplemultiplesequencealignmentsondeeplearningbasedproteininterresiduedistanceprediction AT maddhurivenkatasubramaniyasairaghavendra analyzingeffectofquadruplemultiplesequencealignmentsondeeplearningbasedproteininterresiduedistanceprediction AT christoffercharles analyzingeffectofquadruplemultiplesequencealignmentsondeeplearningbasedproteininterresiduedistanceprediction AT kiharadaisuke analyzingeffectofquadruplemultiplesequencealignmentsondeeplearningbasedproteininterresiduedistanceprediction |