Cargando…

Enhancing protein inter-residue real distance prediction by scrutinising deep learning models

Protein structure prediction (PSP) has achieved significant progress lately via prediction of inter-residue distances using deep learning models and exploitation of the predictions during conformational search. In this context, prediction of large inter-residue distances and also prediction of dista...

Descripción completa

Detalles Bibliográficos
Autores principales: Rahman, Julia, Newton, M. A. Hakim, Islam, Md Khaled Ben, Sattar, Abdul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8764118/
https://www.ncbi.nlm.nih.gov/pubmed/35039537
http://dx.doi.org/10.1038/s41598-021-04441-y
_version_ 1784634094305411072
author Rahman, Julia
Newton, M. A. Hakim
Islam, Md Khaled Ben
Sattar, Abdul
author_facet Rahman, Julia
Newton, M. A. Hakim
Islam, Md Khaled Ben
Sattar, Abdul
author_sort Rahman, Julia
collection PubMed
description Protein structure prediction (PSP) has achieved significant progress lately via prediction of inter-residue distances using deep learning models and exploitation of the predictions during conformational search. In this context, prediction of large inter-residue distances and also prediction of distances between residues separated largely in the protein sequence remain challenging. To deal with these challenges, state-of-the-art inter-residue distance prediction algorithms have used large sets of coevolutionary and non-coevolutionary features. In this paper, we argue that the more the types of features used, the more the kinds of noises introduced and then the deep learning model has to overcome the noises to improve the accuracy of the predictions. Also, multiple features capturing similar underlying characteristics might not necessarily have significantly better cumulative effect. So we scrutinise the feature space to reduce the types of features to be used, but at the same time, we strive to improve the prediction accuracy. Consequently, for inter-residue real distance prediction, in this paper, we propose a deep learning model named scrutinised distance predictor (SDP), which uses only 2 coevolutionary and 3 non-coevolutionary features. On several sets of benchmark proteins, our proposed SDP method improves mean Local Distance Different Test (LDDT) scores at least by 10% over existing state-of-the-art methods. The SDP program along with its data is available from the website https://gitlab.com/mahnewton/sdp.
format Online
Article
Text
id pubmed-8764118
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-87641182022-01-18 Enhancing protein inter-residue real distance prediction by scrutinising deep learning models Rahman, Julia Newton, M. A. Hakim Islam, Md Khaled Ben Sattar, Abdul Sci Rep Article Protein structure prediction (PSP) has achieved significant progress lately via prediction of inter-residue distances using deep learning models and exploitation of the predictions during conformational search. In this context, prediction of large inter-residue distances and also prediction of distances between residues separated largely in the protein sequence remain challenging. To deal with these challenges, state-of-the-art inter-residue distance prediction algorithms have used large sets of coevolutionary and non-coevolutionary features. In this paper, we argue that the more the types of features used, the more the kinds of noises introduced and then the deep learning model has to overcome the noises to improve the accuracy of the predictions. Also, multiple features capturing similar underlying characteristics might not necessarily have significantly better cumulative effect. So we scrutinise the feature space to reduce the types of features to be used, but at the same time, we strive to improve the prediction accuracy. Consequently, for inter-residue real distance prediction, in this paper, we propose a deep learning model named scrutinised distance predictor (SDP), which uses only 2 coevolutionary and 3 non-coevolutionary features. On several sets of benchmark proteins, our proposed SDP method improves mean Local Distance Different Test (LDDT) scores at least by 10% over existing state-of-the-art methods. The SDP program along with its data is available from the website https://gitlab.com/mahnewton/sdp. Nature Publishing Group UK 2022-01-17 /pmc/articles/PMC8764118/ /pubmed/35039537 http://dx.doi.org/10.1038/s41598-021-04441-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Rahman, Julia
Newton, M. A. Hakim
Islam, Md Khaled Ben
Sattar, Abdul
Enhancing protein inter-residue real distance prediction by scrutinising deep learning models
title Enhancing protein inter-residue real distance prediction by scrutinising deep learning models
title_full Enhancing protein inter-residue real distance prediction by scrutinising deep learning models
title_fullStr Enhancing protein inter-residue real distance prediction by scrutinising deep learning models
title_full_unstemmed Enhancing protein inter-residue real distance prediction by scrutinising deep learning models
title_short Enhancing protein inter-residue real distance prediction by scrutinising deep learning models
title_sort enhancing protein inter-residue real distance prediction by scrutinising deep learning models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8764118/
https://www.ncbi.nlm.nih.gov/pubmed/35039537
http://dx.doi.org/10.1038/s41598-021-04441-y
work_keys_str_mv AT rahmanjulia enhancingproteininterresiduerealdistancepredictionbyscrutinisingdeeplearningmodels
AT newtonmahakim enhancingproteininterresiduerealdistancepredictionbyscrutinisingdeeplearningmodels
AT islammdkhaledben enhancingproteininterresiduerealdistancepredictionbyscrutinisingdeeplearningmodels
AT sattarabdul enhancingproteininterresiduerealdistancepredictionbyscrutinisingdeeplearningmodels