Cargando…

Inter-protein residue covariation information unravels physically interacting protein dimers

BACKGROUND: Predicting physical interaction between proteins is one of the greatest challenges in computational biology. There are considerable various protein interactions and a huge number of protein sequences and synthetic peptides with unknown interacting counterparts. Most of co-evolutionary me...

Descripción completa

Detalles Bibliográficos
Autores principales: Salmanian, Sara, Pezeshk, Hamid, Sadeghi, Mehdi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7745481/
https://www.ncbi.nlm.nih.gov/pubmed/33334319
http://dx.doi.org/10.1186/s12859-020-03930-7
_version_ 1783624615494418432
author Salmanian, Sara
Pezeshk, Hamid
Sadeghi, Mehdi
author_facet Salmanian, Sara
Pezeshk, Hamid
Sadeghi, Mehdi
author_sort Salmanian, Sara
collection PubMed
description BACKGROUND: Predicting physical interaction between proteins is one of the greatest challenges in computational biology. There are considerable various protein interactions and a huge number of protein sequences and synthetic peptides with unknown interacting counterparts. Most of co-evolutionary methods discover a combination of physical interplays and functional associations. However, there are only a handful of approaches which specifically infer physical interactions. Hybrid co-evolutionary methods exploit inter-protein residue coevolution to unravel specific physical interacting proteins. In this study, we introduce a hybrid co-evolutionary-based approach to predict physical interplays between pairs of protein families, starting from protein sequences only. RESULTS: In the present analysis, pairs of multiple sequence alignments are constructed for each dimer and the covariation between residues in those pairs are calculated by CCMpred (Contacts from Correlated Mutations predicted) and three mutual information based approaches for ten accessible surface area threshold groups. Then, whole residue couplings between proteins of each dimer are unified into a single Frobenius norm value. Norms of residue contact matrices of all dimers in different accessible surface area thresholds are fed into support vector machine as single or multiple feature models. The results of training the classifiers by single features show no apparent different accuracies in distinct methods for different accessible surface area thresholds. Nevertheless, mutual information product and context likelihood of relatedness procedures may roughly have an overall higher and lower performances than other two methods for different accessible surface area cut-offs, respectively. The results also demonstrate that training support vector machine with multiple norm features for several accessible surface area thresholds leads to a considerable improvement of prediction performance. In this context, CCMpred roughly achieves an overall better performance than mutual information based approaches. The best accuracy, sensitivity, specificity, precision and negative predictive value for that method are 0.98, 1, 0.962, 0.96, and 0.962, respectively. CONCLUSIONS: In this paper, by feeding norm values of protein dimers into support vector machines in different accessible surface area thresholds, we demonstrate that even small number of proteins in pairs of multiple alignments could allow one to accurately discriminate between positive and negative dimers.
format Online
Article
Text
id pubmed-7745481
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77454812020-12-18 Inter-protein residue covariation information unravels physically interacting protein dimers Salmanian, Sara Pezeshk, Hamid Sadeghi, Mehdi BMC Bioinformatics Methodology Article BACKGROUND: Predicting physical interaction between proteins is one of the greatest challenges in computational biology. There are considerable various protein interactions and a huge number of protein sequences and synthetic peptides with unknown interacting counterparts. Most of co-evolutionary methods discover a combination of physical interplays and functional associations. However, there are only a handful of approaches which specifically infer physical interactions. Hybrid co-evolutionary methods exploit inter-protein residue coevolution to unravel specific physical interacting proteins. In this study, we introduce a hybrid co-evolutionary-based approach to predict physical interplays between pairs of protein families, starting from protein sequences only. RESULTS: In the present analysis, pairs of multiple sequence alignments are constructed for each dimer and the covariation between residues in those pairs are calculated by CCMpred (Contacts from Correlated Mutations predicted) and three mutual information based approaches for ten accessible surface area threshold groups. Then, whole residue couplings between proteins of each dimer are unified into a single Frobenius norm value. Norms of residue contact matrices of all dimers in different accessible surface area thresholds are fed into support vector machine as single or multiple feature models. The results of training the classifiers by single features show no apparent different accuracies in distinct methods for different accessible surface area thresholds. Nevertheless, mutual information product and context likelihood of relatedness procedures may roughly have an overall higher and lower performances than other two methods for different accessible surface area cut-offs, respectively. The results also demonstrate that training support vector machine with multiple norm features for several accessible surface area thresholds leads to a considerable improvement of prediction performance. In this context, CCMpred roughly achieves an overall better performance than mutual information based approaches. The best accuracy, sensitivity, specificity, precision and negative predictive value for that method are 0.98, 1, 0.962, 0.96, and 0.962, respectively. CONCLUSIONS: In this paper, by feeding norm values of protein dimers into support vector machines in different accessible surface area thresholds, we demonstrate that even small number of proteins in pairs of multiple alignments could allow one to accurately discriminate between positive and negative dimers. BioMed Central 2020-12-17 /pmc/articles/PMC7745481/ /pubmed/33334319 http://dx.doi.org/10.1186/s12859-020-03930-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Salmanian, Sara
Pezeshk, Hamid
Sadeghi, Mehdi
Inter-protein residue covariation information unravels physically interacting protein dimers
title Inter-protein residue covariation information unravels physically interacting protein dimers
title_full Inter-protein residue covariation information unravels physically interacting protein dimers
title_fullStr Inter-protein residue covariation information unravels physically interacting protein dimers
title_full_unstemmed Inter-protein residue covariation information unravels physically interacting protein dimers
title_short Inter-protein residue covariation information unravels physically interacting protein dimers
title_sort inter-protein residue covariation information unravels physically interacting protein dimers
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7745481/
https://www.ncbi.nlm.nih.gov/pubmed/33334319
http://dx.doi.org/10.1186/s12859-020-03930-7
work_keys_str_mv AT salmaniansara interproteinresiduecovariationinformationunravelsphysicallyinteractingproteindimers
AT pezeshkhamid interproteinresiduecovariationinformationunravelsphysicallyinteractingproteindimers
AT sadeghimehdi interproteinresiduecovariationinformationunravelsphysicallyinteractingproteindimers