Cargando…

CoRNeA: A Pipeline to Decrypt the Inter-Protein Interfaces from Amino Acid Sequence Information

Decrypting the interface residues of the protein complexes provides insight into the functions of the proteins and, hence, the overall cellular machinery. Computational methods have been devised in the past to predict the interface residues using amino acid sequence information, but all these method...

Descripción completa

Detalles Bibliográficos
Autores principales: Chopra, Kriti, Burdak, Bhawna, Sharma, Kaushal, Kembhavi, Ajit, Mande, Shekhar C., Chauhan, Radha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7356028/
https://www.ncbi.nlm.nih.gov/pubmed/32580303
http://dx.doi.org/10.3390/biom10060938
_version_ 1783558411991908352
author Chopra, Kriti
Burdak, Bhawna
Sharma, Kaushal
Kembhavi, Ajit
Mande, Shekhar C.
Chauhan, Radha
author_facet Chopra, Kriti
Burdak, Bhawna
Sharma, Kaushal
Kembhavi, Ajit
Mande, Shekhar C.
Chauhan, Radha
author_sort Chopra, Kriti
collection PubMed
description Decrypting the interface residues of the protein complexes provides insight into the functions of the proteins and, hence, the overall cellular machinery. Computational methods have been devised in the past to predict the interface residues using amino acid sequence information, but all these methods have been majorly applied to predict for prokaryotic protein complexes. Since the composition and rate of evolution of the primary sequence is different between prokaryotes and eukaryotes, it is important to develop a method specifically for eukaryotic complexes. Here, we report a new hybrid pipeline for predicting the protein-protein interaction interfaces in a pairwise manner from the amino acid sequence information of the interacting proteins. It is based on the framework of Co-evolution, machine learning (Random Forest), and Network Analysis named CoRNeA trained specifically on eukaryotic protein complexes. We use Co-evolution, physicochemical properties, and contact potential as major group of features to train the Random Forest classifier. We also incorporate the intra-contact information of the individual proteins to eliminate false positives from the predictions keeping in mind that the amino acid sequence of a protein also holds information for its own folding and not only the interface propensities. Our prediction on example datasets shows that CoRNeA not only enhances the prediction of true interface residues but also reduces false positive rates significantly.
format Online
Article
Text
id pubmed-7356028
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-73560282020-07-22 CoRNeA: A Pipeline to Decrypt the Inter-Protein Interfaces from Amino Acid Sequence Information Chopra, Kriti Burdak, Bhawna Sharma, Kaushal Kembhavi, Ajit Mande, Shekhar C. Chauhan, Radha Biomolecules Article Decrypting the interface residues of the protein complexes provides insight into the functions of the proteins and, hence, the overall cellular machinery. Computational methods have been devised in the past to predict the interface residues using amino acid sequence information, but all these methods have been majorly applied to predict for prokaryotic protein complexes. Since the composition and rate of evolution of the primary sequence is different between prokaryotes and eukaryotes, it is important to develop a method specifically for eukaryotic complexes. Here, we report a new hybrid pipeline for predicting the protein-protein interaction interfaces in a pairwise manner from the amino acid sequence information of the interacting proteins. It is based on the framework of Co-evolution, machine learning (Random Forest), and Network Analysis named CoRNeA trained specifically on eukaryotic protein complexes. We use Co-evolution, physicochemical properties, and contact potential as major group of features to train the Random Forest classifier. We also incorporate the intra-contact information of the individual proteins to eliminate false positives from the predictions keeping in mind that the amino acid sequence of a protein also holds information for its own folding and not only the interface propensities. Our prediction on example datasets shows that CoRNeA not only enhances the prediction of true interface residues but also reduces false positive rates significantly. MDPI 2020-06-22 /pmc/articles/PMC7356028/ /pubmed/32580303 http://dx.doi.org/10.3390/biom10060938 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chopra, Kriti
Burdak, Bhawna
Sharma, Kaushal
Kembhavi, Ajit
Mande, Shekhar C.
Chauhan, Radha
CoRNeA: A Pipeline to Decrypt the Inter-Protein Interfaces from Amino Acid Sequence Information
title CoRNeA: A Pipeline to Decrypt the Inter-Protein Interfaces from Amino Acid Sequence Information
title_full CoRNeA: A Pipeline to Decrypt the Inter-Protein Interfaces from Amino Acid Sequence Information
title_fullStr CoRNeA: A Pipeline to Decrypt the Inter-Protein Interfaces from Amino Acid Sequence Information
title_full_unstemmed CoRNeA: A Pipeline to Decrypt the Inter-Protein Interfaces from Amino Acid Sequence Information
title_short CoRNeA: A Pipeline to Decrypt the Inter-Protein Interfaces from Amino Acid Sequence Information
title_sort cornea: a pipeline to decrypt the inter-protein interfaces from amino acid sequence information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7356028/
https://www.ncbi.nlm.nih.gov/pubmed/32580303
http://dx.doi.org/10.3390/biom10060938
work_keys_str_mv AT choprakriti corneaapipelinetodecrypttheinterproteininterfacesfromaminoacidsequenceinformation
AT burdakbhawna corneaapipelinetodecrypttheinterproteininterfacesfromaminoacidsequenceinformation
AT sharmakaushal corneaapipelinetodecrypttheinterproteininterfacesfromaminoacidsequenceinformation
AT kembhaviajit corneaapipelinetodecrypttheinterproteininterfacesfromaminoacidsequenceinformation
AT mandeshekharc corneaapipelinetodecrypttheinterproteininterfacesfromaminoacidsequenceinformation
AT chauhanradha corneaapipelinetodecrypttheinterproteininterfacesfromaminoacidsequenceinformation