Cargando…

A feature-based approach to modeling protein–protein interaction hot spots

Identifying features that effectively represent the energetic contribution of an individual interface residue to the interactions between proteins remains problematic. Here, we present several new features and show that they are more effective than conventional features. By combining the proposed fe...

Descripción completa

Detalles Bibliográficos
Autores principales: Cho, Kyu-il, Kim, Dongsup, Lee, Doheon
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2677884/
https://www.ncbi.nlm.nih.gov/pubmed/19273533
http://dx.doi.org/10.1093/nar/gkp132
_version_ 1782166804484849664
author Cho, Kyu-il
Kim, Dongsup
Lee, Doheon
author_facet Cho, Kyu-il
Kim, Dongsup
Lee, Doheon
author_sort Cho, Kyu-il
collection PubMed
description Identifying features that effectively represent the energetic contribution of an individual interface residue to the interactions between proteins remains problematic. Here, we present several new features and show that they are more effective than conventional features. By combining the proposed features with conventional features, we develop a predictive model for interaction hot spots. Initially, 54 multifaceted features, composed of different levels of information including structure, sequence and molecular interaction information, are quantified. Then, to identify the best subset of features for predicting hot spots, feature selection is performed using a decision tree. Based on the selected features, a predictive model for hot spots is created using support vector machine (SVM) and tested on an independent test set. Our model shows better overall predictive accuracy than previous methods such as the alanine scanning methods Robetta and FOLDEF, and the knowledge-based method KFC. Subsequent analysis yields several findings about hot spots. As expected, hot spots have a larger relative surface area burial and are more hydrophobic than other residues. Unexpectedly, however, residue conservation displays a rather complicated tendency depending on the types of protein complexes, indicating that this feature is not good for identifying hot spots. Of the selected features, the weighted atomic packing density, relative surface area burial and weighted hydrophobicity are the top 3, with the weighted atomic packing density proving to be the most effective feature for predicting hot spots. Notably, we find that hot spots are closely related to π–related interactions, especially π · · · π interactions.
format Text
id pubmed-2677884
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-26778842009-05-15 A feature-based approach to modeling protein–protein interaction hot spots Cho, Kyu-il Kim, Dongsup Lee, Doheon Nucleic Acids Res Computational Biology Identifying features that effectively represent the energetic contribution of an individual interface residue to the interactions between proteins remains problematic. Here, we present several new features and show that they are more effective than conventional features. By combining the proposed features with conventional features, we develop a predictive model for interaction hot spots. Initially, 54 multifaceted features, composed of different levels of information including structure, sequence and molecular interaction information, are quantified. Then, to identify the best subset of features for predicting hot spots, feature selection is performed using a decision tree. Based on the selected features, a predictive model for hot spots is created using support vector machine (SVM) and tested on an independent test set. Our model shows better overall predictive accuracy than previous methods such as the alanine scanning methods Robetta and FOLDEF, and the knowledge-based method KFC. Subsequent analysis yields several findings about hot spots. As expected, hot spots have a larger relative surface area burial and are more hydrophobic than other residues. Unexpectedly, however, residue conservation displays a rather complicated tendency depending on the types of protein complexes, indicating that this feature is not good for identifying hot spots. Of the selected features, the weighted atomic packing density, relative surface area burial and weighted hydrophobicity are the top 3, with the weighted atomic packing density proving to be the most effective feature for predicting hot spots. Notably, we find that hot spots are closely related to π–related interactions, especially π · · · π interactions. Oxford University Press 2009-05 2009-03-09 /pmc/articles/PMC2677884/ /pubmed/19273533 http://dx.doi.org/10.1093/nar/gkp132 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Cho, Kyu-il
Kim, Dongsup
Lee, Doheon
A feature-based approach to modeling protein–protein interaction hot spots
title A feature-based approach to modeling protein–protein interaction hot spots
title_full A feature-based approach to modeling protein–protein interaction hot spots
title_fullStr A feature-based approach to modeling protein–protein interaction hot spots
title_full_unstemmed A feature-based approach to modeling protein–protein interaction hot spots
title_short A feature-based approach to modeling protein–protein interaction hot spots
title_sort feature-based approach to modeling protein–protein interaction hot spots
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2677884/
https://www.ncbi.nlm.nih.gov/pubmed/19273533
http://dx.doi.org/10.1093/nar/gkp132
work_keys_str_mv AT chokyuil afeaturebasedapproachtomodelingproteinproteininteractionhotspots
AT kimdongsup afeaturebasedapproachtomodelingproteinproteininteractionhotspots
AT leedoheon afeaturebasedapproachtomodelingproteinproteininteractionhotspots
AT chokyuil featurebasedapproachtomodelingproteinproteininteractionhotspots
AT kimdongsup featurebasedapproachtomodelingproteinproteininteractionhotspots
AT leedoheon featurebasedapproachtomodelingproteinproteininteractionhotspots