Cargando…
A feature-based approach to modeling protein–protein interaction hot spots
Identifying features that effectively represent the energetic contribution of an individual interface residue to the interactions between proteins remains problematic. Here, we present several new features and show that they are more effective than conventional features. By combining the proposed fe...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2677884/ https://www.ncbi.nlm.nih.gov/pubmed/19273533 http://dx.doi.org/10.1093/nar/gkp132 |
_version_ | 1782166804484849664 |
---|---|
author | Cho, Kyu-il Kim, Dongsup Lee, Doheon |
author_facet | Cho, Kyu-il Kim, Dongsup Lee, Doheon |
author_sort | Cho, Kyu-il |
collection | PubMed |
description | Identifying features that effectively represent the energetic contribution of an individual interface residue to the interactions between proteins remains problematic. Here, we present several new features and show that they are more effective than conventional features. By combining the proposed features with conventional features, we develop a predictive model for interaction hot spots. Initially, 54 multifaceted features, composed of different levels of information including structure, sequence and molecular interaction information, are quantified. Then, to identify the best subset of features for predicting hot spots, feature selection is performed using a decision tree. Based on the selected features, a predictive model for hot spots is created using support vector machine (SVM) and tested on an independent test set. Our model shows better overall predictive accuracy than previous methods such as the alanine scanning methods Robetta and FOLDEF, and the knowledge-based method KFC. Subsequent analysis yields several findings about hot spots. As expected, hot spots have a larger relative surface area burial and are more hydrophobic than other residues. Unexpectedly, however, residue conservation displays a rather complicated tendency depending on the types of protein complexes, indicating that this feature is not good for identifying hot spots. Of the selected features, the weighted atomic packing density, relative surface area burial and weighted hydrophobicity are the top 3, with the weighted atomic packing density proving to be the most effective feature for predicting hot spots. Notably, we find that hot spots are closely related to π–related interactions, especially π · · · π interactions. |
format | Text |
id | pubmed-2677884 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-26778842009-05-15 A feature-based approach to modeling protein–protein interaction hot spots Cho, Kyu-il Kim, Dongsup Lee, Doheon Nucleic Acids Res Computational Biology Identifying features that effectively represent the energetic contribution of an individual interface residue to the interactions between proteins remains problematic. Here, we present several new features and show that they are more effective than conventional features. By combining the proposed features with conventional features, we develop a predictive model for interaction hot spots. Initially, 54 multifaceted features, composed of different levels of information including structure, sequence and molecular interaction information, are quantified. Then, to identify the best subset of features for predicting hot spots, feature selection is performed using a decision tree. Based on the selected features, a predictive model for hot spots is created using support vector machine (SVM) and tested on an independent test set. Our model shows better overall predictive accuracy than previous methods such as the alanine scanning methods Robetta and FOLDEF, and the knowledge-based method KFC. Subsequent analysis yields several findings about hot spots. As expected, hot spots have a larger relative surface area burial and are more hydrophobic than other residues. Unexpectedly, however, residue conservation displays a rather complicated tendency depending on the types of protein complexes, indicating that this feature is not good for identifying hot spots. Of the selected features, the weighted atomic packing density, relative surface area burial and weighted hydrophobicity are the top 3, with the weighted atomic packing density proving to be the most effective feature for predicting hot spots. Notably, we find that hot spots are closely related to π–related interactions, especially π · · · π interactions. Oxford University Press 2009-05 2009-03-09 /pmc/articles/PMC2677884/ /pubmed/19273533 http://dx.doi.org/10.1093/nar/gkp132 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Cho, Kyu-il Kim, Dongsup Lee, Doheon A feature-based approach to modeling protein–protein interaction hot spots |
title | A feature-based approach to modeling protein–protein interaction hot spots |
title_full | A feature-based approach to modeling protein–protein interaction hot spots |
title_fullStr | A feature-based approach to modeling protein–protein interaction hot spots |
title_full_unstemmed | A feature-based approach to modeling protein–protein interaction hot spots |
title_short | A feature-based approach to modeling protein–protein interaction hot spots |
title_sort | feature-based approach to modeling protein–protein interaction hot spots |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2677884/ https://www.ncbi.nlm.nih.gov/pubmed/19273533 http://dx.doi.org/10.1093/nar/gkp132 |
work_keys_str_mv | AT chokyuil afeaturebasedapproachtomodelingproteinproteininteractionhotspots AT kimdongsup afeaturebasedapproachtomodelingproteinproteininteractionhotspots AT leedoheon afeaturebasedapproachtomodelingproteinproteininteractionhotspots AT chokyuil featurebasedapproachtomodelingproteinproteininteractionhotspots AT kimdongsup featurebasedapproachtomodelingproteinproteininteractionhotspots AT leedoheon featurebasedapproachtomodelingproteinproteininteractionhotspots |