Cargando…
Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction
BACKGROUND: The correct determination of protein–protein interaction interfaces is important for understanding disease mechanisms and for rational drug design. To date, several computational methods for the prediction of protein interfaces have been developed, but the interface prediction problem is...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5802066/ https://www.ncbi.nlm.nih.gov/pubmed/29409446 http://dx.doi.org/10.1186/s12859-018-2043-3 |
_version_ | 1783298468710711296 |
---|---|
author | Daberdaku, Sebastian Ferrari, Carlo |
author_facet | Daberdaku, Sebastian Ferrari, Carlo |
author_sort | Daberdaku, Sebastian |
collection | PubMed |
description | BACKGROUND: The correct determination of protein–protein interaction interfaces is important for understanding disease mechanisms and for rational drug design. To date, several computational methods for the prediction of protein interfaces have been developed, but the interface prediction problem is still not fully understood. Experimental evidence suggests that the location of binding sites is imprinted in the protein structure, but there are major differences among the interfaces of the various protein types: the characterising properties can vary a lot depending on the interaction type and function. The selection of an optimal set of features characterising the protein interface and the development of an effective method to represent and capture the complex protein recognition patterns are of paramount importance for this task. RESULTS: In this work we investigate the potential of a novel local surface descriptor based on 3D Zernike moments for the interface prediction task. Descriptors invariant to roto-translations are extracted from circular patches of the protein surface enriched with physico-chemical properties from the HQI8 amino acid index set, and are used as samples for a binary classification problem. Support Vector Machines are used as a classifier to distinguish interface local surface patches from non-interface ones. The proposed method was validated on 16 classes of proteins extracted from the Protein–Protein Docking Benchmark 5.0 and compared to other state-of-the-art protein interface predictors (SPPIDER, PrISE and NPS-HomPPI). CONCLUSIONS: The 3D Zernike descriptors are able to capture the similarity among patterns of physico-chemical and biochemical properties mapped on the protein surface arising from the various spatial arrangements of the underlying residues, and their usage can be easily extended to other sets of amino acid properties. The results suggest that the choice of a proper set of features characterising the protein interface is crucial for the interface prediction task, and that optimality strongly depends on the class of proteins whose interface we want to characterise. We postulate that different protein classes should be treated separately and that it is necessary to identify an optimal set of features for each protein class. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2043-3) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5802066 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-58020662018-02-14 Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction Daberdaku, Sebastian Ferrari, Carlo BMC Bioinformatics Research Article BACKGROUND: The correct determination of protein–protein interaction interfaces is important for understanding disease mechanisms and for rational drug design. To date, several computational methods for the prediction of protein interfaces have been developed, but the interface prediction problem is still not fully understood. Experimental evidence suggests that the location of binding sites is imprinted in the protein structure, but there are major differences among the interfaces of the various protein types: the characterising properties can vary a lot depending on the interaction type and function. The selection of an optimal set of features characterising the protein interface and the development of an effective method to represent and capture the complex protein recognition patterns are of paramount importance for this task. RESULTS: In this work we investigate the potential of a novel local surface descriptor based on 3D Zernike moments for the interface prediction task. Descriptors invariant to roto-translations are extracted from circular patches of the protein surface enriched with physico-chemical properties from the HQI8 amino acid index set, and are used as samples for a binary classification problem. Support Vector Machines are used as a classifier to distinguish interface local surface patches from non-interface ones. The proposed method was validated on 16 classes of proteins extracted from the Protein–Protein Docking Benchmark 5.0 and compared to other state-of-the-art protein interface predictors (SPPIDER, PrISE and NPS-HomPPI). CONCLUSIONS: The 3D Zernike descriptors are able to capture the similarity among patterns of physico-chemical and biochemical properties mapped on the protein surface arising from the various spatial arrangements of the underlying residues, and their usage can be easily extended to other sets of amino acid properties. The results suggest that the choice of a proper set of features characterising the protein interface is crucial for the interface prediction task, and that optimality strongly depends on the class of proteins whose interface we want to characterise. We postulate that different protein classes should be treated separately and that it is necessary to identify an optimal set of features for each protein class. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2043-3) contains supplementary material, which is available to authorized users. BioMed Central 2018-02-06 /pmc/articles/PMC5802066/ /pubmed/29409446 http://dx.doi.org/10.1186/s12859-018-2043-3 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Daberdaku, Sebastian Ferrari, Carlo Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction |
title | Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction |
title_full | Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction |
title_fullStr | Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction |
title_full_unstemmed | Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction |
title_short | Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction |
title_sort | exploring the potential of 3d zernike descriptors and svm for protein–protein interface prediction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5802066/ https://www.ncbi.nlm.nih.gov/pubmed/29409446 http://dx.doi.org/10.1186/s12859-018-2043-3 |
work_keys_str_mv | AT daberdakusebastian exploringthepotentialof3dzernikedescriptorsandsvmforproteinproteininterfaceprediction AT ferraricarlo exploringthepotentialof3dzernikedescriptorsandsvmforproteinproteininterfaceprediction |