Cargando…

Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction

BACKGROUND: The correct determination of protein–protein interaction interfaces is important for understanding disease mechanisms and for rational drug design. To date, several computational methods for the prediction of protein interfaces have been developed, but the interface prediction problem is...

Descripción completa

Detalles Bibliográficos
Autores principales: Daberdaku, Sebastian, Ferrari, Carlo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5802066/
https://www.ncbi.nlm.nih.gov/pubmed/29409446
http://dx.doi.org/10.1186/s12859-018-2043-3
_version_ 1783298468710711296
author Daberdaku, Sebastian
Ferrari, Carlo
author_facet Daberdaku, Sebastian
Ferrari, Carlo
author_sort Daberdaku, Sebastian
collection PubMed
description BACKGROUND: The correct determination of protein–protein interaction interfaces is important for understanding disease mechanisms and for rational drug design. To date, several computational methods for the prediction of protein interfaces have been developed, but the interface prediction problem is still not fully understood. Experimental evidence suggests that the location of binding sites is imprinted in the protein structure, but there are major differences among the interfaces of the various protein types: the characterising properties can vary a lot depending on the interaction type and function. The selection of an optimal set of features characterising the protein interface and the development of an effective method to represent and capture the complex protein recognition patterns are of paramount importance for this task. RESULTS: In this work we investigate the potential of a novel local surface descriptor based on 3D Zernike moments for the interface prediction task. Descriptors invariant to roto-translations are extracted from circular patches of the protein surface enriched with physico-chemical properties from the HQI8 amino acid index set, and are used as samples for a binary classification problem. Support Vector Machines are used as a classifier to distinguish interface local surface patches from non-interface ones. The proposed method was validated on 16 classes of proteins extracted from the Protein–Protein Docking Benchmark 5.0 and compared to other state-of-the-art protein interface predictors (SPPIDER, PrISE and NPS-HomPPI). CONCLUSIONS: The 3D Zernike descriptors are able to capture the similarity among patterns of physico-chemical and biochemical properties mapped on the protein surface arising from the various spatial arrangements of the underlying residues, and their usage can be easily extended to other sets of amino acid properties. The results suggest that the choice of a proper set of features characterising the protein interface is crucial for the interface prediction task, and that optimality strongly depends on the class of proteins whose interface we want to characterise. We postulate that different protein classes should be treated separately and that it is necessary to identify an optimal set of features for each protein class. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2043-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5802066
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58020662018-02-14 Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction Daberdaku, Sebastian Ferrari, Carlo BMC Bioinformatics Research Article BACKGROUND: The correct determination of protein–protein interaction interfaces is important for understanding disease mechanisms and for rational drug design. To date, several computational methods for the prediction of protein interfaces have been developed, but the interface prediction problem is still not fully understood. Experimental evidence suggests that the location of binding sites is imprinted in the protein structure, but there are major differences among the interfaces of the various protein types: the characterising properties can vary a lot depending on the interaction type and function. The selection of an optimal set of features characterising the protein interface and the development of an effective method to represent and capture the complex protein recognition patterns are of paramount importance for this task. RESULTS: In this work we investigate the potential of a novel local surface descriptor based on 3D Zernike moments for the interface prediction task. Descriptors invariant to roto-translations are extracted from circular patches of the protein surface enriched with physico-chemical properties from the HQI8 amino acid index set, and are used as samples for a binary classification problem. Support Vector Machines are used as a classifier to distinguish interface local surface patches from non-interface ones. The proposed method was validated on 16 classes of proteins extracted from the Protein–Protein Docking Benchmark 5.0 and compared to other state-of-the-art protein interface predictors (SPPIDER, PrISE and NPS-HomPPI). CONCLUSIONS: The 3D Zernike descriptors are able to capture the similarity among patterns of physico-chemical and biochemical properties mapped on the protein surface arising from the various spatial arrangements of the underlying residues, and their usage can be easily extended to other sets of amino acid properties. The results suggest that the choice of a proper set of features characterising the protein interface is crucial for the interface prediction task, and that optimality strongly depends on the class of proteins whose interface we want to characterise. We postulate that different protein classes should be treated separately and that it is necessary to identify an optimal set of features for each protein class. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2043-3) contains supplementary material, which is available to authorized users. BioMed Central 2018-02-06 /pmc/articles/PMC5802066/ /pubmed/29409446 http://dx.doi.org/10.1186/s12859-018-2043-3 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Daberdaku, Sebastian
Ferrari, Carlo
Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction
title Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction
title_full Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction
title_fullStr Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction
title_full_unstemmed Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction
title_short Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction
title_sort exploring the potential of 3d zernike descriptors and svm for protein–protein interface prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5802066/
https://www.ncbi.nlm.nih.gov/pubmed/29409446
http://dx.doi.org/10.1186/s12859-018-2043-3
work_keys_str_mv AT daberdakusebastian exploringthepotentialof3dzernikedescriptorsandsvmforproteinproteininterfaceprediction
AT ferraricarlo exploringthepotentialof3dzernikedescriptorsandsvmforproteinproteininterfaceprediction