Cargando…

IntPred: a structure-based predictor of protein–protein interaction sites

MOTIVATION: Protein–protein interactions are vital for protein function with the average protein having between three and ten interacting partners. Knowledge of precise protein–protein interfaces comes from crystal structures deposited in the Protein Data Bank (PDB), but only 50% of structures in th...

Descripción completa

Detalles Bibliográficos
Autores principales: Northey, Thomas C, Barešić, Anja, Martin, Andrew C R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860208/
https://www.ncbi.nlm.nih.gov/pubmed/28968673
http://dx.doi.org/10.1093/bioinformatics/btx585
_version_ 1783307960910348288
author Northey, Thomas C
Barešić, Anja
Martin, Andrew C R
author_facet Northey, Thomas C
Barešić, Anja
Martin, Andrew C R
author_sort Northey, Thomas C
collection PubMed
description MOTIVATION: Protein–protein interactions are vital for protein function with the average protein having between three and ten interacting partners. Knowledge of precise protein–protein interfaces comes from crystal structures deposited in the Protein Data Bank (PDB), but only 50% of structures in the PDB are complexes. There is therefore a need to predict protein–protein interfaces in silico and various methods for this purpose. Here we explore the use of a predictor based on structural features and which exploits random forest machine learning, comparing its performance with a number of popular established methods. RESULTS: On an independent test set of obligate and transient complexes, our IntPred predictor performs well (MCC = 0.370, ACC = 0.811, SPEC = 0.916, SENS = 0.411) and compares favourably with other methods. Overall, IntPred ranks second of six methods tested with SPPIDER having slightly better overall performance (MCC = 0.410, ACC = 0.759, SPEC = 0.783, SENS = 0.676), but considerably worse specificity than IntPred. As with SPPIDER, using an independent test set of obligate complexes enhanced performance (MCC = 0.381) while performance is somewhat reduced on a dataset of transient complexes (MCC = 0.303). The trade-off between sensitivity and specificity compared with SPPIDER suggests that the choice of the appropriate tool is application-dependent. AVAILABILITY AND IMPLEMENTATION: IntPred is implemented in Perl and may be downloaded for local use or run via a web server at www.bioinf.org.uk/intpred/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5860208
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58602082018-03-21 IntPred: a structure-based predictor of protein–protein interaction sites Northey, Thomas C Barešić, Anja Martin, Andrew C R Bioinformatics Original Papers MOTIVATION: Protein–protein interactions are vital for protein function with the average protein having between three and ten interacting partners. Knowledge of precise protein–protein interfaces comes from crystal structures deposited in the Protein Data Bank (PDB), but only 50% of structures in the PDB are complexes. There is therefore a need to predict protein–protein interfaces in silico and various methods for this purpose. Here we explore the use of a predictor based on structural features and which exploits random forest machine learning, comparing its performance with a number of popular established methods. RESULTS: On an independent test set of obligate and transient complexes, our IntPred predictor performs well (MCC = 0.370, ACC = 0.811, SPEC = 0.916, SENS = 0.411) and compares favourably with other methods. Overall, IntPred ranks second of six methods tested with SPPIDER having slightly better overall performance (MCC = 0.410, ACC = 0.759, SPEC = 0.783, SENS = 0.676), but considerably worse specificity than IntPred. As with SPPIDER, using an independent test set of obligate complexes enhanced performance (MCC = 0.381) while performance is somewhat reduced on a dataset of transient complexes (MCC = 0.303). The trade-off between sensitivity and specificity compared with SPPIDER suggests that the choice of the appropriate tool is application-dependent. AVAILABILITY AND IMPLEMENTATION: IntPred is implemented in Perl and may be downloaded for local use or run via a web server at www.bioinf.org.uk/intpred/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-01-15 2017-09-18 /pmc/articles/PMC5860208/ /pubmed/28968673 http://dx.doi.org/10.1093/bioinformatics/btx585 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Northey, Thomas C
Barešić, Anja
Martin, Andrew C R
IntPred: a structure-based predictor of protein–protein interaction sites
title IntPred: a structure-based predictor of protein–protein interaction sites
title_full IntPred: a structure-based predictor of protein–protein interaction sites
title_fullStr IntPred: a structure-based predictor of protein–protein interaction sites
title_full_unstemmed IntPred: a structure-based predictor of protein–protein interaction sites
title_short IntPred: a structure-based predictor of protein–protein interaction sites
title_sort intpred: a structure-based predictor of protein–protein interaction sites
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860208/
https://www.ncbi.nlm.nih.gov/pubmed/28968673
http://dx.doi.org/10.1093/bioinformatics/btx585
work_keys_str_mv AT northeythomasc intpredastructurebasedpredictorofproteinproteininteractionsites
AT baresicanja intpredastructurebasedpredictorofproteinproteininteractionsites
AT martinandrewcr intpredastructurebasedpredictorofproteinproteininteractionsites