Cargando…

A thorough analysis of the contribution of experimental, derived and sequence-based predicted protein-protein interactions for functional annotation of proteins

Physical interaction between two proteins is strong evidence that the proteins are involved in the same biological process, making Protein-Protein Interaction (PPI) networks a valuable data resource for predicting the cellular functions of proteins. However, PPI networks are largely incomplete for n...

Descripción completa

Detalles Bibliográficos
Autores principales: Makrodimitris, Stavros, Reinders, Marcel, van Ham, Roeland
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7688180/
https://www.ncbi.nlm.nih.gov/pubmed/33237964
http://dx.doi.org/10.1371/journal.pone.0242723
_version_ 1783613660233465856
author Makrodimitris, Stavros
Reinders, Marcel
van Ham, Roeland
author_facet Makrodimitris, Stavros
Reinders, Marcel
van Ham, Roeland
author_sort Makrodimitris, Stavros
collection PubMed
description Physical interaction between two proteins is strong evidence that the proteins are involved in the same biological process, making Protein-Protein Interaction (PPI) networks a valuable data resource for predicting the cellular functions of proteins. However, PPI networks are largely incomplete for non-model species. Here, we tested to what extent these incomplete networks are still useful for genome-wide function prediction. We used two network-based classifiers to predict Biological Process Gene Ontology terms from protein interaction data in four species: Saccharomyces cerevisiae, Escherichia coli, Arabidopsis thaliana and Solanum lycopersicum (tomato). The classifiers had reasonable performance in the well-studied yeast, but performed poorly in the other species. We showed that this poor performance can be considerably improved by adding edges predicted from various data sources, such as text mining, and that associations from the STRING database are more useful than interactions predicted by a neural network from sequence-based features.
format Online
Article
Text
id pubmed-7688180
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-76881802020-12-05 A thorough analysis of the contribution of experimental, derived and sequence-based predicted protein-protein interactions for functional annotation of proteins Makrodimitris, Stavros Reinders, Marcel van Ham, Roeland PLoS One Research Article Physical interaction between two proteins is strong evidence that the proteins are involved in the same biological process, making Protein-Protein Interaction (PPI) networks a valuable data resource for predicting the cellular functions of proteins. However, PPI networks are largely incomplete for non-model species. Here, we tested to what extent these incomplete networks are still useful for genome-wide function prediction. We used two network-based classifiers to predict Biological Process Gene Ontology terms from protein interaction data in four species: Saccharomyces cerevisiae, Escherichia coli, Arabidopsis thaliana and Solanum lycopersicum (tomato). The classifiers had reasonable performance in the well-studied yeast, but performed poorly in the other species. We showed that this poor performance can be considerably improved by adding edges predicted from various data sources, such as text mining, and that associations from the STRING database are more useful than interactions predicted by a neural network from sequence-based features. Public Library of Science 2020-11-25 /pmc/articles/PMC7688180/ /pubmed/33237964 http://dx.doi.org/10.1371/journal.pone.0242723 Text en © 2020 Makrodimitris et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Makrodimitris, Stavros
Reinders, Marcel
van Ham, Roeland
A thorough analysis of the contribution of experimental, derived and sequence-based predicted protein-protein interactions for functional annotation of proteins
title A thorough analysis of the contribution of experimental, derived and sequence-based predicted protein-protein interactions for functional annotation of proteins
title_full A thorough analysis of the contribution of experimental, derived and sequence-based predicted protein-protein interactions for functional annotation of proteins
title_fullStr A thorough analysis of the contribution of experimental, derived and sequence-based predicted protein-protein interactions for functional annotation of proteins
title_full_unstemmed A thorough analysis of the contribution of experimental, derived and sequence-based predicted protein-protein interactions for functional annotation of proteins
title_short A thorough analysis of the contribution of experimental, derived and sequence-based predicted protein-protein interactions for functional annotation of proteins
title_sort thorough analysis of the contribution of experimental, derived and sequence-based predicted protein-protein interactions for functional annotation of proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7688180/
https://www.ncbi.nlm.nih.gov/pubmed/33237964
http://dx.doi.org/10.1371/journal.pone.0242723
work_keys_str_mv AT makrodimitrisstavros athoroughanalysisofthecontributionofexperimentalderivedandsequencebasedpredictedproteinproteininteractionsforfunctionalannotationofproteins
AT reindersmarcel athoroughanalysisofthecontributionofexperimentalderivedandsequencebasedpredictedproteinproteininteractionsforfunctionalannotationofproteins
AT vanhamroeland athoroughanalysisofthecontributionofexperimentalderivedandsequencebasedpredictedproteinproteininteractionsforfunctionalannotationofproteins
AT makrodimitrisstavros thoroughanalysisofthecontributionofexperimentalderivedandsequencebasedpredictedproteinproteininteractionsforfunctionalannotationofproteins
AT reindersmarcel thoroughanalysisofthecontributionofexperimentalderivedandsequencebasedpredictedproteinproteininteractionsforfunctionalannotationofproteins
AT vanhamroeland thoroughanalysisofthecontributionofexperimentalderivedandsequencebasedpredictedproteinproteininteractionsforfunctionalannotationofproteins