Cargando…

Construction of phosphorylation interaction networks by text mining of full-length articles using the eFIP system

Protein phosphorylation is a reversible post-translational modification where a protein kinase adds a phosphate group to a protein, potentially regulating its function, localization and/or activity. Phosphorylation can affect protein–protein interactions (PPIs), abolishing interaction with previous...

Descripción completa

Detalles Bibliográficos
Autores principales: Tudor, Catalina O., Ross, Karen E., Li, Gang, Vijay-Shanker, K., Wu, Cathy H., Arighi, Cecilia N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4381107/
https://www.ncbi.nlm.nih.gov/pubmed/25833953
http://dx.doi.org/10.1093/database/bav020
_version_ 1782364399639461888
author Tudor, Catalina O.
Ross, Karen E.
Li, Gang
Vijay-Shanker, K.
Wu, Cathy H.
Arighi, Cecilia N.
author_facet Tudor, Catalina O.
Ross, Karen E.
Li, Gang
Vijay-Shanker, K.
Wu, Cathy H.
Arighi, Cecilia N.
author_sort Tudor, Catalina O.
collection PubMed
description Protein phosphorylation is a reversible post-translational modification where a protein kinase adds a phosphate group to a protein, potentially regulating its function, localization and/or activity. Phosphorylation can affect protein–protein interactions (PPIs), abolishing interaction with previous binding partners or enabling new interactions. Extracting phosphorylation information coupled with PPI information from the scientific literature will facilitate the creation of phosphorylation interaction networks of kinases, substrates and interacting partners, toward knowledge discovery of functional outcomes of protein phosphorylation. Increasingly, PPI databases are interested in capturing the phosphorylation state of interacting partners. We have previously developed the eFIP (Extracting Functional Impact of Phosphorylation) text mining system, which identifies phosphorylated proteins and phosphorylation-dependent PPIs. In this work, we present several enhancements for the eFIP system: (i) text mining for full-length articles from the PubMed Central open-access collection; (ii) the integration of the RLIMS-P 2.0 system for the extraction of phosphorylation events with kinase, substrate and site information; (iii) the extension of the PPI module with new trigger words/phrases describing interactions and (iv) the addition of the iSimp tool for sentence simplification to aid in the matching of syntactic patterns. We enhance the website functionality to: (i) support searches based on protein roles (kinases, substrates, interacting partners) or using keywords; (ii) link protein entities to their corresponding UniProt identifiers if mapped and (iii) support visual exploration of phosphorylation interaction networks using Cytoscape. The evaluation of eFIP on full-length articles achieved 92.4% precision, 76.5% recall and 83.7% F-measure on 100 article sections. To demonstrate eFIP for knowledge extraction and discovery, we constructed phosphorylation-dependent interaction networks involving 14-3-3 proteins identified from cancer-related versus diabetes-related articles. Comparison of the phosphorylation interaction network of kinases, phosphoproteins and interactants obtained from eFIP searches, along with enrichment analysis of the protein set, revealed several shared interactions, highlighting common pathways discussed in the context of both diseases. Database URL: http://proteininformationresource.org/efip
format Online
Article
Text
id pubmed-4381107
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-43811072015-04-03 Construction of phosphorylation interaction networks by text mining of full-length articles using the eFIP system Tudor, Catalina O. Ross, Karen E. Li, Gang Vijay-Shanker, K. Wu, Cathy H. Arighi, Cecilia N. Database (Oxford) Original Article Protein phosphorylation is a reversible post-translational modification where a protein kinase adds a phosphate group to a protein, potentially regulating its function, localization and/or activity. Phosphorylation can affect protein–protein interactions (PPIs), abolishing interaction with previous binding partners or enabling new interactions. Extracting phosphorylation information coupled with PPI information from the scientific literature will facilitate the creation of phosphorylation interaction networks of kinases, substrates and interacting partners, toward knowledge discovery of functional outcomes of protein phosphorylation. Increasingly, PPI databases are interested in capturing the phosphorylation state of interacting partners. We have previously developed the eFIP (Extracting Functional Impact of Phosphorylation) text mining system, which identifies phosphorylated proteins and phosphorylation-dependent PPIs. In this work, we present several enhancements for the eFIP system: (i) text mining for full-length articles from the PubMed Central open-access collection; (ii) the integration of the RLIMS-P 2.0 system for the extraction of phosphorylation events with kinase, substrate and site information; (iii) the extension of the PPI module with new trigger words/phrases describing interactions and (iv) the addition of the iSimp tool for sentence simplification to aid in the matching of syntactic patterns. We enhance the website functionality to: (i) support searches based on protein roles (kinases, substrates, interacting partners) or using keywords; (ii) link protein entities to their corresponding UniProt identifiers if mapped and (iii) support visual exploration of phosphorylation interaction networks using Cytoscape. The evaluation of eFIP on full-length articles achieved 92.4% precision, 76.5% recall and 83.7% F-measure on 100 article sections. To demonstrate eFIP for knowledge extraction and discovery, we constructed phosphorylation-dependent interaction networks involving 14-3-3 proteins identified from cancer-related versus diabetes-related articles. Comparison of the phosphorylation interaction network of kinases, phosphoproteins and interactants obtained from eFIP searches, along with enrichment analysis of the protein set, revealed several shared interactions, highlighting common pathways discussed in the context of both diseases. Database URL: http://proteininformationresource.org/efip Oxford University Press 2015-03-31 /pmc/articles/PMC4381107/ /pubmed/25833953 http://dx.doi.org/10.1093/database/bav020 Text en © The Author(s) 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Tudor, Catalina O.
Ross, Karen E.
Li, Gang
Vijay-Shanker, K.
Wu, Cathy H.
Arighi, Cecilia N.
Construction of phosphorylation interaction networks by text mining of full-length articles using the eFIP system
title Construction of phosphorylation interaction networks by text mining of full-length articles using the eFIP system
title_full Construction of phosphorylation interaction networks by text mining of full-length articles using the eFIP system
title_fullStr Construction of phosphorylation interaction networks by text mining of full-length articles using the eFIP system
title_full_unstemmed Construction of phosphorylation interaction networks by text mining of full-length articles using the eFIP system
title_short Construction of phosphorylation interaction networks by text mining of full-length articles using the eFIP system
title_sort construction of phosphorylation interaction networks by text mining of full-length articles using the efip system
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4381107/
https://www.ncbi.nlm.nih.gov/pubmed/25833953
http://dx.doi.org/10.1093/database/bav020
work_keys_str_mv AT tudorcatalinao constructionofphosphorylationinteractionnetworksbytextminingoffulllengtharticlesusingtheefipsystem
AT rosskarene constructionofphosphorylationinteractionnetworksbytextminingoffulllengtharticlesusingtheefipsystem
AT ligang constructionofphosphorylationinteractionnetworksbytextminingoffulllengtharticlesusingtheefipsystem
AT vijayshankerk constructionofphosphorylationinteractionnetworksbytextminingoffulllengtharticlesusingtheefipsystem
AT wucathyh constructionofphosphorylationinteractionnetworksbytextminingoffulllengtharticlesusingtheefipsystem
AT arighicecilian constructionofphosphorylationinteractionnetworksbytextminingoffulllengtharticlesusingtheefipsystem