Cargando…

RLIMS-P: an online text-mining tool for literature-based extraction of protein phosphorylation information

Protein phosphorylation is central to the regulation of most aspects of cell function. Given its importance, it has been the subject of active research as well as the focus of curation in several biological databases. We have developed Rule-based Literature Mining System for protein Phosphorylation...

Descripción completa

Detalles Bibliográficos
Autores principales: Torii, Manabu, Li, Gang, Li, Zhiwen, Oughtred, Rose, Diella, Francesca, Çelen, Irem, Arighi, Cecilia N., Huang, Hongzhan, Vijay-Shanker, K., Wu, Cathy H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4131691/
https://www.ncbi.nlm.nih.gov/pubmed/25122463
http://dx.doi.org/10.1093/database/bau081
_version_ 1782330507555504128
author Torii, Manabu
Li, Gang
Li, Zhiwen
Oughtred, Rose
Diella, Francesca
Çelen, Irem
Arighi, Cecilia N.
Huang, Hongzhan
Vijay-Shanker, K.
Wu, Cathy H.
author_facet Torii, Manabu
Li, Gang
Li, Zhiwen
Oughtred, Rose
Diella, Francesca
Çelen, Irem
Arighi, Cecilia N.
Huang, Hongzhan
Vijay-Shanker, K.
Wu, Cathy H.
author_sort Torii, Manabu
collection PubMed
description Protein phosphorylation is central to the regulation of most aspects of cell function. Given its importance, it has been the subject of active research as well as the focus of curation in several biological databases. We have developed Rule-based Literature Mining System for protein Phosphorylation (RLIMS-P), an online text-mining tool to help curators identify biomedical research articles relevant to protein phosphorylation. The tool presents information on protein kinases, substrates and phosphorylation sites automatically extracted from the biomedical literature. The utility of the RLIMS-P Web site has been evaluated by curators from Phospho.ELM, PhosphoGRID/BioGrid and Protein Ontology as part of the BioCreative IV user interactive task (IAT). The system achieved F-scores of 0.76, 0.88 and 0.92 for the extraction of kinase, substrate and phosphorylation sites, respectively, and a precision of 0.88 in the retrieval of relevant phosphorylation literature. The system also received highly favorable feedback from the curators in a user survey. Based on the curators’ suggestions, the Web site has been enhanced to improve its usability. In the RLIMS-P Web site, phosphorylation information can be retrieved by PubMed IDs or keywords, with an option for selecting targeted species. The result page displays a sortable table with phosphorylation information. The text evidence page displays the abstract with color-coded entity mentions and includes links to UniProtKB entries via normalization, i.e. the linking of entity mentions to database identifiers, facilitated by the GenNorm tool and by the links to the bibliography in UniProt. Log in and editing capabilities are offered to any user interested in contributing to the validation of RLIMS-P results. Retrieved phosphorylation information can also be downloaded in CSV format and the text evidence in the BioC format. RLIMS-P is freely available. Database URL: http://www.proteininformationresource.org/rlimsp/
format Online
Article
Text
id pubmed-4131691
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-41316912014-08-18 RLIMS-P: an online text-mining tool for literature-based extraction of protein phosphorylation information Torii, Manabu Li, Gang Li, Zhiwen Oughtred, Rose Diella, Francesca Çelen, Irem Arighi, Cecilia N. Huang, Hongzhan Vijay-Shanker, K. Wu, Cathy H. Database (Oxford) Original Article Protein phosphorylation is central to the regulation of most aspects of cell function. Given its importance, it has been the subject of active research as well as the focus of curation in several biological databases. We have developed Rule-based Literature Mining System for protein Phosphorylation (RLIMS-P), an online text-mining tool to help curators identify biomedical research articles relevant to protein phosphorylation. The tool presents information on protein kinases, substrates and phosphorylation sites automatically extracted from the biomedical literature. The utility of the RLIMS-P Web site has been evaluated by curators from Phospho.ELM, PhosphoGRID/BioGrid and Protein Ontology as part of the BioCreative IV user interactive task (IAT). The system achieved F-scores of 0.76, 0.88 and 0.92 for the extraction of kinase, substrate and phosphorylation sites, respectively, and a precision of 0.88 in the retrieval of relevant phosphorylation literature. The system also received highly favorable feedback from the curators in a user survey. Based on the curators’ suggestions, the Web site has been enhanced to improve its usability. In the RLIMS-P Web site, phosphorylation information can be retrieved by PubMed IDs or keywords, with an option for selecting targeted species. The result page displays a sortable table with phosphorylation information. The text evidence page displays the abstract with color-coded entity mentions and includes links to UniProtKB entries via normalization, i.e. the linking of entity mentions to database identifiers, facilitated by the GenNorm tool and by the links to the bibliography in UniProt. Log in and editing capabilities are offered to any user interested in contributing to the validation of RLIMS-P results. Retrieved phosphorylation information can also be downloaded in CSV format and the text evidence in the BioC format. RLIMS-P is freely available. Database URL: http://www.proteininformationresource.org/rlimsp/ Oxford University Press 2014-08-13 /pmc/articles/PMC4131691/ /pubmed/25122463 http://dx.doi.org/10.1093/database/bau081 Text en © The Author(s) 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Torii, Manabu
Li, Gang
Li, Zhiwen
Oughtred, Rose
Diella, Francesca
Çelen, Irem
Arighi, Cecilia N.
Huang, Hongzhan
Vijay-Shanker, K.
Wu, Cathy H.
RLIMS-P: an online text-mining tool for literature-based extraction of protein phosphorylation information
title RLIMS-P: an online text-mining tool for literature-based extraction of protein phosphorylation information
title_full RLIMS-P: an online text-mining tool for literature-based extraction of protein phosphorylation information
title_fullStr RLIMS-P: an online text-mining tool for literature-based extraction of protein phosphorylation information
title_full_unstemmed RLIMS-P: an online text-mining tool for literature-based extraction of protein phosphorylation information
title_short RLIMS-P: an online text-mining tool for literature-based extraction of protein phosphorylation information
title_sort rlims-p: an online text-mining tool for literature-based extraction of protein phosphorylation information
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4131691/
https://www.ncbi.nlm.nih.gov/pubmed/25122463
http://dx.doi.org/10.1093/database/bau081
work_keys_str_mv AT toriimanabu rlimspanonlinetextminingtoolforliteraturebasedextractionofproteinphosphorylationinformation
AT ligang rlimspanonlinetextminingtoolforliteraturebasedextractionofproteinphosphorylationinformation
AT lizhiwen rlimspanonlinetextminingtoolforliteraturebasedextractionofproteinphosphorylationinformation
AT oughtredrose rlimspanonlinetextminingtoolforliteraturebasedextractionofproteinphosphorylationinformation
AT diellafrancesca rlimspanonlinetextminingtoolforliteraturebasedextractionofproteinphosphorylationinformation
AT celenirem rlimspanonlinetextminingtoolforliteraturebasedextractionofproteinphosphorylationinformation
AT arighicecilian rlimspanonlinetextminingtoolforliteraturebasedextractionofproteinphosphorylationinformation
AT huanghongzhan rlimspanonlinetextminingtoolforliteraturebasedextractionofproteinphosphorylationinformation
AT vijayshankerk rlimspanonlinetextminingtoolforliteraturebasedextractionofproteinphosphorylationinformation
AT wucathyh rlimspanonlinetextminingtoolforliteraturebasedextractionofproteinphosphorylationinformation