Cargando…
PathNER: a tool for systematic identification of biological pathway mentions in the literature
BACKGROUND: Biological pathways are central to many biomedical studies and are frequently discussed in the literature. Several curated databases have been established to collate the knowledge of molecular processes constituting pathways. Yet, there has been little focus on enabling systematic detect...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852116/ https://www.ncbi.nlm.nih.gov/pubmed/24555844 http://dx.doi.org/10.1186/1752-0509-7-S3-S2 |
_version_ | 1782478612611465216 |
---|---|
author | Wu, Chengkun Schwartz, Jean-Marc Nenadic, Goran |
author_facet | Wu, Chengkun Schwartz, Jean-Marc Nenadic, Goran |
author_sort | Wu, Chengkun |
collection | PubMed |
description | BACKGROUND: Biological pathways are central to many biomedical studies and are frequently discussed in the literature. Several curated databases have been established to collate the knowledge of molecular processes constituting pathways. Yet, there has been little focus on enabling systematic detection of pathway mentions in the literature. RESULTS: We developed a tool, named PathNER (Pathway Named Entity Recognition), for the systematic identification of pathway mentions in the literature. PathNER is based on soft dictionary matching and rules, with the dictionary generated from public pathway databases. The rules utilise general pathway-specific keywords, syntactic information and gene/protein mentions. Detection results from both components are merged. On a gold-standard corpus, PathNER achieved an F1-score of 84%. To illustrate its potential, we applied PathNER on a collection of articles related to Alzheimer's disease to identify associated pathways, highlighting cases that can complement an existing manually curated knowledgebase. CONCLUSIONS: In contrast to existing text-mining efforts that target the automatic reconstruction of pathway details from molecular interactions mentioned in the literature, PathNER focuses on identifying specific named pathway mentions. These mentions can be used to support large-scale curation and pathway-related systems biology applications, as demonstrated in the example of Alzheimer's disease. PathNER is implemented in Java and made freely available online at http://sourceforge.net/projects/pathner/. |
format | Online Article Text |
id | pubmed-3852116 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-38521162013-12-20 PathNER: a tool for systematic identification of biological pathway mentions in the literature Wu, Chengkun Schwartz, Jean-Marc Nenadic, Goran BMC Syst Biol Research BACKGROUND: Biological pathways are central to many biomedical studies and are frequently discussed in the literature. Several curated databases have been established to collate the knowledge of molecular processes constituting pathways. Yet, there has been little focus on enabling systematic detection of pathway mentions in the literature. RESULTS: We developed a tool, named PathNER (Pathway Named Entity Recognition), for the systematic identification of pathway mentions in the literature. PathNER is based on soft dictionary matching and rules, with the dictionary generated from public pathway databases. The rules utilise general pathway-specific keywords, syntactic information and gene/protein mentions. Detection results from both components are merged. On a gold-standard corpus, PathNER achieved an F1-score of 84%. To illustrate its potential, we applied PathNER on a collection of articles related to Alzheimer's disease to identify associated pathways, highlighting cases that can complement an existing manually curated knowledgebase. CONCLUSIONS: In contrast to existing text-mining efforts that target the automatic reconstruction of pathway details from molecular interactions mentioned in the literature, PathNER focuses on identifying specific named pathway mentions. These mentions can be used to support large-scale curation and pathway-related systems biology applications, as demonstrated in the example of Alzheimer's disease. PathNER is implemented in Java and made freely available online at http://sourceforge.net/projects/pathner/. BioMed Central 2013-10-16 /pmc/articles/PMC3852116/ /pubmed/24555844 http://dx.doi.org/10.1186/1752-0509-7-S3-S2 Text en Copyright © 2013 Wu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Wu, Chengkun Schwartz, Jean-Marc Nenadic, Goran PathNER: a tool for systematic identification of biological pathway mentions in the literature |
title | PathNER: a tool for systematic identification of biological pathway mentions in the literature |
title_full | PathNER: a tool for systematic identification of biological pathway mentions in the literature |
title_fullStr | PathNER: a tool for systematic identification of biological pathway mentions in the literature |
title_full_unstemmed | PathNER: a tool for systematic identification of biological pathway mentions in the literature |
title_short | PathNER: a tool for systematic identification of biological pathway mentions in the literature |
title_sort | pathner: a tool for systematic identification of biological pathway mentions in the literature |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852116/ https://www.ncbi.nlm.nih.gov/pubmed/24555844 http://dx.doi.org/10.1186/1752-0509-7-S3-S2 |
work_keys_str_mv | AT wuchengkun pathneratoolforsystematicidentificationofbiologicalpathwaymentionsintheliterature AT schwartzjeanmarc pathneratoolforsystematicidentificationofbiologicalpathwaymentionsintheliterature AT nenadicgoran pathneratoolforsystematicidentificationofbiologicalpathwaymentionsintheliterature |