Cargando…
New challenges for text mining: mapping between text and manually curated pathways
BACKGROUND: Associating literature with pathways poses new challenges to the Text Mining (TM) community. There are three main challenges to this task: (1) the identification of the mapping position of a specific entity or reaction in a given pathway, (2) the recognition of the causal relationships a...
Autores principales: | , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2352872/ https://www.ncbi.nlm.nih.gov/pubmed/18426550 http://dx.doi.org/10.1186/1471-2105-9-S3-S5 |
_version_ | 1782152860223406080 |
---|---|
author | Oda, Kanae Kim, Jin-Dong Ohta, Tomoko Okanohara, Daisuke Matsuzaki, Takuya Tateisi, Yuka Tsujii, Jun'ichi |
author_facet | Oda, Kanae Kim, Jin-Dong Ohta, Tomoko Okanohara, Daisuke Matsuzaki, Takuya Tateisi, Yuka Tsujii, Jun'ichi |
author_sort | Oda, Kanae |
collection | PubMed |
description | BACKGROUND: Associating literature with pathways poses new challenges to the Text Mining (TM) community. There are three main challenges to this task: (1) the identification of the mapping position of a specific entity or reaction in a given pathway, (2) the recognition of the causal relationships among multiple reactions, and (3) the formulation and implementation of required inferences based on biological domain knowledge. RESULTS: To address these challenges, we constructed new resources to link the text with a model pathway; they are: the GENIA pathway corpus with event annotation and NF-kB pathway. Through their detailed analysis, we address the untapped resource, ‘bio-inference,’ as well as the differences between text and pathway representation. Here, we show the precise comparisons of their representations and the nine classes of ‘bio-inference’ schemes observed in the pathway corpus. CONCLUSIONS: We believe that the creation of such rich resources and their detailed analysis is the significant first step for accelerating the research of the automatic construction of pathway from text. |
format | Text |
id | pubmed-2352872 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-23528722008-04-29 New challenges for text mining: mapping between text and manually curated pathways Oda, Kanae Kim, Jin-Dong Ohta, Tomoko Okanohara, Daisuke Matsuzaki, Takuya Tateisi, Yuka Tsujii, Jun'ichi BMC Bioinformatics Proceedings BACKGROUND: Associating literature with pathways poses new challenges to the Text Mining (TM) community. There are three main challenges to this task: (1) the identification of the mapping position of a specific entity or reaction in a given pathway, (2) the recognition of the causal relationships among multiple reactions, and (3) the formulation and implementation of required inferences based on biological domain knowledge. RESULTS: To address these challenges, we constructed new resources to link the text with a model pathway; they are: the GENIA pathway corpus with event annotation and NF-kB pathway. Through their detailed analysis, we address the untapped resource, ‘bio-inference,’ as well as the differences between text and pathway representation. Here, we show the precise comparisons of their representations and the nine classes of ‘bio-inference’ schemes observed in the pathway corpus. CONCLUSIONS: We believe that the creation of such rich resources and their detailed analysis is the significant first step for accelerating the research of the automatic construction of pathway from text. BioMed Central 2008-04-11 /pmc/articles/PMC2352872/ /pubmed/18426550 http://dx.doi.org/10.1186/1471-2105-9-S3-S5 Text en Copyright © 2008 Oda et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Oda, Kanae Kim, Jin-Dong Ohta, Tomoko Okanohara, Daisuke Matsuzaki, Takuya Tateisi, Yuka Tsujii, Jun'ichi New challenges for text mining: mapping between text and manually curated pathways |
title | New challenges for text mining: mapping between text and manually curated pathways |
title_full | New challenges for text mining: mapping between text and manually curated pathways |
title_fullStr | New challenges for text mining: mapping between text and manually curated pathways |
title_full_unstemmed | New challenges for text mining: mapping between text and manually curated pathways |
title_short | New challenges for text mining: mapping between text and manually curated pathways |
title_sort | new challenges for text mining: mapping between text and manually curated pathways |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2352872/ https://www.ncbi.nlm.nih.gov/pubmed/18426550 http://dx.doi.org/10.1186/1471-2105-9-S3-S5 |
work_keys_str_mv | AT odakanae newchallengesfortextminingmappingbetweentextandmanuallycuratedpathways AT kimjindong newchallengesfortextminingmappingbetweentextandmanuallycuratedpathways AT ohtatomoko newchallengesfortextminingmappingbetweentextandmanuallycuratedpathways AT okanoharadaisuke newchallengesfortextminingmappingbetweentextandmanuallycuratedpathways AT matsuzakitakuya newchallengesfortextminingmappingbetweentextandmanuallycuratedpathways AT tateisiyuka newchallengesfortextminingmappingbetweentextandmanuallycuratedpathways AT tsujiijunichi newchallengesfortextminingmappingbetweentextandmanuallycuratedpathways |