Cargando…
Chemical Entity Recognition and Resolution to ChEBI
Chemical entities are ubiquitous through the biomedical literature and the development of text-mining systems that can efficiently identify those entities are required. Due to the lack of available corpora and data resources, the community has focused its efforts in the development of gene and prote...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
International Scholarly Research Network
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393067/ https://www.ncbi.nlm.nih.gov/pubmed/25937941 http://dx.doi.org/10.5402/2012/619427 |
_version_ | 1782366093674807296 |
---|---|
author | Grego, Tiago Pesquita, Catia Bastos, Hugo P. Couto, Francisco M. |
author_facet | Grego, Tiago Pesquita, Catia Bastos, Hugo P. Couto, Francisco M. |
author_sort | Grego, Tiago |
collection | PubMed |
description | Chemical entities are ubiquitous through the biomedical literature and the development of text-mining systems that can efficiently identify those entities are required. Due to the lack of available corpora and data resources, the community has focused its efforts in the development of gene and protein named entity recognition systems, but with the release of ChEBI and the availability of an annotated corpus, this task can be addressed. We developed a machine-learning-based method for chemical entity recognition and a lexical-similarity-based method for chemical entity resolution and compared them with Whatizit, a popular-dictionary-based method. Our methods outperformed the dictionary-based method in all tasks, yielding an improvement in F-measure of 20% for the entity recognition task, 2–5% for the entity-resolution task, and 15% for combined entity recognition and resolution tasks. |
format | Online Article Text |
id | pubmed-4393067 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | International Scholarly Research Network |
record_format | MEDLINE/PubMed |
spelling | pubmed-43930672015-05-03 Chemical Entity Recognition and Resolution to ChEBI Grego, Tiago Pesquita, Catia Bastos, Hugo P. Couto, Francisco M. ISRN Bioinform Research Article Chemical entities are ubiquitous through the biomedical literature and the development of text-mining systems that can efficiently identify those entities are required. Due to the lack of available corpora and data resources, the community has focused its efforts in the development of gene and protein named entity recognition systems, but with the release of ChEBI and the availability of an annotated corpus, this task can be addressed. We developed a machine-learning-based method for chemical entity recognition and a lexical-similarity-based method for chemical entity resolution and compared them with Whatizit, a popular-dictionary-based method. Our methods outperformed the dictionary-based method in all tasks, yielding an improvement in F-measure of 20% for the entity recognition task, 2–5% for the entity-resolution task, and 15% for combined entity recognition and resolution tasks. International Scholarly Research Network 2012-02-15 /pmc/articles/PMC4393067/ /pubmed/25937941 http://dx.doi.org/10.5402/2012/619427 Text en Copyright © 2012 Tiago Grego et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Grego, Tiago Pesquita, Catia Bastos, Hugo P. Couto, Francisco M. Chemical Entity Recognition and Resolution to ChEBI |
title | Chemical Entity Recognition and Resolution to ChEBI |
title_full | Chemical Entity Recognition and Resolution to ChEBI |
title_fullStr | Chemical Entity Recognition and Resolution to ChEBI |
title_full_unstemmed | Chemical Entity Recognition and Resolution to ChEBI |
title_short | Chemical Entity Recognition and Resolution to ChEBI |
title_sort | chemical entity recognition and resolution to chebi |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393067/ https://www.ncbi.nlm.nih.gov/pubmed/25937941 http://dx.doi.org/10.5402/2012/619427 |
work_keys_str_mv | AT gregotiago chemicalentityrecognitionandresolutiontochebi AT pesquitacatia chemicalentityrecognitionandresolutiontochebi AT bastoshugop chemicalentityrecognitionandresolutiontochebi AT coutofranciscom chemicalentityrecognitionandresolutiontochebi |