Cargando…

Chemical Entity Recognition and Resolution to ChEBI

Chemical entities are ubiquitous through the biomedical literature and the development of text-mining systems that can efficiently identify those entities are required. Due to the lack of available corpora and data resources, the community has focused its efforts in the development of gene and prote...

Descripción completa

Detalles Bibliográficos
Autores principales: Grego, Tiago, Pesquita, Catia, Bastos, Hugo P., Couto, Francisco M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: International Scholarly Research Network 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393067/
https://www.ncbi.nlm.nih.gov/pubmed/25937941
http://dx.doi.org/10.5402/2012/619427
_version_ 1782366093674807296
author Grego, Tiago
Pesquita, Catia
Bastos, Hugo P.
Couto, Francisco M.
author_facet Grego, Tiago
Pesquita, Catia
Bastos, Hugo P.
Couto, Francisco M.
author_sort Grego, Tiago
collection PubMed
description Chemical entities are ubiquitous through the biomedical literature and the development of text-mining systems that can efficiently identify those entities are required. Due to the lack of available corpora and data resources, the community has focused its efforts in the development of gene and protein named entity recognition systems, but with the release of ChEBI and the availability of an annotated corpus, this task can be addressed. We developed a machine-learning-based method for chemical entity recognition and a lexical-similarity-based method for chemical entity resolution and compared them with Whatizit, a popular-dictionary-based method. Our methods outperformed the dictionary-based method in all tasks, yielding an improvement in F-measure of 20% for the entity recognition task, 2–5% for the entity-resolution task, and 15% for combined entity recognition and resolution tasks.
format Online
Article
Text
id pubmed-4393067
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher International Scholarly Research Network
record_format MEDLINE/PubMed
spelling pubmed-43930672015-05-03 Chemical Entity Recognition and Resolution to ChEBI Grego, Tiago Pesquita, Catia Bastos, Hugo P. Couto, Francisco M. ISRN Bioinform Research Article Chemical entities are ubiquitous through the biomedical literature and the development of text-mining systems that can efficiently identify those entities are required. Due to the lack of available corpora and data resources, the community has focused its efforts in the development of gene and protein named entity recognition systems, but with the release of ChEBI and the availability of an annotated corpus, this task can be addressed. We developed a machine-learning-based method for chemical entity recognition and a lexical-similarity-based method for chemical entity resolution and compared them with Whatizit, a popular-dictionary-based method. Our methods outperformed the dictionary-based method in all tasks, yielding an improvement in F-measure of 20% for the entity recognition task, 2–5% for the entity-resolution task, and 15% for combined entity recognition and resolution tasks. International Scholarly Research Network 2012-02-15 /pmc/articles/PMC4393067/ /pubmed/25937941 http://dx.doi.org/10.5402/2012/619427 Text en Copyright © 2012 Tiago Grego et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Grego, Tiago
Pesquita, Catia
Bastos, Hugo P.
Couto, Francisco M.
Chemical Entity Recognition and Resolution to ChEBI
title Chemical Entity Recognition and Resolution to ChEBI
title_full Chemical Entity Recognition and Resolution to ChEBI
title_fullStr Chemical Entity Recognition and Resolution to ChEBI
title_full_unstemmed Chemical Entity Recognition and Resolution to ChEBI
title_short Chemical Entity Recognition and Resolution to ChEBI
title_sort chemical entity recognition and resolution to chebi
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393067/
https://www.ncbi.nlm.nih.gov/pubmed/25937941
http://dx.doi.org/10.5402/2012/619427
work_keys_str_mv AT gregotiago chemicalentityrecognitionandresolutiontochebi
AT pesquitacatia chemicalentityrecognitionandresolutiontochebi
AT bastoshugop chemicalentityrecognitionandresolutiontochebi
AT coutofranciscom chemicalentityrecognitionandresolutiontochebi