Cargando…

Text Detective: a rule-based system for gene annotation in biomedical texts

BACKGROUND: The identification of mentions of gene or gene products in biomedical texts is a critical step in the development of text mining applications in biosciences. The complexity and ambiguity of gene nomenclature makes this a very difficult task. METHODS: Here we present a novel approach base...

Descripción completa

Detalles Bibliográficos
Autor principal: Tamames, Javier
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1869003/
https://www.ncbi.nlm.nih.gov/pubmed/15960822
http://dx.doi.org/10.1186/1471-2105-6-S1-S10
_version_ 1782133424605102080
author Tamames, Javier
author_facet Tamames, Javier
author_sort Tamames, Javier
collection PubMed
description BACKGROUND: The identification of mentions of gene or gene products in biomedical texts is a critical step in the development of text mining applications in biosciences. The complexity and ambiguity of gene nomenclature makes this a very difficult task. METHODS: Here we present a novel approach based on a combination of carefully designed rules and several lexicons of biological concepts, implemented in the Text Detective system. Text Detective is able to normalize the results of gene mentions found by offering the appropriate database reference. RESULTS: In BioCreAtIvE evaluation, Text Detective achieved results of 84% precision, 71% recall for task 1A, and 79% precision, 71% recall for mouse genes in task 1B.
format Text
id pubmed-1869003
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18690032007-05-18 Text Detective: a rule-based system for gene annotation in biomedical texts Tamames, Javier BMC Bioinformatics Report BACKGROUND: The identification of mentions of gene or gene products in biomedical texts is a critical step in the development of text mining applications in biosciences. The complexity and ambiguity of gene nomenclature makes this a very difficult task. METHODS: Here we present a novel approach based on a combination of carefully designed rules and several lexicons of biological concepts, implemented in the Text Detective system. Text Detective is able to normalize the results of gene mentions found by offering the appropriate database reference. RESULTS: In BioCreAtIvE evaluation, Text Detective achieved results of 84% precision, 71% recall for task 1A, and 79% precision, 71% recall for mouse genes in task 1B. BioMed Central 2005-05-24 /pmc/articles/PMC1869003/ /pubmed/15960822 http://dx.doi.org/10.1186/1471-2105-6-S1-S10 Text en Copyright © 2005 Tamames; licensee BioMed Central Ltd http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Report
Tamames, Javier
Text Detective: a rule-based system for gene annotation in biomedical texts
title Text Detective: a rule-based system for gene annotation in biomedical texts
title_full Text Detective: a rule-based system for gene annotation in biomedical texts
title_fullStr Text Detective: a rule-based system for gene annotation in biomedical texts
title_full_unstemmed Text Detective: a rule-based system for gene annotation in biomedical texts
title_short Text Detective: a rule-based system for gene annotation in biomedical texts
title_sort text detective: a rule-based system for gene annotation in biomedical texts
topic Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1869003/
https://www.ncbi.nlm.nih.gov/pubmed/15960822
http://dx.doi.org/10.1186/1471-2105-6-S1-S10
work_keys_str_mv AT tamamesjavier textdetectivearulebasedsystemforgeneannotationinbiomedicaltexts