Cargando…

GeneTUKit: a software for document-level gene normalization

Motivation: Linking gene mentions in an article to entries of biological databases can facilitate indexing and querying biological literature greatly. Due to the high ambiguity of gene names, this task is particularly challenging. Manual annotation for this task is cost expensive, time consuming and...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Minlie, Liu, Jingchen, Zhu, Xiaoyan
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3065680/
https://www.ncbi.nlm.nih.gov/pubmed/21303863
http://dx.doi.org/10.1093/bioinformatics/btr042
_version_ 1782201012658896896
author Huang, Minlie
Liu, Jingchen
Zhu, Xiaoyan
author_facet Huang, Minlie
Liu, Jingchen
Zhu, Xiaoyan
author_sort Huang, Minlie
collection PubMed
description Motivation: Linking gene mentions in an article to entries of biological databases can facilitate indexing and querying biological literature greatly. Due to the high ambiguity of gene names, this task is particularly challenging. Manual annotation for this task is cost expensive, time consuming and labor intensive. Therefore, providing assistive tools to facilitate the task is of high value. Results: We developed GeneTUKit, a document-level gene normalization software for full-text articles. This software employs both local context surrounding gene mentions and global context from the whole full-text document. It can normalize genes of different species simultaneously. When participating in BioCreAtIvE III, the system obtained good results among 37 runs: the system was ranked first, fourth and seventh in terms of TAP-20, TAP-10 and TAP-5, respectively on the 507 full-text test articles. Availability and implementation: The software is available at http://www.qanswers.net/GeneTUKit/. Contact: aihuang@tsinghua.edu.cn
format Text
id pubmed-3065680
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30656802011-03-30 GeneTUKit: a software for document-level gene normalization Huang, Minlie Liu, Jingchen Zhu, Xiaoyan Bioinformatics Applications Note Motivation: Linking gene mentions in an article to entries of biological databases can facilitate indexing and querying biological literature greatly. Due to the high ambiguity of gene names, this task is particularly challenging. Manual annotation for this task is cost expensive, time consuming and labor intensive. Therefore, providing assistive tools to facilitate the task is of high value. Results: We developed GeneTUKit, a document-level gene normalization software for full-text articles. This software employs both local context surrounding gene mentions and global context from the whole full-text document. It can normalize genes of different species simultaneously. When participating in BioCreAtIvE III, the system obtained good results among 37 runs: the system was ranked first, fourth and seventh in terms of TAP-20, TAP-10 and TAP-5, respectively on the 507 full-text test articles. Availability and implementation: The software is available at http://www.qanswers.net/GeneTUKit/. Contact: aihuang@tsinghua.edu.cn Oxford University Press 2011-04-01 2011-02-08 /pmc/articles/PMC3065680/ /pubmed/21303863 http://dx.doi.org/10.1093/bioinformatics/btr042 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Huang, Minlie
Liu, Jingchen
Zhu, Xiaoyan
GeneTUKit: a software for document-level gene normalization
title GeneTUKit: a software for document-level gene normalization
title_full GeneTUKit: a software for document-level gene normalization
title_fullStr GeneTUKit: a software for document-level gene normalization
title_full_unstemmed GeneTUKit: a software for document-level gene normalization
title_short GeneTUKit: a software for document-level gene normalization
title_sort genetukit: a software for document-level gene normalization
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3065680/
https://www.ncbi.nlm.nih.gov/pubmed/21303863
http://dx.doi.org/10.1093/bioinformatics/btr042
work_keys_str_mv AT huangminlie genetukitasoftwarefordocumentlevelgenenormalization
AT liujingchen genetukitasoftwarefordocumentlevelgenenormalization
AT zhuxiaoyan genetukitasoftwarefordocumentlevelgenenormalization