Cargando…
GeneTUKit: a software for document-level gene normalization
Motivation: Linking gene mentions in an article to entries of biological databases can facilitate indexing and querying biological literature greatly. Due to the high ambiguity of gene names, this task is particularly challenging. Manual annotation for this task is cost expensive, time consuming and...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3065680/ https://www.ncbi.nlm.nih.gov/pubmed/21303863 http://dx.doi.org/10.1093/bioinformatics/btr042 |
_version_ | 1782201012658896896 |
---|---|
author | Huang, Minlie Liu, Jingchen Zhu, Xiaoyan |
author_facet | Huang, Minlie Liu, Jingchen Zhu, Xiaoyan |
author_sort | Huang, Minlie |
collection | PubMed |
description | Motivation: Linking gene mentions in an article to entries of biological databases can facilitate indexing and querying biological literature greatly. Due to the high ambiguity of gene names, this task is particularly challenging. Manual annotation for this task is cost expensive, time consuming and labor intensive. Therefore, providing assistive tools to facilitate the task is of high value. Results: We developed GeneTUKit, a document-level gene normalization software for full-text articles. This software employs both local context surrounding gene mentions and global context from the whole full-text document. It can normalize genes of different species simultaneously. When participating in BioCreAtIvE III, the system obtained good results among 37 runs: the system was ranked first, fourth and seventh in terms of TAP-20, TAP-10 and TAP-5, respectively on the 507 full-text test articles. Availability and implementation: The software is available at http://www.qanswers.net/GeneTUKit/. Contact: aihuang@tsinghua.edu.cn |
format | Text |
id | pubmed-3065680 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-30656802011-03-30 GeneTUKit: a software for document-level gene normalization Huang, Minlie Liu, Jingchen Zhu, Xiaoyan Bioinformatics Applications Note Motivation: Linking gene mentions in an article to entries of biological databases can facilitate indexing and querying biological literature greatly. Due to the high ambiguity of gene names, this task is particularly challenging. Manual annotation for this task is cost expensive, time consuming and labor intensive. Therefore, providing assistive tools to facilitate the task is of high value. Results: We developed GeneTUKit, a document-level gene normalization software for full-text articles. This software employs both local context surrounding gene mentions and global context from the whole full-text document. It can normalize genes of different species simultaneously. When participating in BioCreAtIvE III, the system obtained good results among 37 runs: the system was ranked first, fourth and seventh in terms of TAP-20, TAP-10 and TAP-5, respectively on the 507 full-text test articles. Availability and implementation: The software is available at http://www.qanswers.net/GeneTUKit/. Contact: aihuang@tsinghua.edu.cn Oxford University Press 2011-04-01 2011-02-08 /pmc/articles/PMC3065680/ /pubmed/21303863 http://dx.doi.org/10.1093/bioinformatics/btr042 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Note Huang, Minlie Liu, Jingchen Zhu, Xiaoyan GeneTUKit: a software for document-level gene normalization |
title | GeneTUKit: a software for document-level gene normalization |
title_full | GeneTUKit: a software for document-level gene normalization |
title_fullStr | GeneTUKit: a software for document-level gene normalization |
title_full_unstemmed | GeneTUKit: a software for document-level gene normalization |
title_short | GeneTUKit: a software for document-level gene normalization |
title_sort | genetukit: a software for document-level gene normalization |
topic | Applications Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3065680/ https://www.ncbi.nlm.nih.gov/pubmed/21303863 http://dx.doi.org/10.1093/bioinformatics/btr042 |
work_keys_str_mv | AT huangminlie genetukitasoftwarefordocumentlevelgenenormalization AT liujingchen genetukitasoftwarefordocumentlevelgenenormalization AT zhuxiaoyan genetukitasoftwarefordocumentlevelgenenormalization |