Cargando…

SR4GN: A Species Recognition Software Tool for Gene Normalization

As suggested in recent studies, species recognition and disambiguation is one of the most critical and challenging steps in many downstream text-mining applications such as the gene normalization task and protein-protein interaction extraction. We report SR4GN: an open source tool for species recogn...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Chih-Hsuan, Kao, Hung-Yu, Lu, Zhiyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3367953/
https://www.ncbi.nlm.nih.gov/pubmed/22679507
http://dx.doi.org/10.1371/journal.pone.0038460
_version_ 1782234891345199104
author Wei, Chih-Hsuan
Kao, Hung-Yu
Lu, Zhiyong
author_facet Wei, Chih-Hsuan
Kao, Hung-Yu
Lu, Zhiyong
author_sort Wei, Chih-Hsuan
collection PubMed
description As suggested in recent studies, species recognition and disambiguation is one of the most critical and challenging steps in many downstream text-mining applications such as the gene normalization task and protein-protein interaction extraction. We report SR4GN: an open source tool for species recognition and disambiguation in biomedical text. In addition to the species detection function in existing tools, SR4GN is optimized for the Gene Normalization task. As such it is developed to link detected species with corresponding gene mentions in a document. SR4GN achieves 85.42% in accuracy and compares favorably to the other state-of-the-art techniques in benchmark experiments. Finally, SR4GN is implemented as a standalone software tool, thus making it convenient and robust for use in many text-mining applications. SR4GN can be downloaded at: http://www.ncbi.nlm.nih.gov/CBBresearch/Lu/downloads/SR4GN
format Online
Article
Text
id pubmed-3367953
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33679532012-06-07 SR4GN: A Species Recognition Software Tool for Gene Normalization Wei, Chih-Hsuan Kao, Hung-Yu Lu, Zhiyong PLoS One Research Article As suggested in recent studies, species recognition and disambiguation is one of the most critical and challenging steps in many downstream text-mining applications such as the gene normalization task and protein-protein interaction extraction. We report SR4GN: an open source tool for species recognition and disambiguation in biomedical text. In addition to the species detection function in existing tools, SR4GN is optimized for the Gene Normalization task. As such it is developed to link detected species with corresponding gene mentions in a document. SR4GN achieves 85.42% in accuracy and compares favorably to the other state-of-the-art techniques in benchmark experiments. Finally, SR4GN is implemented as a standalone software tool, thus making it convenient and robust for use in many text-mining applications. SR4GN can be downloaded at: http://www.ncbi.nlm.nih.gov/CBBresearch/Lu/downloads/SR4GN Public Library of Science 2012-06-05 /pmc/articles/PMC3367953/ /pubmed/22679507 http://dx.doi.org/10.1371/journal.pone.0038460 Text en This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication. https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Wei, Chih-Hsuan
Kao, Hung-Yu
Lu, Zhiyong
SR4GN: A Species Recognition Software Tool for Gene Normalization
title SR4GN: A Species Recognition Software Tool for Gene Normalization
title_full SR4GN: A Species Recognition Software Tool for Gene Normalization
title_fullStr SR4GN: A Species Recognition Software Tool for Gene Normalization
title_full_unstemmed SR4GN: A Species Recognition Software Tool for Gene Normalization
title_short SR4GN: A Species Recognition Software Tool for Gene Normalization
title_sort sr4gn: a species recognition software tool for gene normalization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3367953/
https://www.ncbi.nlm.nih.gov/pubmed/22679507
http://dx.doi.org/10.1371/journal.pone.0038460
work_keys_str_mv AT weichihhsuan sr4gnaspeciesrecognitionsoftwaretoolforgenenormalization
AT kaohungyu sr4gnaspeciesrecognitionsoftwaretoolforgenenormalization
AT luzhiyong sr4gnaspeciesrecognitionsoftwaretoolforgenenormalization