Cargando…

OntoMate: a text-mining tool aiding curation at the Rat Genome Database

The Rat Genome Database (RGD) is the premier repository of rat genomic, genetic and physiologic data. Converting data from free text in the scientific literature to a structured format is one of the main tasks of all model organism databases. RGD spends considerable effort manually curating gene, Qu...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Weisong, Laulederkind, Stanley J. F., Hayman, G. Thomas, Wang, Shur-Jen, Nigam, Rajni, Smith, Jennifer R., De Pons, Jeff, Dwinell, Melinda R., Shimoyama, Mary
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4305386/
https://www.ncbi.nlm.nih.gov/pubmed/25619558
http://dx.doi.org/10.1093/database/bau129
_version_ 1782354227150979072
author Liu, Weisong
Laulederkind, Stanley J. F.
Hayman, G. Thomas
Wang, Shur-Jen
Nigam, Rajni
Smith, Jennifer R.
De Pons, Jeff
Dwinell, Melinda R.
Shimoyama, Mary
author_facet Liu, Weisong
Laulederkind, Stanley J. F.
Hayman, G. Thomas
Wang, Shur-Jen
Nigam, Rajni
Smith, Jennifer R.
De Pons, Jeff
Dwinell, Melinda R.
Shimoyama, Mary
author_sort Liu, Weisong
collection PubMed
description The Rat Genome Database (RGD) is the premier repository of rat genomic, genetic and physiologic data. Converting data from free text in the scientific literature to a structured format is one of the main tasks of all model organism databases. RGD spends considerable effort manually curating gene, Quantitative Trait Locus (QTL) and strain information. The rapidly growing volume of biomedical literature and the active research in the biological natural language processing (bioNLP) community have given RGD the impetus to adopt text-mining tools to improve curation efficiency. Recently, RGD has initiated a project to use OntoMate, an ontology-driven, concept-based literature search engine developed at RGD, as a replacement for the PubMed (http://www.ncbi.nlm.nih.gov/pubmed) search engine in the gene curation workflow. OntoMate tags abstracts with gene names, gene mutations, organism name and most of the 16 ontologies/vocabularies used at RGD. All terms/ entities tagged to an abstract are listed with the abstract in the search results. All listed terms are linked both to data entry boxes and a term browser in the curation tool. OntoMate also provides user-activated filters for species, date and other parameters relevant to the literature search. Using the system for literature search and import has streamlined the process compared to using PubMed. The system was built with a scalable and open architecture, including features specifically designed to accelerate the RGD gene curation process. With the use of bioNLP tools, RGD has added more automation to its curation workflow. Database URL: http://rgd.mcw.edu
format Online
Article
Text
id pubmed-4305386
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-43053862015-02-24 OntoMate: a text-mining tool aiding curation at the Rat Genome Database Liu, Weisong Laulederkind, Stanley J. F. Hayman, G. Thomas Wang, Shur-Jen Nigam, Rajni Smith, Jennifer R. De Pons, Jeff Dwinell, Melinda R. Shimoyama, Mary Database (Oxford) Database Tool The Rat Genome Database (RGD) is the premier repository of rat genomic, genetic and physiologic data. Converting data from free text in the scientific literature to a structured format is one of the main tasks of all model organism databases. RGD spends considerable effort manually curating gene, Quantitative Trait Locus (QTL) and strain information. The rapidly growing volume of biomedical literature and the active research in the biological natural language processing (bioNLP) community have given RGD the impetus to adopt text-mining tools to improve curation efficiency. Recently, RGD has initiated a project to use OntoMate, an ontology-driven, concept-based literature search engine developed at RGD, as a replacement for the PubMed (http://www.ncbi.nlm.nih.gov/pubmed) search engine in the gene curation workflow. OntoMate tags abstracts with gene names, gene mutations, organism name and most of the 16 ontologies/vocabularies used at RGD. All terms/ entities tagged to an abstract are listed with the abstract in the search results. All listed terms are linked both to data entry boxes and a term browser in the curation tool. OntoMate also provides user-activated filters for species, date and other parameters relevant to the literature search. Using the system for literature search and import has streamlined the process compared to using PubMed. The system was built with a scalable and open architecture, including features specifically designed to accelerate the RGD gene curation process. With the use of bioNLP tools, RGD has added more automation to its curation workflow. Database URL: http://rgd.mcw.edu Oxford University Press 2015-01-25 /pmc/articles/PMC4305386/ /pubmed/25619558 http://dx.doi.org/10.1093/database/bau129 Text en © The Author(s) 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Tool
Liu, Weisong
Laulederkind, Stanley J. F.
Hayman, G. Thomas
Wang, Shur-Jen
Nigam, Rajni
Smith, Jennifer R.
De Pons, Jeff
Dwinell, Melinda R.
Shimoyama, Mary
OntoMate: a text-mining tool aiding curation at the Rat Genome Database
title OntoMate: a text-mining tool aiding curation at the Rat Genome Database
title_full OntoMate: a text-mining tool aiding curation at the Rat Genome Database
title_fullStr OntoMate: a text-mining tool aiding curation at the Rat Genome Database
title_full_unstemmed OntoMate: a text-mining tool aiding curation at the Rat Genome Database
title_short OntoMate: a text-mining tool aiding curation at the Rat Genome Database
title_sort ontomate: a text-mining tool aiding curation at the rat genome database
topic Database Tool
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4305386/
https://www.ncbi.nlm.nih.gov/pubmed/25619558
http://dx.doi.org/10.1093/database/bau129
work_keys_str_mv AT liuweisong ontomateatextminingtoolaidingcurationattheratgenomedatabase
AT laulederkindstanleyjf ontomateatextminingtoolaidingcurationattheratgenomedatabase
AT haymangthomas ontomateatextminingtoolaidingcurationattheratgenomedatabase
AT wangshurjen ontomateatextminingtoolaidingcurationattheratgenomedatabase
AT nigamrajni ontomateatextminingtoolaidingcurationattheratgenomedatabase
AT smithjenniferr ontomateatextminingtoolaidingcurationattheratgenomedatabase
AT deponsjeff ontomateatextminingtoolaidingcurationattheratgenomedatabase
AT dwinellmelindar ontomateatextminingtoolaidingcurationattheratgenomedatabase
AT shimoyamamary ontomateatextminingtoolaidingcurationattheratgenomedatabase