
Biocuration workflows and text mining: overview of the BioCreative 2012 Workshop Track II

Manual curation of data from the biomedical literature is a rate-limiting factor for many expert curated databases. Despite the continuing advances in biomedical text mining and the pressing needs of biocurators for better tools, few existing text-mining tools have been successfully integrated into...

Bibliographic Details
Main Authors: Lu, Zhiyong; Hirschman, Lynette
Format: Online Article Text
Language: English
Published: Oxford University Press 2012
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3500522/
https://www.ncbi.nlm.nih.gov/pubmed/23160416
http://dx.doi.org/10.1093/database/bas043
_version_ 1782250117217124352
author Lu, Zhiyong
Hirschman, Lynette
author_facet Lu, Zhiyong
Hirschman, Lynette
author_sort Lu, Zhiyong
collection PubMed
description Manual curation of data from the biomedical literature is a rate-limiting factor for many expert curated databases. Despite the continuing advances in biomedical text mining and the pressing needs of biocurators for better tools, few existing text-mining tools have been successfully integrated into production literature curation systems such as those used by the expert curated databases. To close this gap and better understand all aspects of literature curation, we invited submissions of written descriptions of curation workflows from expert curated databases for the BioCreative 2012 Workshop Track II. We received seven qualified contributions, primarily from model organism databases. Based on these descriptions, we identified commonalities and differences across the workflows, the common ontologies and controlled vocabularies used and the current and desired uses of text mining for biocuration. Compared to a survey done in 2009, our 2012 results show that many more databases are now using text mining in parts of their curation workflows. In addition, the workshop participants identified text-mining aids for finding gene names and symbols (gene indexing), prioritization of documents for curation (document triage) and ontology concept assignment as those most desired by the biocurators. Database URL: http://www.biocreative.org/tasks/bc-workshop-2012/workflow/
format Online
Article
Text
id pubmed-3500522
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-3500522 2012-11-19 Biocuration workflows and text mining: overview of the BioCreative 2012 Workshop Track II Lu, Zhiyong Hirschman, Lynette Database (Oxford) BioCreative Virtual Issue Manual curation of data from the biomedical literature is a rate-limiting factor for many expert curated databases. Despite the continuing advances in biomedical text mining and the pressing needs of biocurators for better tools, few existing text-mining tools have been successfully integrated into production literature curation systems such as those used by the expert curated databases. To close this gap and better understand all aspects of literature curation, we invited submissions of written descriptions of curation workflows from expert curated databases for the BioCreative 2012 Workshop Track II. We received seven qualified contributions, primarily from model organism databases. Based on these descriptions, we identified commonalities and differences across the workflows, the common ontologies and controlled vocabularies used and the current and desired uses of text mining for biocuration. Compared to a survey done in 2009, our 2012 results show that many more databases are now using text mining in parts of their curation workflows. In addition, the workshop participants identified text-mining aids for finding gene names and symbols (gene indexing), prioritization of documents for curation (document triage) and ontology concept assignment as those most desired by the biocurators. Database URL: http://www.biocreative.org/tasks/bc-workshop-2012/workflow/ Oxford University Press 2012-11-15 /pmc/articles/PMC3500522/ /pubmed/23160416 http://dx.doi.org/10.1093/database/bas043 Text en © The Author(s) 2012. Published by Oxford University Press.
http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
spellingShingle BioCreative Virtual Issue
Lu, Zhiyong
Hirschman, Lynette
Biocuration workflows and text mining: overview of the BioCreative 2012 Workshop Track II
title Biocuration workflows and text mining: overview of the BioCreative 2012 Workshop Track II
title_full Biocuration workflows and text mining: overview of the BioCreative 2012 Workshop Track II
title_fullStr Biocuration workflows and text mining: overview of the BioCreative 2012 Workshop Track II
title_full_unstemmed Biocuration workflows and text mining: overview of the BioCreative 2012 Workshop Track II
title_short Biocuration workflows and text mining: overview of the BioCreative 2012 Workshop Track II
title_sort biocuration workflows and text mining: overview of the biocreative 2012 workshop track ii
topic BioCreative Virtual Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3500522/
https://www.ncbi.nlm.nih.gov/pubmed/23160416
http://dx.doi.org/10.1093/database/bas043
work_keys_str_mv AT luzhiyong biocurationworkflowsandtextminingoverviewofthebiocreative2012workshoptrackii
AT hirschmanlynette biocurationworkflowsandtextminingoverviewofthebiocreative2012workshoptrackii