Accelerating literature curation with text-mining tools: a case study of using PubTator to curate genes in PubMed abstracts

Today’s biomedical research has become heavily dependent on access to the biological knowledge encoded in expert curated biological databases. As the volume of biological literature grows rapidly, it becomes increasingly difficult for biocurators to keep up with the literature because manual curatio...

Bibliographic Details
Main Authors: Wei, Chih-Hsuan, Harris, Bethany R., Li, Donghui, Berardini, Tanya Z., Huala, Eva, Kao, Hung-Yu, Lu, Zhiyong
Format: Online Article Text
Language: English
Published: Oxford University Press 2012
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3500520/
https://www.ncbi.nlm.nih.gov/pubmed/23160414
http://dx.doi.org/10.1093/database/bas041
_version_ 1782250116746313728
author Wei, Chih-Hsuan
Harris, Bethany R.
Li, Donghui
Berardini, Tanya Z.
Huala, Eva
Kao, Hung-Yu
Lu, Zhiyong
author_sort Wei, Chih-Hsuan
collection PubMed
description Today’s biomedical research has become heavily dependent on access to the biological knowledge encoded in expert curated biological databases. As the volume of biological literature grows rapidly, it becomes increasingly difficult for biocurators to keep up with the literature because manual curation is an expensive and time-consuming endeavour. Past research has suggested that computer-assisted curation can improve efficiency, but few text-mining systems have been formally evaluated in this regard. Through participation in the interactive text-mining track of the BioCreative 2012 workshop, we developed PubTator, a PubMed-like system that assists with two specific human curation tasks: document triage and bioconcept annotation. On the basis of evaluation results from two external user groups, we find that the accuracy of PubTator-assisted curation is comparable with that of manual curation and that PubTator can significantly increase human curatorial speed. These encouraging findings warrant further investigation with a larger number of publications to be annotated. Database URL: http://www.ncbi.nlm.nih.gov/CBBresearch/Lu/Demo/PubTator/
format Online
Article
Text
id pubmed-3500520
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-3500520 2012-11-19 Database (Oxford) BioCreative Virtual Issue Oxford University Press 2012-11-15 /pmc/articles/PMC3500520/ /pubmed/23160414 http://dx.doi.org/10.1093/database/bas041 Text en Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
title Accelerating literature curation with text-mining tools: a case study of using PubTator to curate genes in PubMed abstracts
topic BioCreative Virtual Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3500520/
https://www.ncbi.nlm.nih.gov/pubmed/23160414
http://dx.doi.org/10.1093/database/bas041