Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges

Text mining in the biomedical sciences is rapidly transitioning from small-scale evaluation to large-scale application. In this article, we argue that text-mining technologies have become essential tools in real-world biomedical research. We describe four large-scale applications of text mining, as...

Bibliographic Details
Main Authors: Singhal, Ayush, Leaman, Robert, Catlett, Natalie, Lemberger, Thomas, McEntyre, Johanna, Polson, Shawn, Xenarios, Ioannis, Arighi, Cecilia, Lu, Zhiyong
Format: Online Article Text
Language: English
Published: Oxford University Press 2016
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5199160/
https://www.ncbi.nlm.nih.gov/pubmed/28025348
http://dx.doi.org/10.1093/database/baw161
_version_ 1782488958764056576
author Singhal, Ayush
Leaman, Robert
Catlett, Natalie
Lemberger, Thomas
McEntyre, Johanna
Polson, Shawn
Xenarios, Ioannis
Arighi, Cecilia
Lu, Zhiyong
author_sort Singhal, Ayush
collection PubMed
description Text mining in the biomedical sciences is rapidly transitioning from small-scale evaluation to large-scale application. In this article, we argue that text-mining technologies have become essential tools in real-world biomedical research. We describe four large-scale applications of text mining, as showcased during a recent panel discussion at the BioCreative V Challenge Workshop. We draw on these applications as case studies to characterize common requirements for successfully applying text-mining techniques to practical biocuration needs. We note that system ‘accuracy’ remains a challenge and identify several additional common difficulties and potential research directions including (i) the ‘scalability’ issue due to the increasing need of mining information from millions of full-text articles, (ii) the ‘interoperability’ issue of integrating various text-mining systems into existing curation workflows and (iii) the ‘reusability’ issue on the difficulty of applying trained systems to text genres that are not seen previously during development. We then describe related efforts within the text-mining community, with a special focus on the BioCreative series of challenge workshops. We believe that focusing on the near-term challenges identified in this work will amplify the opportunities afforded by the continued adoption of text-mining tools. Finally, in order to sustain the curation ecosystem and have text-mining systems adopted for practical benefits, we call for increased collaboration between text-mining researchers and various stakeholders, including researchers, publishers and biocurators.
format Online
Article
Text
id pubmed-5199160
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-5199160 2017-01-06 Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges Singhal, Ayush Leaman, Robert Catlett, Natalie Lemberger, Thomas McEntyre, Johanna Polson, Shawn Xenarios, Ioannis Arighi, Cecilia Lu, Zhiyong Database (Oxford) Perspective
Oxford University Press 2016-12-26 /pmc/articles/PMC5199160/ /pubmed/28025348 http://dx.doi.org/10.1093/database/baw161 Text en Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the US.
title Pressing needs of biomedical text mining in biocuration and beyond: opportunities and challenges
topic Perspective
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5199160/
https://www.ncbi.nlm.nih.gov/pubmed/28025348
http://dx.doi.org/10.1093/database/baw161